The prompt will be "Solve this. Explain your answer" with an RPM attached as an image. For example:
The AI must be able to solve 8 out of 10 puzzles of my choosing. I will only choose puzzles that I can solve.
If there's a consensus that chatbots can/can't do this I may not bother doing the test myself.
As of market creation the best commercially available LLMs fail embarrassingly:
Chatgpt 4o:
https://www.perplexity.ai/search/solve-this-explain-your-answer-ohIVE8CaQ3OIu9ODzaEXcg
Claude Sonnet 1.5:
https://www.perplexity.ai/search/solve-this-explain-your-answer-Ay.Kpyc9Tfm6uKBT3KUylQ
Rules:
- Must be an general purpose ai, can't be something made specifically to solve certain kinds of problems.
- I will not bet.
@Shai I will note they have no problem "seeing" what's in the image. They can describe any shape when asked.