Learning to Reason with LLMs

Sep 13, 2024

Learning to Reason with LLMs

Posted by Dan Breeden in categories: mathematics, robotics/AI

Some big claims here: https://openai.com/index/learning-to-reason-with-llms/

OpenAI o1 ranks in the 89th percentile on competitive programming questions (Codeforces), places among the top 500 students in the US in a qualifier for the USA Math Olympiad (AIME), and exceeds human PhD-level accuracy on…

We are introducing OpenAI o1, a new large language model trained with reinforcement learning to perform complex reasoning. o1 thinks before it answers—it can produce a long internal chain of thought before responding to the user.

0 comments