“The king is dead”—Claude 3 surpasses GPT-4 on Chatbot Arena for the first time

Anthropic’s Claude 3 Opus large language model (LLM) has surpassed OpenAI’s GPT-4 on Chatbot Arena for the first time, a notable moment in the AI language model space. The result has drawn attention on social media, with software developer Nick Dobos tweeting “RIP GPT-4.”

Chatbot Arena, run by the Large Model Systems Organization (LMSYS ORG), is a platform where users rate the outputs of two unlabeled LLMs. Those ratings are aggregated to rank the models and populate the leaderboard. This matters to researchers, who struggle to measure the performance of AI chatbots because their outputs vary.

The rise of Claude 3 has led some users to replace ChatGPT in their daily workflow, which could affect ChatGPT’s market share. Google’s Gemini Advanced is also gaining traction in the AI assistant space, adding to the competition for OpenAI. Meanwhile, OpenAI is preparing to release a major successor to GPT-4 Turbo, possibly named GPT-4.5 or GPT-5, so the competition, and the shakeups on the Chatbot Arena leaderboard, look set to continue.
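
To make the idea of ranking models from head-to-head user ratings concrete, here is a minimal sketch of one common approach, an Elo-style rating update. The article does not say which method Chatbot Arena uses, so treat this as an illustration under assumptions rather than a description of the actual leaderboard: the model names, the starting rating of 1000, and the update factor `K = 32` are all hypothetical.

```python
from collections import defaultdict

# Assumed constants for illustration only; not taken from the article.
K = 32            # how strongly a single vote moves a rating
INITIAL = 1000.0  # rating given to a model before any votes


def expected_score(rating_a, rating_b):
    """Probability that A beats B under the standard Elo logistic model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_ratings(ratings, model_a, model_b, outcome):
    """Apply one vote. outcome: 1.0 if A wins, 0.0 if B wins, 0.5 for a tie."""
    delta = K * (outcome - expected_score(ratings[model_a], ratings[model_b]))
    ratings[model_a] += delta
    ratings[model_b] -= delta


def leaderboard(votes):
    """Process votes in order and return (model, rating) pairs, best first."""
    ratings = defaultdict(lambda: INITIAL)
    for model_a, model_b, outcome in votes:
        update_ratings(ratings, model_a, model_b, outcome)
    return sorted(ratings.items(), key=lambda item: item[1], reverse=True)


# Hypothetical votes: (model shown as A, model shown as B, outcome for A).
votes = [
    ("model-x", "model-y", 1.0),
    ("model-y", "model-z", 0.5),
    ("model-x", "model-z", 1.0),
]

for name, rating in leaderboard(votes):
    print(f"{name}: {rating:.1f}")
```

Because each vote adds to one model exactly what it subtracts from the other, the total rating stays constant, and a win over a higher-rated model moves the ratings more than a win over a lower-rated one. Results from a sequential update like this depend on the order of the votes, which is one reason a real leaderboard might prefer a different aggregation method.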

