ChatGPT 4 Becomes The 2nd Best AI Chatbot for The First Time

In a groundbreaking development, Anthropic’s cutting-edge artificial intelligence model, Claude 3 Opus, has seized the coveted position at the helm of the Chatbot Arena leaderboard. This triumph marks a significant shift in the landscape, relegating OpenAI’s GPT-4 to the runner-up position for the first time since its inception last year.

Diverging from traditional methods of benchmarking AI models, the LMSYS Chatbot Arena adopts a unique approach, emphasizing human judgment. Participants are tasked with assessing and ranking the responses generated by two distinct models when presented with identical prompts in a blind test.

This benchmark has been ruled by OpenAI’s GPT 4 for so long that any other AI model that comes close is named “GPT 4 class”, which is why this is such a noteworthy achievement for Claude 3.

ALSO READ

OpenAI “Accidentally” Leaks ChatGPT’s Next Major Upgrade

Claude 3 Opus, the biggest model in the Claude 3 family, took the top spot in the leaderboard with over 70,000 new votes. The best part is that even the smaller Claude 3 models performed well. Claude 3 Haiku is the smallest model in the series, meant to run on consumer devices similar to Google’s Gemini Nano. It is achieving impressive results without being significantly large like GPT 4 or Claude Opus.

All three Claude models managed to take the top 10 rankings in these benchmarks. Opus was at the top, Sonnet took the fourth spot alongside Gemini Pro, and Haiku was at sixth with an older version of GPT 4.

📢 For the latest Tech & Telecom news, videos and analysis join ProPakistani's WhatsApp Group now!

Follow ProPakistani on Google News & scroll through your favourite content faster!

ChatGPT 4 Becomes The 2nd Best AI Chatbot for The First Time

Aasil Ahmed

Latest News

PM’s Dissatisfaction Over Performance Sees Unprecedented Reshuffle at…

McKinsey & Company’s Proposal for FBR’s Digitization Approved

Glassware Importers to Pay Revised Rates of Duties and Taxes

FBR’s Data Protection Efforts Commended by OECD Assessment Team

SECP Introduces Swift Complaint Resolution Platform SECP-XS

Now Trending

lens

Spotify Launches ‘Your K-Pop Persona’ to Celebrate the K-Pop Fandom

Timothée Chalamet Rocks the ’60s Look as Bo…

Fahad Mustafa Makes a Cheeky Joke About Shoaib Ma…

Sami Khan and Sonya Hussyn Set to Sizzle On-Scree…

Viral Indian Street Vendor Caught Mixing His Bodi…

Junaid Khan Denies Collaboration with Khushi Kapo…

perspective

A Love Letter to Pakistan: A Foreign CEO Reflects on 5 Years

NFTs: The Next Big Thing to Redefine Proof of Own…

What are NFTs and Why are they the Future?

Reassessing the cost of the crisis, while busines…

5 Ultimate Rules of Entrepreneurship from a VC’s …

Arvelon Co-Founder Speaks on The “Fair̶…

ProPakistani Community