The leaderboard “you can't game,” funded by the companies it ranks

AI models are multiplying fast, and the competition is fierce. With so many players in the field, which one is the best, and who gets to decide? Arena, formerly LM Arena, has emerged as the de facto public leaderboard for frontier LLMs, influencing funding, launches, and PR cycles. In just seven months, the startup went from a UC Berkeley PhD research project to a valuation estimated at $1.7 billion.

Watch as Equity host Rebecca Bellan sits down with Arena co-founders Anastasios Angelopoulos and Wei-Lin Chiang to discuss how their platform became the leading leaderboard for frontier AI models, and how they are trying to build a neutral benchmark despite taking funding from companies like OpenAI, Google, and Anthropic.

They explain how Arena works and why it is harder to game than static benchmarks, what “structural neutrality” really means, why Claude currently tops the expert leaderboards for legal and medical use cases, and how the company is expanding beyond chat to benchmark agents, coding, and real-world tasks with a new enterprise product.

Subscribe to Equity on YouTube, Apple Podcasts, Overcast, Spotify, and all the casts. You can also follow Equity on X and Threads, at @EquityPod.
