The case for fixing everything
MIT Technology Review · 3h ago · Breaking
GODOT INDEX
+0.2
Calculate your personal displacement risk. Cross-reference your profession, country, and industry against real-time benchmark progression and adoption data.
Independent security audits for Model Context Protocol servers. Permission scanner. Agent templates.
| # | Model | Score | 24h | 7d | 7d chart | Org value | Queries/day | Category |
|---|---|---|---|---|---|---|---|---|
| 1 | Claude 3.7 SonnetNEW Anthropic | 94.2 | 2.1% | 5.8% | $8.5B | 142K | Frontier | |
| 2 | GPT-4o OpenAI | 91.8 | 0.3% | 1.2% | $157B | 891K | Frontier | |
| 3 | Gemini 2.0 Ultra Google DeepMind | 89.3 | 1.7% | 4.1% | $1.8T | 324K | Frontier | |
| 4 | Grok 3 xAI | 86.1 | 4.2% | 9.3% | $24B | 67K | Frontier | |
| 5 | Llama 4 ScoutNEW Meta AI | 81.4 | 6.8% | 14.2% | $1.4T | 2.1M | Open Source | |
| 6 | Mistral Large 3 Mistral AI | 79.2 | 3.1% | 6.7% | $1.1B | 89K | Open Source | |
| 7 | o4-miniNEW OpenAI | 78.1 | 8.4% | 22.1% | $157B | 234K | Reasoning | |
| 8 | Claude 3.7 Haiku Anthropic | 76.4 | 1.1% | 3.2% | $8.5B | 412K | Frontier | |
| 9 | Gemini 2.0 Flash Google DeepMind | 74.8 | 0.4% | 0.9% | $1.8T | 1.1M | Frontier | |
| 10 | DeepSeek V3 DeepSeek | 73.1 | 0.6% | 2.8% | $8B | 98K | Open Source |