LLM Models
376 models
| # | Model ↕ | Provider ↕ | Score ↕? | Context ↕? | Speed ↕? | MMLU ↕? | GPQA ↕? | HumanEval ↕? | SWE-B ↕? | Input $/1M ↕? | Output $/1M ↕? | ⭐ Top ↓? | OSS ↕? | Released ↕? |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
351 | Alibaba | 67 | 32K | - | 89.0 | 59.0 | 87.0 | 37.0 | $2.40 | $9.60 | - | - | - | |
352 | Naver | 66 | 4K | - | 79.0 | - | 55.0 | - | $12.00 | $24.00 | - | - | - | |
353 | Yandex | 65 | 32K | - | 69.0 | - | 62.0 | - | $2.00 | $4.00 | - | - | - | |
354 | Meta | 64 | 128K | 60 | 88.6 | 50.7 | 89.0 | 34.1 | - | - | - | ✓ OSS | - | |
355 | Sber | 64 | 32K | - | 68.0 | - | 60.0 | - | $4.00 | $8.00 | - | - | - | |
356 | Mistral AI | 62 | 128K | 90 | 84.0 | 47.2 | 92.1 | 32.6 | $2.00 | $6.00 | - | - | - | |
357 | Alibaba/Qwen | 60 | 128K | 80 | 86.1 | 49.0 | 86.5 | 23.7 | - | - | - | ✓ OSS | - | |
358 | Alibaba/Qwen | 59 | 128K | 90 | 80.0 | 42.0 | 92.3 | 30.0 | - | - | - | ✓ OSS | - | |
359 | 59 | 2M | 100 | 85.9 | 46.2 | 84.1 | 26.9 | $1.25 | $5.00 | - | - | - | ||
360 | 57 | 1M | 280 | 83.0 | 45.0 | 85.0 | 22.0 | $0.100 | $0.400 | - | - | - | ||
361 | Meta | 56 | 128K | 150 | 83.6 | 46.7 | 80.5 | 21.8 | - | - | - | ✓ OSS | - | |
362 | 51 | 1M | 250 | 78.9 | 37.0 | 78.9 | 16.2 | $0.075 | $0.300 | - | - | - | ||
363 | 49 | 8K | 120 | 75.2 | 38.4 | 74.0 | 14.7 | $0.650 | $0.650 | - | ✓ OSS | - | ||
364 | Cohere | 48 | 128K | 85 | 80.4 | 30.1 | 74.2 | 16.8 | $2.50 | $10.00 | - | - | - | |
365 | Meta | 37 | 128K | 500 | 63.4 | 24.7 | 58.3 | 9.5 | - | - | - | ✓ OSS | - | |
366 | Baidu | - | 128K | - | - | - | - | - | $0.100 | $0.100 | - | - | - | |
367 | Alibaba | - | 1M | - | - | - | - | - | $0.200 | $0.600 | - | - | - | |
368 | Alibaba | - | 131K | - | - | - | - | - | $0.090 | $0.450 | - | - | - | |
369 | Baidu | - | 66K | - | - | - | - | - | $0.680 | $2.81 | - | - | - | |
370 | ByteDance | - | 32K | - | - | - | - | - | $0.300 | $0.900 | - | - | - | |
371 | Zhipu AI | - | 128K | - | - | - | - | - | $0.000 | $0.000 | - | - | - | |
372 | Moonshot AI | - | 8K | - | - | - | - | - | $2.00 | $6.00 | - | - | - | |
373 | Tencent | - | 256K | - | - | - | - | - | $0.000 | $0.000 | - | - | - | |
374 | DeepSeek | - | 164K | - | - | - | - | - | $0.287 | $0.431 | - | - | - | |
375 | Baidu | - | 131K | - | - | - | - | - | - | - | - | - | - | |
376 | Sber | - | 8K | - | - | - | - | - | $0.800 | $1.60 | - | - | - |
Score = MMLU×20% + GPQA×30% + HumanEval×25% + SWE-Bench×25%⭐ Top - average TECHAGENT user ratingtok/s - output tokens/sec via API
Don't see a model you know? Sign in to suggest it.

