LLM Models
26 models
| # | Model | Provider | Score | Context | Speed (tok/s) | MMLU | GPQA | HumanEval | SWE-Bench | Input $/1M | Output $/1M | ⭐ Top | OSS | Released |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | | Microsoft | - | 131K | - | - | - | - | - | $0.080 | $0.350 | - | ✓ OSS | Oct 2025 |
| 2 | | DeepSeek | - | 128K | - | - | - | - | - | $0.290 | $0.290 | - | ✓ OSS | Jan 2025 |
| 3 | | DeepSeek | - | 131K | - | - | - | - | - | $0.800 | $0.800 | - | ✓ OSS | Jan 2025 |
| 4 | | DeepSeek | 75 | 64K | 30 | 90.8 | 71.5 | 92.6 | 49.2 | $0.700 | $2.50 | - | ✓ OSS | Jan 2025 |
| 5 | | Microsoft | - | 16K | - | - | - | - | - | $0.065 | $0.140 | - | ✓ OSS | Jan 2025 |
| 6 | | Meta | - | 131K | - | - | - | - | - | $0.100 | $0.320 | - | ✓ OSS | Dec 2024 |
| 7 | | Alibaba/Qwen | - | 128K | - | - | - | - | - | $0.660 | $1.00 | - | ✓ OSS | Nov 2024 |
| 8 | | Alibaba/Qwen | - | 131K | - | - | - | - | - | $0.040 | $0.100 | - | ✓ OSS | Oct 2024 |
| 9 | | Meta | - | 131K | - | - | - | - | - | $0.027 | $0.201 | - | ✓ OSS | Sep 2024 |
| 10 | | Meta | - | 131K | - | - | - | - | - | $0.051 | $0.335 | - | ✓ OSS | Sep 2024 |
| 11 | | Alibaba/Qwen | - | 131K | - | - | - | - | - | $0.360 | $0.400 | - | ✓ OSS | Sep 2024 |
| 12 | | Nousresearch | - | 131K | - | - | - | - | - | $0.700 | $0.700 | - | ✓ OSS | Aug 2024 |
| 13 | | Meta | - | 131K | - | - | - | - | - | $0.020 | $0.030 | - | ✓ OSS | Jul 2024 |
| 14 | | Meta | - | 131K | - | - | - | - | - | $0.400 | $0.400 | - | ✓ OSS | Jul 2024 |
| 15 | | Mistral AI | 45 | 128K | 180 | 68.0 | 32.0 | 73.4 | 15.0 | $0.020 | $0.030 | - | ✓ OSS | Jul 2024 |
| 16 | | Nousresearch | - | 8K | - | - | - | - | - | $0.140 | $0.140 | - | ✓ OSS | May 2024 |
| 17 | | Meta | - | 8K | - | - | - | - | - | $0.140 | $0.140 | - | ✓ OSS | Apr 2024 |
| 18 | | Microsoft | - | 66K | - | - | - | - | - | $0.620 | $0.620 | - | ✓ OSS | Apr 2024 |
| 19 | | Mistral AI | - | 4K | - | - | - | - | - | $0.110 | $0.190 | - | ✓ OSS | Sep 2023 |
| 20 | | DeepSeek | 68 | 64K | 45 | 88.5 | 59.1 | 89.4 | 42.0 | $0.200 | $0.800 | - | ✓ OSS | - |
| 21 | | Meta | 64 | 128K | 60 | 88.6 | 50.7 | 89.0 | 34.1 | - | - | - | ✓ OSS | - |
| 22 | | Alibaba/Qwen | 60 | 128K | 80 | 86.1 | 49.0 | 86.5 | 23.7 | - | - | - | ✓ OSS | - |
| 23 | | Alibaba/Qwen | 59 | 128K | 90 | 80.0 | 42.0 | 92.3 | 30.0 | - | - | - | ✓ OSS | - |
| 24 | | Meta | 56 | 128K | 150 | 83.6 | 46.7 | 80.5 | 21.8 | - | - | - | ✓ OSS | - |
| 25 | | | 49 | 8K | 120 | 75.2 | 38.4 | 74.0 | 14.7 | $0.650 | $0.650 | - | ✓ OSS | - |
| 26 | | Meta | 37 | 128K | 500 | 63.4 | 24.7 | 58.3 | 9.5 | - | - | - | ✓ OSS | - |
Score = MMLU×20% + GPQA×30% + HumanEval×25% + SWE-Bench×25%
⭐ Top - average TECHAGENT user rating
tok/s - output tokens/sec via API
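
The Score formula in the legend can be checked against a table row with a short sketch. The weights come straight from the legend; the function name `composite_score`, the `WEIGHTS` dictionary, and rounding to the nearest integer are assumptions made for illustration, not the site's documented method.

```python
# Weights as stated in the legend: MMLU 20%, GPQA 30%, HumanEval 25%, SWE-Bench 25%.
WEIGHTS = {"mmlu": 0.20, "gpqa": 0.30, "humaneval": 0.25, "swe_bench": 0.25}


def composite_score(mmlu: float, gpqa: float, humaneval: float, swe_bench: float) -> float:
    """Weighted sum of the four benchmark percentages (hypothetical helper)."""
    return (
        WEIGHTS["mmlu"] * mmlu
        + WEIGHTS["gpqa"] * gpqa
        + WEIGHTS["humaneval"] * humaneval
        + WEIGHTS["swe_bench"] * swe_bench
    )


# Benchmark values from row 4 of the table (DeepSeek, listed Score 75).
raw = composite_score(mmlu=90.8, gpqa=71.5, humaneval=92.6, swe_bench=49.2)

# Assumption: the displayed Score is the weighted sum rounded to an integer.
print(round(raw))  # prints 75
```

Rows without benchmark values (shown as `-`) have no Score, so a real implementation would need to skip or flag them rather than treat the dash as zero.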

