TECHAGENT - MY AI LIFE
LLM Models

R1 Distill Qwen 32B

Open Source

DeepSeek

DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). It outperforms OpenAI's o1-mini across various benchmarks, achieving new...

openrouter.ai ↗
📖Context128K tokens
🚀Speed-
💵Input price$0.290/1M
💸Output price$0.290/1M
🧠Parameters32.764B

Benchmarks

Gray bar = dataset average
Composite ScoreScore = MMLU×20% + GPQA×30% + HumanEval×25% + SWE-Bench×25%-

HF Open LLM Leaderboard

IFEval · BBH · MATH · GPQA · MuSR · MMLU-Pro

Average
23.0
View on HF →
IFEvalInstruction following
41.9%avg 79.9
BBHBig Bench Hard
17.1%avg 48.7
MATH Lvl 5Competition mathematics
17.1%avg 37.8
GPQAGraduate-level science Q&A
4.6%avg 12.9
MuSRMultistep soft reasoning
16.1%avg 10.7
MMLU-ProProfessional knowledge
41.0%avg 40.0

Details

Provider
DeepSeek
Parameters
32.764B
Context
128K tokens
Speed
-
Open Source
Yes
License
Open (varies)
Released
Jan 2025

Reviews

No reviews yet

Log in to rate