list of open-source publicly-available reasoning llms #239

Open
andre15silva opened this issue Feb 17, 2025 · 3 comments

andre15silva commented Feb 17, 2025

DeepSeek-R1
https://huggingface.co/deepseek-ai/DeepSeek-R1

QwQ
Qwen's reasoning model with 32B parameters

@monperrus
Contributor

Arcee-Maestro-7B
RL-trained reasoning model based on DeepSeek-R1-Distill-Qwen-7B, with further GRPO training for reasoning, math, and coding
https://huggingface.co/arcee-ai/Arcee-Maestro-7B-Preview

@monperrus
Contributor

OpenThinker-32B
Reasoning model fine-tuned from Qwen/Qwen2.5-32B-Instruct on OpenThoughts-114k, a dataset distilled from DeepSeek-R1
https://huggingface.co/open-thoughts/OpenThinker-32B

@monperrus
Contributor

Sky-T1
UC Berkeley's reasoning model with 32B parameters
https://huggingface.co/NovaSky-AI/Sky-T1-32B-Preview
