Issues · huggingface/trl

[Tracking issue] General dataset support

#2071 opened Sep 15, 2024 by qgallouedec

Open

[Tracking issue] Integrate native liger-kernel losses

#2495 opened Dec 17, 2024 by qgallouedec

Open 4

[Tracking issue] Wrong loss scaling when accumulating gradient

#2617 opened Jan 23, 2025 by qgallouedec

Open

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

180 Open 1,232 Closed

🐛 bug 🚀 deepspeed 🏋 GRPO

#2745 opened Feb 3, 2025 by 3rdAT

5 tasks done

PLZ make padding_free for DataCollatorForChatML. ✨ enhancement 🏋 GKD 🙋 help from community wanted

#2736 opened Feb 2, 2025 by YooSungHyun

✨ enhancement 🏋 GRPO

#2734 opened Feb 2, 2025 by sunildkumar

🏋 GKD ❓ question

#2732 opened Feb 2, 2025 by YooSungHyun

5 tasks done

🐛 bug 🏋 GRPO

#2731 opened Feb 2, 2025 by abacaj

5 tasks done

🐛 bug

#2719 opened Jan 31, 2025 by JohnConnor123

5 tasks done

🏋 GRPO 🏋 Reward

#2715 opened Jan 31, 2025 by korbinian-hoermann

🏋 GRPO 🏋 Reward

#2712 opened Jan 31, 2025 by accupham

3 tasks

🏋 DPO ✨ enhancement

#2710 opened Jan 31, 2025 by lucasjinreal

🐛 bug 🏋 GRPO ⚡ PEFT

#2709 opened Jan 31, 2025 by willccbb

⏳ needs more info ⚡ PEFT 🏋 PPO

#2707 opened Jan 30, 2025 by kooryan

✨ enhancement 🏋 GRPO

#2706 opened Jan 30, 2025 by nch0w

🏋 GRPO ❓ question

#2703 opened Jan 30, 2025 by arnavgarg1

5 tasks done

✨ enhancement 🏋 GRPO

#2702 opened Jan 30, 2025 by Superskyyy

✨ enhancement 🏋 GRPO

#2701 opened Jan 30, 2025 by Superskyyy

🏋 GRPO ⚡ PEFT

#2698 opened Jan 30, 2025 by gagan3012

5 tasks done

⚡accelerate ⚡ PEFT 🏋 PPO

#2696 opened Jan 30, 2025 by daehuikim

5 tasks done

ProTip! Type g i on any issue or pull request to go back to the issue listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issues: huggingface/trl

Issues list