Skip to content
@PRIME-RL

PRIME-RL

Researching scalable (RL) methods on language models.

Pinned Loading

  1. PRIME PRIME Public

    Scalable RL solution for advanced reasoning of language models

    Python 1.3k 81

  2. ImplicitPRM ImplicitPRM Public

    Repo of paper "Free Process Rewards without Process Labels"

    Python 126 5

Repositories

Showing 2 of 2 repositories
  • PRIME Public

    Scalable RL solution for advanced reasoning of language models

    PRIME-RL/PRIME’s past year of commit activity
    Python 1,313 Apache-2.0 81 7 1 Updated Feb 19, 2025
  • ImplicitPRM Public

    Repo of paper "Free Process Rewards without Process Labels"

    PRIME-RL/ImplicitPRM’s past year of commit activity
    Python 126 Apache-2.0 5 9 0 Updated Jan 16, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…