Pull requests: allenai/OLMo

Pull requests list

Updated HF Conversion Script for Public Use
#826 opened Apr 10, 2025 by aman-17
hf_olmo needs datasets dependency
#822 opened Apr 7, 2025 by IanMagnusson
Fix example usage documentation
#819 opened Apr 3, 2025 by ved1beta
Updated memmap_dtype to automatically choose based on vocab size (label: type/bug)
#795 opened Feb 12, 2025 by aman-17
Decoupled Momentum Optimization
#771 opened Dec 24, 2024 by peter-sk
Adds support for converting from safetensors
#740 opened Oct 23, 2024 by soldni
Create an eval-only script for existing ckpts
#736 opened Oct 20, 2024 by liujch1998
Add regression tests for training
#730 opened Oct 7, 2024 by 2015aroras
Docs model ladder
#708 opened Aug 19, 2024 by IanMagnusson Draft
Add OLMoE checkpoints and run config
#707 opened Aug 19, 2024 by 2015aroras
DNM: Loss issue checkpoint with refine1b setups
#682 opened Jul 31, 2024 by undfined
[wip] Kylel/readme
#681 opened Jul 31, 2024 by kyleclo Draft
Ladder 1xC
#677 opened Jul 27, 2024 by AkshitaB
Alternative evals
#675 opened Jul 23, 2024 by AkshitaB
1 task done
MoE
#639 opened Jun 30, 2024 by Muennighoff
muP implementation
#637 opened Jun 28, 2024 by AkshitaB
Unit tests
#635 opened Jun 26, 2024 by AkshitaB
Config for Amberish experiments at 1B
#621 opened Jun 12, 2024 by drschwenk
Normal baselines
#618 opened Jun 12, 2024 by AkshitaB
added git ref to the config keys
#617 opened Jun 11, 2024 by drschwenk