Release 3.0.0 #89
Merged
- added 3 spaces. A very small PR, but adds readability
- Support for larger models at a future block.
- Update README.md
- Eval loop adjustments.
- Add a new tokenizer for 7B.
ghost approved these changes on Mar 28, 2024.
This release contains the code that prepares for the move to 7B parameters, concentrates rewards on fewer models, and improves the speed at which validators pick up new best models.
In addition to the previously announced change from 8k to 4k sequence length, we have also adjusted the future tokenizer from gpt3_5 to gpt4. To compensate, the block at which these changes take effect has been moved out one week, to April 15, 2024 at ~8:00 AM, at block 2,786,061.
To reiterate, the final set of changes that will occur at that block is:
- The parameter limit will be raised to 6.9 billion.
- The size limit for the model's Hugging Face repo will be raised to 15 gigabytes.
- **New:** The tokenizer used for evaluation will become https://huggingface.co/Xenova/gpt-4.
- **New:** The sequence length used for inference will be 4096.
- When loading the pretrained model for inference, the torch_dtype will be bfloat16 and the attn_implementation will be flash_attention_2 (see the loading sketch after this list).
- **New:** The allowed model types have been adjusted to include new model types (Phi and Gemma) and to remove those that do not support flash attention.
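For concreteness, here is a minimal sketch of what loading a model under these settings might look like with the transformers library. The repo name my-org/my-6.9b-model is a hypothetical placeholder, and the exact loading and evaluation code used by validators may differ:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Evaluation tokenizer named above (Hugging Face port of the GPT-4 tokenizer).
tokenizer = AutoTokenizer.from_pretrained("Xenova/gpt-4")

model = AutoModelForCausalLM.from_pretrained(
    "my-org/my-6.9b-model",                   # hypothetical placeholder repo
    torch_dtype=torch.bfloat16,               # dtype listed above
    attn_implementation="flash_attention_2",  # requires the flash-attn package and a supported GPU
).to("cuda")

# Sequence length listed above: inputs are truncated to 4096 tokens.
batch = tokenizer(
    "example text",
    truncation=True,
    max_length=4096,
    return_tensors="pt",
).to(model.device)

with torch.inference_mode():
    loss = model(**batch, labels=batch["input_ids"]).loss
```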
Validators: You should upgrade immediately to align your weight distributions to the new model. Additionally, you may need to upgrade your machine by April 15, 2024 to support the following requirement changes (a rough hardware-check sketch follows this list):
- You must have a GPU with at least 48 gigabytes of memory that can sustain at least 38 TFLOPS of half-precision (bfloat16) throughput.
- You must have a GPU that supports flash attention 2 and bfloat16: https://github.com/Dao-AILab/flash-attention?tab=readme-ov-file#installation-and-features
- **New:** You must have at least 1 TB of disk space.
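As a rough pre-flight check, something like the following could be run on a validator machine. This is a sketch only: the cache path "/" is a placeholder, and the 38 TFLOPS requirement cannot be verified directly here, so compute capability >= 8.0 is used as a proxy for bfloat16 and flash-attention-2 support:

```python
import shutil
import torch

def check_validator_hardware(min_gpu_gb=48, min_disk_tb=1.0, cache_path="/"):
    """Rough check against the requirements listed above (assumes a single-GPU machine)."""
    props = torch.cuda.get_device_properties(0)

    # GPU memory requirement.
    gpu_gb = props.total_memory / 1e9
    assert gpu_gb >= min_gpu_gb, f"Need >= {min_gpu_gb} GB GPU memory, found {gpu_gb:.1f} GB"

    # bfloat16 and flash-attention-2 support (compute capability >= 8.0, i.e. Ampere or newer).
    assert torch.cuda.is_bf16_supported(), "GPU must support bfloat16"
    assert props.major >= 8, "flash-attention 2 requires compute capability >= 8.0"

    # Disk space requirement on the model cache path.
    free_tb = shutil.disk_usage(cache_path).free / 1e12
    assert free_tb >= min_disk_tb, f"Need >= {min_disk_tb} TB free disk, found {free_tb:.2f} TB"

check_validator_hardware()
```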