llm.rs

Migration of Karpathy's llm.c project into Rust

Development Process

The following steps were taken to migrate llm.c into Rust.

1. Utilizing c2rust

Using c2rust, train_gpt2.c from Karpathy's llm.c project was translated into Rust.
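
To give a flavor of this stage, below is a minimal sketch of the loop style c2rust tends to emit (the `scale` function and its names are hypothetical, not taken from the actual translated file): C for loops come out as while loops driven by a manual counter.

```rust
// Hypothetical example of c2rust-style output: the original C
// `for` loop becomes a `while` loop with a mutable index.
pub fn scale(out: &mut [f32], inp: &[f32], s: f32) {
    let mut i: usize = 0;
    while i < inp.len() {
        out[i] = inp[i] * s;
        i += 1;
    }
}
```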

2. Utilizing GPT-4

Although the c2rust transpilation was successful, every for loop was turned into a while loop.

Using GPT-4, we converted these while loops back into for loops.
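
Continuing the hypothetical example above, the same loop after the while-to-for rewrite:

```rust
// The same hypothetical loop after the rewrite: the manual counter
// is replaced by a range-based `for` loop.
pub fn scale(out: &mut [f32], inp: &[f32], s: f32) {
    for i in 0..inp.len() {
        out[i] = inp[i] * s;
    }
}
```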

3. Utilizing Mate

Using Mate, we then converted some of these for loops into iterator chains parallelized with the Rayon library, as sketched below.
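
A minimal sketch of what such a rewrite can look like for the hypothetical loop above, assuming rayon is declared as a dependency of the crate:

```rust
use rayon::prelude::*;

// The same hypothetical loop parallelized with Rayon: the indexed
// loop becomes a zip of parallel iterators over the two slices.
pub fn scale(out: &mut [f32], inp: &[f32], s: f32) {
    out.par_iter_mut()
        .zip(inp.par_iter())
        .for_each(|(o, &x)| *o = x * s);
}
```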

4. Manual Updates

The project is currently undergoing manual updates to find further performance improvements.

Performance

Plots were generated by the plot_gen/plot.py script. The results, namely step duration, were extracted directly from running the code from the following repositories in the same environment:

| Language | Repository | Notes |
| --- | --- | --- |
| C | https://github.com/karpathy/llm.c | Original implementation from Karpathy |
| Rust | . | |
| C++ | https://github.com/zhangpiu/llm.cpp | Fastest of the available C++ implementations |
| Mojo | https://github.com/dorjeduck/llm.mojo | |

Intel Core i7-9700 8-core

| C | Rust | C++ |
| --- | --- | --- |
| 2.374s | 1.263s | 2.202s |

[Plot: LLM Training Results on Intel Core i7-9700 8-core]

Intel Xeon E5-2690 v3 12-core

| C | Rust | C++ | Mojo |
| --- | --- | --- | --- |
| 2.110s | 2.908s | 1.037s | 6.190s |

[Plot: LLM Training Results on Intel Xeon E5-2690 v3 12-core]

Quick Start

Install the Python dependencies, output the tokenized dataset, and load the weights:

```
make setup
```

Run the training script:

```
make train
```

This runs `cargo build --release` in the llm-rs cargo project, after which the binary is copied into the main project folder.

TODO

  • Fix types to remove unnecessary casts
  • Restructure the training script for improved readability
  • Implement the latest version of the tokenizer
  • Implement the latest version of the data loader
  • Improve speed to match the performance of the C implementation
  • Migrate the testing script
  • Fix tinystories dataset download
