Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Larger models available or planned? #33

Open
jaggzh opened this issue Jul 3, 2024 · 8 comments
Open

Larger models available or planned? #33

jaggzh opened this issue Jul 3, 2024 · 8 comments

Comments

@jaggzh
Copy link

jaggzh commented Jul 3, 2024

Anything larger than 2.7B cooking? I'm itching to test the larger capacity and its scaling against the larger small LLMs of comparable size (or comparable resource use).

@ridgerchu
Copy link
Owner

Hi, sorry that until now we still do not have enough resources to scale up our model... Thanks for your interest!

@Ovid
Copy link

Ovid commented Jul 29, 2024

@ridgerchu Has there been any movement here that you can publicly discuss? This work seems significant enough in terms of environmental impact that it seems like many companies would be very interested in pushing this forward.

@ridgerchu
Copy link
Owner

ridgerchu commented Jul 30, 2024

@Ovid Unfortunately, as of now, we haven't been able to secure a feasible sponsor that would allow us to scale this work😢 Despite the potential environmental impact and efficiency gains, finding the right support to move this research forward has been challenging. We're continuing to explore opportunities and remain hopeful about the future prospects of this technology...

@Melaron
Copy link

Melaron commented Aug 15, 2024

@ridgerchu, that is surprising to hear. When I read the article, I thought that there would be a rush to utilize this — it seems quiet so far. I hope you can find a good sponsor soon.

@ridgerchu
Copy link
Owner

ridgerchu commented Aug 15, 2024

Thank you all for your interest! We have found some interesting companies or organizations that may fund us to scale up. Although negotiations are still ongoing, we're hopeful that we can develop at least a Mistral 7B level model.

@Ovid
Copy link

Ovid commented Jan 1, 2025

@ridgerchu Happy New Year!

Any news on funding you can share?

@ridgerchu
Copy link
Owner

Happy New Year @Ovid !

Actually we are starting trying to scale up the model, we first step may start with a medium-size model with near-SOTA performance, if it successfully scale we will also open source it!

@impredicative
Copy link

impredicative commented Jan 6, 2025

Will this matmulfreellm preserve efficiency gains on top of bitnet, e.g. https://github.com/microsoft/BitNet and https://github.com/kyegomez/BitNet ? If it does, then someone like Sandia could perhaps help with funding.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants