Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

support for CohereForAI/c4ai-command-r-v01 #3330

Closed
xusenlinzy opened this issue Mar 12, 2024 · 7 comments
Closed

support for CohereForAI/c4ai-command-r-v01 #3330

xusenlinzy opened this issue Mar 12, 2024 · 7 comments
Labels
help wanted Extra attention is needed

Comments

@xusenlinzy
Copy link

xusenlinzy commented Mar 12, 2024

CohereForAI/c4ai-command-r-v01 is a large language model with open weights optimized for a variety of use cases including reasoning, summarization, and question answering. Command-R has the capability for multilingual generation evaluated in 10 languages and highly performant RAG capabilities.

@simon-mo simon-mo added the help wanted Extra attention is needed label Mar 12, 2024
@simon-mo
Copy link
Collaborator

Contribution welcomed! https://docs.vllm.ai/en/latest/models/adding_model.html

@sungkim11
Copy link

I have read it on Twitter that Cohere will support this effort. Are they involved?

@dekelcohen
Copy link

dekelcohen commented Mar 17, 2024

👍
Is there someone already working on this ?
I saw recent activity in transformers repo to add Cohere Command-R.

@dekelcohen
Copy link

Command-R arch is very close to Lllama2. almost identical code in HF
LayerNorm is different compared to Lllama RMSNorm

@Playerrrrr
Copy link

So does vLLM support it now or not?

@simon-mo
Copy link
Collaborator

It's supported in main branch, which you can use when building from source. It will be in next release (ETA tomorrow)

@0x77dev
Copy link

0x77dev commented Apr 6, 2024

Looks like it was released in v0.4.0 / #3433

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

6 participants