Commit
Merge pull request #823 from jxmai/feature/#789

Fix #789: Update README with instructions for running the quantized L…
PromtEngineer authored Sep 20, 2024
2 parents a1dea3b + b4322d4 commit b654a59
Showing 1 changed file with 1 addition and 0 deletions: README.md
@@ -71,6 +71,7 @@ pip install -r requirements.txt

LocalGPT uses [LlamaCpp-Python](https://github.com/abetlen/llama-cpp-python) for GGML (you will need llama-cpp-python <=0.1.76) and GGUF (llama-cpp-python >=0.1.83) models.

To run the quantized Llama3 model, ensure you have llama-cpp-python version 0.2.62 or higher installed.

If you want to use BLAS or Metal with [llama-cpp](https://github.com/abetlen/llama-cpp-python#installation-with-openblas--cublas--clblast--metal) you can set appropriate flags:
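As a rough sketch, the installs described in this hunk might look like the following. The exact `CMAKE_ARGS` flag names come from the llama-cpp-python installation docs and have changed between releases, so treat them as assumptions to verify against the version you install:

```shell
# GGUF models need llama-cpp-python >= 0.1.83; the quantized
# Llama3 model described above needs >= 0.2.62.
pip install "llama-cpp-python>=0.2.62"

# Optional: rebuild with OpenBLAS acceleration (flag names
# may differ in newer releases, e.g. GGML_* instead of LLAMA_*).
CMAKE_ARGS="-DLLAMA_BLAS=ON -DLLAMA_BLAS_VENDOR=OpenBLAS" \
  pip install --force-reinstall --no-cache-dir llama-cpp-python

# Optional: rebuild with Metal acceleration on Apple Silicon.
CMAKE_ARGS="-DLLAMA_METAL=on" \
  pip install --force-reinstall --no-cache-dir llama-cpp-python
```

Forcing a reinstall with `--no-cache-dir` ensures pip rebuilds the wheel with the new flags instead of reusing a cached CPU-only build.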


0 comments on commit b654a59
