Add int8 models to Llama2 and Llama3 #1734

james77777778 · 2024-08-05T03:42:07Z

The params value is the number reported by Model.summary(), which should be consistent with other models.
All models can run inference on a Colab T4.
You can check out the outputs of llama3_instruct_8b_en_int8 from this colab:
https://colab.research.google.com/drive/1KbUzNsY0906HFcn7FzRiBDtOtO-08msj?usp=sharing
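
For readers who want to reproduce the check outside the linked notebook, a minimal sketch might look like the following. It assumes a keras-nlp build that includes the presets from this PR and that Kaggle access to the Llama weights is already configured; the helper name `run_int8_demo` and the default prompt are hypothetical, not part of the PR.

```python
# Sketch only: assumes keras-nlp with the int8 presets from this PR is installed
# and that Kaggle access for the Llama weights is already configured.
PRESET = "llama3_instruct_8b_en_int8"  # preset name taken from this PR


def run_int8_demo(prompt="What is Keras?", max_length=64):
    """Load the int8 preset, print its summary, and generate a short reply."""
    import keras_nlp  # imported lazily so defining the function stays cheap

    model = keras_nlp.models.Llama3CausalLM.from_preset(PRESET)
    model.summary()  # the "params" metadata should match the count shown here
    return model.generate(prompt, max_length=max_length)
```

Calling `run_int8_demo()` in a Colab T4 runtime downloads the weights on first use, so the first call is slow.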

mattdangerw

Looks good! Maybe mention int8 in the description?

mattdangerw · 2024-08-05T22:11:37Z

keras_nlp/src/models/llama/llama_presets.py

@@ -25,6 +25,16 @@
        },
        "kaggle_handle": "kaggle://keras/llama2/keras/llama2_7b_en/1",
    },
+    "llama2_7b_en_int8": {
+        "metadata": {
+            "description": "LLaMA 2 7B Quantized Base model",


Maybe "LLaMA 2 7B base model with weights quantized to int8"?

james77777778 · Aug 6, 2024

I have updated the description to follow Gemma's convention.

Additionally, I used "with activation and weights quantized to int8." to indicate that we are using dynamic int8 quantization instead of weights-only quantization.

WDYT?
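
To illustrate the distinction drawn above between weights-only and dynamic int8 quantization, here is a small pure-Python sketch using abs-max scaling on a single dot product. It is an illustration under simplifying assumptions (one scale per vector, no batching), not the Keras implementation, and every function name in it is hypothetical.

```python
# Illustrative sketch only: abs-max int8 quantization in pure Python.
# This is NOT the Keras implementation; all names here are hypothetical.

def quantize_int8(values):
    """Map floats to int8 codes with a single abs-max scale."""
    amax = max(abs(v) for v in values) or 1.0  # avoid a zero scale
    scale = 127.0 / amax
    codes = [max(-127, min(127, round(v * scale))) for v in values]
    return codes, scale


def dot_weights_only(x, w_codes, w_scale):
    """Weights-only: int8 weights are dequantized, then a float dot product."""
    return sum(xi * (wi / w_scale) for xi, wi in zip(x, w_codes))


def dot_dynamic(x, w_codes, w_scale):
    """Dynamic: the activation is also quantized at call time, so the
    accumulation runs on integers and is rescaled once at the end."""
    x_codes, x_scale = quantize_int8(x)
    acc = sum(xi * wi for xi, wi in zip(x_codes, w_codes))  # integer accumulate
    return acc / (x_scale * w_scale)


weights = [0.5, -1.0, 0.25, 0.75]
activation = [1.0, 2.0, -3.0, 0.5]

w_codes, w_scale = quantize_int8(weights)
exact = sum(a * w for a, w in zip(activation, weights))
approx_weights_only = dot_weights_only(activation, w_codes, w_scale)
approx_dynamic = dot_dynamic(activation, w_codes, w_scale)
```

In the dynamic variant the activation scale is recomputed from each input, which is why the description mentions both activation and weights.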

Add int8 models

48b199e

mattdangerw reviewed Aug 5, 2024

Update llama's descriptions

61d1a5d

james77777778 force-pushed the add-int8-models branch from 78b7862 to 61d1a5d Compare August 6, 2024 01:02

james77777778 requested a review from mattdangerw August 6, 2024 01:05

mattdangerw approved these changes Aug 6, 2024

mattdangerw merged commit 9fa1237 into keras-team:master Aug 6, 2024
7 checks passed

james77777778 deleted the add-int8-models branch August 7, 2024 00:32
