diff --git a/README.md b/README.md
index 1f5cf8fd..5f6b94dc 100644
--- a/README.md
+++ b/README.md
@@ -73,6 +73,19 @@ Here are some example models that can be downloaded:
 > [!NOTE]
 > You should have at least 8 GB of RAM available to run the 7B models, 16 GB to run the 13B models, and 32 GB to run the 33B models.
 
+### Quantization Level
+
+Model quantization is a technique used to reduce the size of large neural networks with an acceptable reduction in capability and accuracy.
+
+By default, Ollama uses 4-bit quantization. To try other quantization levels, use the tags that specify them:
+
+```
+ollama run llama2:70b-text-q2_K
+ollama run llama2:70b-text-q8_0
+```
+
+The higher the number, the more accurate the model is, but the slower it runs and the more memory it requires.
+
 ## Customize a model
 
 ### Import from GGUF
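
To see the size/accuracy trade-off from the added section concretely, here is a minimal sketch that pulls both quantization variants and compares their on-disk sizes. It assumes the `llama2:70b-text-q2_K` and `llama2:70b-text-q8_0` tags referenced in the diff are published in the Ollama registry; the relative sizes are indicative, not exact.

```
# Pull the same base model at two quantization levels.
ollama pull llama2:70b-text-q2_K
ollama pull llama2:70b-text-q8_0

# List local models; the SIZE column should show the q8_0 variant
# several times larger than q2_K, mirroring the trade-off above.
ollama list
```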
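
A rough way to reason about the memory claim: a model's weight footprint is approximately parameters × bits-per-weight ÷ 8 bytes. The figures below are back-of-envelope estimates only (k-quants such as q2_K actually use a mixed, slightly higher effective bit width, and runtime overhead such as the KV cache is ignored):

```
# ~70B parameters at 2 bits/weight: 70 * 2 / 8 ≈ 17 GB of weights
echo "q2 estimate: $((70 * 2 / 8)) GB"
# ~70B parameters at 8 bits/weight: 70 * 8 / 8 = 70 GB of weights
echo "q8 estimate: $((70 * 8 / 8)) GB"
```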