diff --git a/docs/README_LINUX.md b/docs/README_LINUX.md
index d8df713c4..61ea19a4b 100644
--- a/docs/README_LINUX.md
+++ b/docs/README_LINUX.md
@@ -175,7 +175,8 @@ These instructions are for Ubuntu x86_64 (other linux would be similar with diff
     * GGUF ONLY for CUDA GPU (keeping CPU package in place to support CPU + GPU at same time):
       ```bash
       pip uninstall -y llama-cpp-python-cuda
-      pip install https://github.com/jllllll/llama-cpp-python-cuBLAS-wheels/releases/download/textgen-webui/llama_cpp_python_cuda-0.2.18+cu118-cp310-cp310-manylinux_2_31_x86_64.whl
+      python -m pip install llama-cpp-python --prefer-binary --upgrade --extra-index-url=https://jllllll.github.io/llama-cpp-python-cuBLAS-wheels/AVX2/cu118
+      # pip install https://github.com/jllllll/llama-cpp-python-cuBLAS-wheels/releases/download/textgen-webui/llama_cpp_python_cuda-0.2.18+cu118-cp310-cp310-manylinux_2_31_x86_64.whl
       ```
     * GGUF ONLY for CPU-AVX (can be used with -cuda one above)
       ```bash