-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Req: Deepseek-coder-33b-instruct Prompt Template #1082
Comments
Sure done, thanks for suggestion. |
Thank you @pseudotensor I receive below error(attached text file).
HOWEVER, it worked with Llamacpp 0.2.14 using the same installation method from H2oGPT docs What do you think?! Does it make sense ? |
The HF model works for me. For GGUF you have:
This sounds like a bug in llama.cpp or llama_cpp_python in handling the file. For me I run:
and I get the same thing:
So they probably fixed something. But this issue is still open: abetlen/llama-cpp-python#840 |
I tried latest from jllllll for same version but latest 0.2.17 and it still fails in same way. Which exact link did you use? |
I have compiled it using your code with changing Llama version: pip uninstall -y llama-cpp-python |
for whatever reason that doesn't fail in same way. Instead I get OOM.
With 14 from jlll doing:
gives: So the OOM is expected on my 24GB board I guess for the 33B model, but the vocab error is odd and 14 from jjlll fixes or maybe if you recompile yourself fixes. I actually don't expect it's jllll's fault, since 14 worked for me. I'm guessing llama_cpp_python or llama.cpp teams are not not stable in their code changes. I suspect jlll is using same commands all the time. |
…sions of llama_cpp_python, see abetlen/llama-cpp-python#840
@pseudotensor llama-cpp-python 0.2.18 is working fine. Anything between that and 0.2.14 are not working. |
Thanks. .14 was messed up too, the responses were all wrong for GGUF models. .18 is back to normal, thanks. |
0.18 is bad unless one builds directly. |
It seems very promising Llm, please define its prompt template.
Website for more info:
https://deepseekcoder.github.io/
Link to huggingface:
https://huggingface.co/deepseek-ai/deepseek-coder-33b-instruct
The text was updated successfully, but these errors were encountered: