Getting convert-pt-to-ggml.py to work being an absolute ignorant #658
-
Humbly asking for some guidance here... I have tried to get some Icelandic audio transcribed using the compiled multilingual ggml's but the results have been a bit disappointing. I would very much like to try using an Icelandic-trained model instead and found this promising: However, the provided pytorch_model.bin should be converted into ggml, and my attempts to do so using a portable hit&run solution with https://winpython.github.io/ have failed, as I have next to no idea of how exactly to do this (if I knew my way around python, I'd probably not be trying the cpp-way in the first place). I struggle to see, where the paths should go from within my winpython-folder, Sorry to ask so stupidly, but the instructions seem to presuppose a level of python-understanding, I simply do not possess. Would anyone kindly try to lead me through these hoops, please? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 6 replies
-
Well, I don't know about winpython (I'm on Linux myself), but I can explain some things. About ~/.cache/whisper, I don't think it's needed for Huggingface models at all? As for path locations - I think you can pretty much decide them yourself.
It's a slightly simplified version but hopefully you get the idea. So, just clone the repositories somewhere and give them as parameters for the script. As you can see, I use "whisper_openai" and not "whisper_openai/whisper" here. The script will complain about not finding assets, if it's not right. One more thing, it seems the script is using a python module called transformers. So you'll need to install that somehow, if you haven't already. Good luck! |
Beta Was this translation helpful? Give feedback.
Well, I don't know about winpython (I'm on Linux myself), but I can explain some things.
First of all, for Huggingface models you'll have to use the h5 variant of the script: convert-h5-to-ggml.py.
There are some instructions in the beginning of that script, well I guess you knew that but just in case...
About ~/.cache/whisper, I don't think it's needed for Huggingface models at all?
The tilde character ~ in the path is just a shortcut, it means user's home directory ($HOME env variable).
The dot in front of directory name means that it's a hidden directory.
But since you're on Windows I guess you can just forget about those and just use the full path names.
As for path locations - I thin…