-
Notifications
You must be signed in to change notification settings - Fork 592
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RuntimeError: failed to load voice "ja" #323
Comments
Hi, the MaskGCT is built in a Linux environment. For a better coding experience, it is recommended that Linux be used to reproduce. For people who are having problems configuring the env on a Windows machine, you can try to follow this blog post: https://www.cnblogs.com/v3ucn/p/18511187 |
@zachysaur I had the same issue on Windows. problem solved by: After that if you got character unicode error, just turn on UTF-8 (BETA) language for non-unicode programs in regional and language settings If you get mbrola dlls error, put those two files from zip to: C:\Program Files (x86)\eSpeak\command_line It should now work. |
still same error even after following everything on this blog 2024-11-03 08:38:00.9208389 [W:onnxruntime:Default, onnxruntime_pybind_state.cc:993 onnxruntime::python::CreateExecutionProviderInstance] Failed to create CUDAExecutionProvider. Require cuDNN 9.* and CUDA 12.*, and the latest MSVC runtime. Please install all dependencies as mentioned in the GPU requirements page (https://onnxruntime.ai/docs/execution-providers/CUDA-ExecutionProvider.html#requirements), make sure they're in the PATH, and that your GPU is supported. |
You could try this repo. It worked for me correctly on Windows. |
How is the quality of your local generations? Did you try to recreate the demo page examples? Mine outputs lower quality than those ones. For instance, the whisper voice comes out as something between a whisper and a low voice: |
i got everything to pth i have used every possible tool made your repo got some error you need to fix them |
I am just an another user. I was asking for their experience as I am also getting lower quality results, though I somehow made it work by zelenooki87 method |
with your experience you telling someone to mess up his whole setting ad remove everything https://www.youtube.com/@socialapps1194 my youtube channel |
...Dude, were you sleepy when you were reading my comments? When did I ask you to do anything? zelenooki87 shared his method for working with this on Win 11, and I got it working, but my version outputs somehow wonky results, so I was asking for his observations. I wasn't asking you to do anything; I was just asking him. I don't know if the models are different or if this method messes something up, but I couldn't reproduce the demo page examples with the same quality. Also, FYI, you don't need to modify or remove anything. You can just try this in another environment in a separate folder if you're curious. Not that I'm asking you to, though... |
(venv) F:\maskgct\maskgct>python app.py
./models/tts/maskgct/g2p\sources\g2p_chinese_model\poly_bert_model.onnx
Error: Could not load the specified mbrola voice file.
Error: Could not load the specified mbrola voice file.
Traceback (most recent call last):
File "F:\maskgct\maskgct\app.py", line 20, in
from models.tts.maskgct.g2p.g2p_generation import g2p, chn_eng_g2p
File "F:\maskgct\maskgct\models\tts\maskgct\g2p\g2p_generation.py", line 10, in
from models.tts.maskgct.g2p.utils.g2p import phonemizer_g2p
File "F:\maskgct\maskgct\models\tts\maskgct\g2p\utils\g2p.py", line 30, in
phonemizer_ja = EspeakBackend(
File "F:\maskgct\maskgct\venv\lib\site-packages\phonemizer\backend\espeak\espeak.py", line 49, in init
self._espeak.set_voice(language)
File "F:\maskgct\maskgct\venv\lib\site-packages\phonemizer\backend\espeak\wrapper.py", line 249, in set_voice
raise RuntimeError( # pragma: nocover
RuntimeError: failed to load voice "ja"
(venv) F:\maskgct\maskgct>
The text was updated successfully, but these errors were encountered: