Implement gpt sovits v3 #1673

Merged
merged 8 commits into from
Apr 19, 2025

Conversation

ooe1123
Contributor

@ooe1123 ooe1123 commented Mar 23, 2025

@kyakuno
Collaborator

kyakuno commented Apr 19, 2025

I have uploaded the model.
https://storage.googleapis.com/ailia-models/gpt-sovits-v3/bigvgan_model.onnx.prototxt

@kyakuno
Collaborator

kyakuno commented Apr 19, 2025

Traceback (most recent call last):
  File "/Users/kyakuno/Desktop/ailia/ailia-models-ax/audio_processing/gpt-sovits-v3/gpt-sovits-v3.py", line 855, in <module>
    main()
  File "/Users/kyakuno/Desktop/ailia/ailia-models-ax/audio_processing/gpt-sovits-v3/gpt-sovits-v3.py", line 829, in main
    vgan=vgan,
         ^^^^
UnboundLocalError: cannot access local variable 'vgan' where it is not associated with a value
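
The traceback does not show why `vgan` was unbound. One common cause of this error is a local variable that is assigned on only one branch and then read unconditionally. A minimal sketch of that pattern and one possible fix follows. The function names, the `model_version` parameter, and the placeholder string are hypothetical and are not taken from gpt-sovits-v3.py.

```python
def build_kwargs_buggy(model_version: str) -> dict:
    # `vgan` is bound only on the "v3" path (hypothetical condition).
    if model_version == "v3":
        vgan = "bigvgan-session"  # placeholder standing in for a loaded model
    # On any other path, reading `vgan` raises UnboundLocalError.
    return {"vgan": vgan}


def build_kwargs_fixed(model_version: str) -> dict:
    # Bind the name on every path before it is read.
    vgan = None
    if model_version == "v3":
        vgan = "bigvgan-session"
    return {"vgan": vgan}
```

Initializing the name up front keeps the call site unchanged; the callee then has to accept `None` when the vocoder is not loaded.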

@kyakuno
Collaborator

kyakuno commented Apr 19, 2025

In gpt-sovits v3, pitch accent is now reflected for Japanese.

        # pitch falling
        elif a1 == 0 and a2_next == a2 + 1 and a2 != f1:
            phones.append("]")
        # pitch rising
        elif a2 == 1 and a2_next == 2:
            phones.append("[")
['a', '[', 'cl', 'k', 'u', 's', 'u', 'k', 'a', 'b', 'u', 'sh', 'i', 'k', 'i', 'g', 'a', ']', 'i', 'sh', 'a', 'd', 'e', 'w', 'a', '#', 'a', ']', 'i', 'n', 'o', '#', 'j', 'i', '[', 'ts', 'u', 'y', 'o', 'o', 'k', 'a', 'n', 'o', '#', 't', 'a', '[', 'm', 'e', ']', 'n', 'o', '#', 'g', 'i', ']', 'j', 'u', 'ts', 'u', 'o', '#', 'k', 'a', '[', 'i', 'h', 'a', 'ts', 'u', '#', 'sh', 'i', '[', 't', 'e', '#', 'i', '[', 'm', 'a', ']', 's', 'u', '.']
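
To make the excerpt above easier to follow, here is a self-contained sketch of how such markers could be inserted. It assumes each phoneme carries the fields `a1`, `a2`, `a3`, and `f1` named in the excerpt. The `Label` type, the `add_prosody_marks` function, and the first (`#` border) branch are assumptions: the excerpt begins with `elif`, so an earlier branch exists, and the output contains `#`, but its exact condition is not shown in this thread. The label values in the example are hand-written for illustration and are not real analyzer output.

```python
from typing import List, NamedTuple, Optional


class Label(NamedTuple):
    p3: str  # current phoneme
    a1: int  # accent-related offset of the current mora (0 triggers a fall)
    a2: int  # mora position in the accent phrase, counted from the start
    a3: int  # mora position in the accent phrase, counted from the end
    f1: int  # number of morae in the accent phrase


# Phonemes after which a phrase border may be emitted (assumed set).
BORDER_PHONEMES = set("aeiouAEIOUN") | {"cl"}


def add_prosody_marks(labels: List[Label]) -> List[str]:
    phones: List[str] = []
    for i, lab in enumerate(labels):
        phones.append(lab.p3)
        # a2 of the following phoneme; None at the end of the input.
        a2_next: Optional[int] = labels[i + 1].a2 if i + 1 < len(labels) else None
        # accent phrase border (assumed branch, not shown in the excerpt)
        if lab.a3 == 1 and a2_next == 1 and lab.p3 in BORDER_PHONEMES:
            phones.append("#")
        # pitch falling (condition taken from the excerpt)
        elif lab.a1 == 0 and a2_next == lab.a2 + 1 and lab.a2 != lab.f1:
            phones.append("]")
        # pitch rising (condition taken from the excerpt)
        elif lab.a2 == 1 and a2_next == 2:
            phones.append("[")
    return phones


# Two-mora phrase "a-me" with the fall after the first mora.
falling = [Label("a", 0, 1, 2, 2), Label("m", 1, 2, 1, 2), Label("e", 1, 2, 1, 2)]
# Same phonemes with no fall inside the phrase.
rising = [Label("a", 1, 1, 2, 2), Label("m", 2, 2, 1, 2), Label("e", 2, 2, 1, 2)]

print(add_prosody_marks(falling))  # ['a', ']', 'm', 'e']
print(add_prosody_marks(rising))   # ['a', '[', 'm', 'e']
```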

@kyakuno
Collaborator

kyakuno commented Apr 19, 2025

A custom user dictionary is also defined.

USERDIC_CSV_PATH = os.path.join(current_file_path, "ja_userdic", "userdict.csv")
USERDIC_BIN_PATH = os.path.join(current_file_path, "ja_userdic", "user.dict")
USERDIC_HASH_PATH = os.path.join(current_file_path, "ja_userdic", "userdict.md5")
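
The three paths suggest a CSV source, a compiled binary, and a stored MD5 of the CSV. One plausible use of that layout is to recompile the binary only when the CSV changes. The sketch below illustrates that idea under this assumption. `file_md5`, `needs_rebuild`, `rebuild_if_stale`, and the `compile_fn` callback are hypothetical names, and the actual logic in the PR may differ.

```python
import hashlib
import os
from typing import Callable


def file_md5(path: str) -> str:
    """Return the hex MD5 digest of a file's bytes."""
    with open(path, "rb") as f:
        return hashlib.md5(f.read()).hexdigest()


def needs_rebuild(csv_path: str, bin_path: str, hash_path: str) -> bool:
    """True if the compiled dictionary is missing or the CSV has changed."""
    if not os.path.exists(bin_path) or not os.path.exists(hash_path):
        return True
    with open(hash_path, "r", encoding="utf-8") as f:
        return f.read().strip() != file_md5(csv_path)


def rebuild_if_stale(
    csv_path: str,
    bin_path: str,
    hash_path: str,
    compile_fn: Callable[[str, str], None],
) -> bool:
    """Compile csv_path into bin_path when stale; return True if compiled."""
    if not os.path.exists(csv_path):
        return False
    if not needs_rebuild(csv_path, bin_path, hash_path):
        return False
    compile_fn(csv_path, bin_path)  # caller supplies the real compiler
    with open(hash_path, "w", encoding="utf-8") as f:
        f.write(file_md5(csv_path))
    return True
```

Passing the compiler in as `compile_fn` keeps the staleness check testable without the dictionary tooling installed.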

@kyakuno
Collaborator

kyakuno commented Apr 19, 2025

`ax` is mapped to アックス ("akkusu").

ax,1348,1348,227,名詞,固有名詞,一般,*,*,*,アックス,アックス,アックス,1/4,*
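
For readers unfamiliar with this row format, the sketch below splits the entry into named fields. The field interpretation is an assumption based on the usual MeCab-style layout (surface, context IDs, cost, part-of-speech columns, readings, then an accent field) and on reading `1/4` as accent type over mora count. `UserDictEntry` and `parse_userdict_row` are hypothetical names.

```python
import csv
from typing import NamedTuple


class UserDictEntry(NamedTuple):
    surface: str        # text as written in the input
    pos: str            # top-level part of speech
    pronunciation: str  # katakana pronunciation
    accent_type: int    # assumed: first number of the "1/4" field
    mora_count: int     # assumed: second number of the "1/4" field


def parse_userdict_row(line: str) -> UserDictEntry:
    fields = next(csv.reader([line]))
    if len(fields) != 15:
        raise ValueError(f"expected 15 fields, got {len(fields)}")
    # This sketch assumes a single "accent/mora" pair in field 13.
    accent_type, mora_count = (int(x) for x in fields[13].split("/"))
    return UserDictEntry(fields[0], fields[4], fields[12], accent_type, mora_count)


ROW = "ax,1348,1348,227,名詞,固有名詞,一般,*,*,*,アックス,アックス,アックス,1/4,*"
entry = parse_userdict_row(ROW)
print(entry.surface, entry.pronunciation, entry.accent_type, entry.mora_count)
```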

@kyakuno
Collaborator

kyakuno commented Apr 19, 2025

My impression is that Japanese support has been strengthened considerably.

@kyakuno kyakuno merged commit dcdab20 into master Apr 19, 2025
@kyakuno kyakuno deleted the gpt-sovits-v3 branch April 19, 2025 12:49