Skip to content

Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English

License

Notifications You must be signed in to change notification settings

pengzhendong/g2p-mix

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

47 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

g2p-mix

Usage

$ pip install g2p-mix
$ python

Mandarin

>>> from g2p_mix import G2pMix
>>> G2pMix().g2p("你这个idea, 不太make sense。", sandhi=True, return_seg=True)
[
  {"word": "", "lang": "ZH", "pos": "r", "phones": [["n", "i3"]]},
  {"word": "这个", "lang": "ZH", "pos": "r", "phones": [["zh", "e4"], ["g", "e5"]]},
  {"word": "idea", "lang": "EN", "pos": null, "phones": ["AY0", "D", "IY1", "AH0"]},
  {"word": ",", "lang": "SYM", "pos": "x", "phones": []},
  {"word": "不太", "lang": "ZH", "pos": "d", "phones": [["b", "u2"], ["t", "ai4"]]},
  {"word": "make", "lang": "EN", "pos": null, "phones": ["M", "EY1", "K"]},
  {"word": "sense", "lang": "EN", "pos": null, "phones": ["S", "EH1", "N", "S"]},
  {"word": "", "lang": "SYM", "pos": "x", "phones": []},
]

Cantonese

>>> G2pMix(jyut=True).g2p("你这个idea, 不太make sense。", return_seg=True)
[
  {"word": "", "lang": "ZH", "pos": "PRON", "phones": [["n", "ei5"]]},
  {"word": "這個", "lang": "ZH", "pos": "PRON", "phones": [["z", "e3"], ["g", "o3"]]},
  {"word": "idea", "lang": "EN", "pos": null, "phones": ["AY0", "D", "IY1", "AH0"]},
  {"word": ",", "lang": "SYM", "pos": "x", "phones": []},
  {"word": "", "lang": "ZH", "pos": "ADV", "phones": [["b", "at1"]]},
  {"word": "", "lang": "ZH", "pos": "ADV", "phones": [["t", "aai3"]]},
  {"word": "make", "lang": "EN", "pos": null, "phones": ["M", "EY1", "K"]},
  {"word": "sense", "lang": "EN", "pos": null, "phones": ["S", "EH1", "N", "S"]},
  {"word": "", "lang": "SYM", "pos": "x", "phones": []},
]

About

Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages