Unrecognized encoded Chinese text file #142 #143
What is the expected encoding?
Chinese encoding, maybe GB18030
On my side, GB2312 text was recognized as EUC-JP with confidence 0.99 when the text is short (10 characters), but it is detected correctly when the text is long (>200 characters).
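One possible reason short inputs are ambiguous is that GB2312 and EUC-JP both encode most characters as two bytes in the same high-byte range, so a short byte string may be structurally valid in both. The sketch below uses only Python's standard codecs, not the detector discussed in this issue; `decodable_as` and `CANDIDATES` are hypothetical names introduced for illustration. It lists which candidate encodings accept a short GB2312 sample without a decoding error.

```python
# Hypothetical helper for illustration; not part of the detector in this issue.
CANDIDATES = ("gb2312", "gb18030", "euc_jp")


def decodable_as(data, encodings=CANDIDATES):
    """Return the encodings under which `data` decodes without error."""
    accepted = []
    for name in encodings:
        try:
            data.decode(name)
        except UnicodeDecodeError:
            # This byte sequence is not valid in this encoding.
            continue
        accepted.append(name)
    return accepted


# A short sample (6 characters), encoded as GB2312.
sample = "中文编码测试".encode("gb2312")
print(decodable_as(sample))
```

If more than one encoding is listed, byte validity alone cannot separate them, and a detector has to fall back on character frequency statistics, which are less reliable on a handful of characters than on a few hundred.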
Any chance we're gonna get an update on that one, given the low activity of late? My library has an open issue depending on it 😅
Unrecognized encoded Chinese text file #142
I have uploaded the corresponding file