Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug]: UnicodeEncodeError: 'charmap' codec can't encode character '\u274c' in position 0: character maps to <undefined> #1120

Closed
3 tasks
monuminu opened this issue Sep 11, 2024 · 5 comments
Labels
autoresolved awaiting_response Maintainers or community have suggested solutions or requested info, awaiting filer response bug Something isn't working stale Used by auto-resolve bot to flag inactive issues triage Default label assignment, indicates new issue needs reviewed by a maintainer

Comments

@monuminu
Copy link

Do you need to file an issue?

  • I have searched the existing issues and this bug is not already filed.
  • My model is hosted on OpenAI or Azure. If not, please look at the "model providers" issue and don't file a new one here.
  • I believe this is a legitimate bug, not just a question. If this is a question, please use the Discussions area.

Describe the bug

File "C:\Users\mrajguru.conda\envs\kotaemon\lib\site-packages\graphrag\index\api.py", line 79, in build_index
progress_reporter.error(output.workflow)
File "C:\Users\mrajguru.conda\envs\kotaemon\lib\site-packages\graphrag\index\progress\rich.py", line 127, in error
self._console.print(f"❌ [red]{message}[/red]")
File "C:\Users\mrajguru.conda\envs\kotaemon\lib\site-packages\rich\console.py", line 1683, in print
with self:
File "C:\Users\mrajguru.conda\envs\kotaemon\lib\site-packages\rich\console.py", line 864, in exit
self._exit_buffer()
File "C:\Users\mrajguru.conda\envs\kotaemon\lib\site-packages\rich\console.py", line 822, in _exit_buffer
self._check_buffer()
File "C:\Users\mrajguru.conda\envs\kotaemon\lib\site-packages\rich\console.py", line 2024, in _check_buffer
self._write_buffer()
File "C:\Users\mrajguru.conda\envs\kotaemon\lib\site-packages\rich\console.py", line 2060, in _write_buffer
legacy_windows_render(buffer, LegacyWindowsTerm(self.file))
File "C:\Users\mrajguru.conda\envs\kotaemon\lib\site-packages\rich_windows_renderer.py", line 19, in legacy_windows_render
term.write_text(text)
File "C:\Users\mrajguru.conda\envs\kotaemon\lib\site-packages\rich_win32_console.py", line 403, in write_text
self.write(text)
File "C:\Users\mrajguru.conda\envs\kotaemon\lib\encodings\cp1252.py", line 19, in encode
return codecs.charmap_encode(input,self.errors,encoding_table)[0]
UnicodeEncodeError: 'charmap' codec can't encode character '\u274c' in position 0: character maps to

Steps to reproduce

No response

Expected Behavior

No response

GraphRAG Config Used

# Paste your config here

Logs and screenshots

No response

Additional Information

  • GraphRAG Version:
  • Operating System:
  • Python Version:
  • Related Issues:
@monuminu monuminu added bug Something isn't working triage Default label assignment, indicates new issue needs reviewed by a maintainer labels Sep 11, 2024
@natoverse
Copy link
Collaborator

Is your input text English and UTF-8 encoded?

@natoverse natoverse added the awaiting_response Maintainers or community have suggested solutions or requested info, awaiting filer response label Sep 11, 2024
@Sivan22
Copy link

Sivan22 commented Sep 12, 2024

i had the same problem with Hebrew text.

@natoverse
Copy link
Collaborator

We have some notes on non-English text here: #696

Copy link

This issue has been marked stale due to inactivity after repo maintainer or community member responses that request more information or suggest a solution. It will be closed after five additional days.

@github-actions github-actions bot added the stale Used by auto-resolve bot to flag inactive issues label Sep 20, 2024
Copy link

This issue has been closed after being marked as stale for five days. Please reopen if needed.

@github-actions github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Sep 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
autoresolved awaiting_response Maintainers or community have suggested solutions or requested info, awaiting filer response bug Something isn't working stale Used by auto-resolve bot to flag inactive issues triage Default label assignment, indicates new issue needs reviewed by a maintainer
Projects
None yet
Development

No branches or pull requests

3 participants