UnicodeDecodeError: 'charmap' codec can't decode byte 0x9d in position 15703: character maps to <undefined> #5

Open
Llaves opened this issue Nov 22, 2021 · 2 comments

Comments

@Llaves

Llaves commented Nov 22, 2021

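This appears to be caused by a curly double quote in the downloaded HTML: its UTF-8 encoding contains the byte 0x9d, which has no mapping in cp1252, the encoding `open()` falls back to on Windows when none is given. The fix is to pass the encoding explicitly on line 72: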

    soup = BeautifulSoup(open(os.path.join("doc.qt.io", c[-1]), encoding="utf8"), "lxml")
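
For anyone who wants to reproduce this without Windows, here is a minimal sketch of the failure and the fix. It assumes the offending character is U+201D (right double quotation mark), which is a guess based on the 0x9d byte in the traceback. The file name `sample.html` and the helpers `read_text` and `demo` are made up for illustration and are not part of the script.

```python
import os
import tempfile

# Assumption: the offending character is U+201D (right double quotation mark).
# Its UTF-8 encoding ends in 0x9d, a byte that cp1252 leaves undefined.
raw = "\u201d".encode("utf-8")
assert raw == b"\xe2\x80\x9d"


def read_text(path, encoding):
    """Read a whole file as text using an explicit encoding."""
    with open(path, encoding=encoding) as f:
        return f.read()


def demo():
    """Write UTF-8 bytes with curly quotes, then read them back both ways."""
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "sample.html")  # hypothetical file name
        with open(path, "wb") as f:
            f.write(b"<p>\xe2\x80\x9cquoted\xe2\x80\x9d</p>")

        # Forcing cp1252 mimics the Windows default and hits the 0x9d byte.
        try:
            read_text(path, "cp1252")
            cp1252_failed = False
        except UnicodeDecodeError:
            cp1252_failed = True

        # Passing the real encoding reads the file cleanly on any platform.
        text = read_text(path, "utf8")
    return cp1252_failed, text


if __name__ == "__main__":
    failed, text = demo()
    print("cp1252 raised UnicodeDecodeError:", failed)
    print("utf8 read:", text)
```

Passing `encoding=` explicitly makes the result independent of the platform's locale default, which would explain why the error does not show up on macOS.
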
@acbetter
Owner

acbetter commented Feb 3, 2022

I usually run the script on macOS and haven't been able to reproduce this issue yet. Could it be related to the operating system?

@okletzmayr
Contributor

Just adding this as an FYI: I downloaded and compiled the resources for my PR on macOS as well, so I guess it's possible that an implementation of wget on Windows defaults to the cp1252 encoding.
