Encoding issues / Umlaut is not decoded correctly #117
Comments
In PHP, use: utf8_decode |
In JavaScript, the following does the same: var decoded_content = self.utf8_decode(content);
|
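A minimal sketch of what such a decode step might look like in plain JavaScript. It assumes the scanner returns a string in which every character code is really one raw UTF-8 byte (that is, the bytes were read as ISO-8859-1). The helper name `repairUtf8` is hypothetical and not part of Instascan; `TextDecoder` is the standard Encoding API found in browsers and Node.js.

```javascript
// Hypothetical helper (not an Instascan API): reinterpret a string whose
// char codes are raw UTF-8 bytes that were mistakenly read as ISO-8859-1.
function repairUtf8(text) {
  const bytes = new Uint8Array(text.length);
  for (let i = 0; i < text.length; i++) {
    const code = text.charCodeAt(i);
    // A code above 0xFF cannot be a single byte, so the input is not a
    // byte string and is returned unchanged.
    if (code > 0xff) return text;
    bytes[i] = code;
  }
  try {
    // fatal: true makes decode() throw on byte sequences that are not
    // valid UTF-8 instead of silently inserting U+FFFD.
    return new TextDecoder('utf-8', { fatal: true }).decode(bytes);
  } catch (e) {
    // Not valid UTF-8: the text was probably decoded correctly already.
    return text;
  }
}

// "f\u00c3\u00bcr" is what "für" looks like after the wrong decoding.
console.log(repairUtf8('f\u00c3\u00bcr'));
```

The fallback branches mean that text which was already decoded correctly passes through untouched, so the helper could be applied to every scan result under the stated assumption.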
Having the same issue. Cyrillic characters are decoded into gibberish:
|
Having the same issue with Korean. |
The problem is in this piece of code: Line 101 in b0f9519
|
@alekciy Thank you for the tip, I have added utf8 decoder in that line and it worked. |
Though this might not get merged. In case somebody needs this fix, you can clone the repo, apply the fix yourself and rebuild the package with:
The |
And what about cp1251? For example, payment slips per GOST R 56042-2014, format ST00011. Ideally, an encoding detector would be added. |
@alekciy I don't think there is a reliable way to detect text encoding, especially when it's CP encodings. It would probably be better to add an encoding parameter to the Scanner class. |
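A rough sketch of how an explicit encoding parameter could work, assuming the raw payload bytes of the QR code are available as a `Uint8Array`. The function `decodePayload` and its `encoding` argument are hypothetical illustrations, not an existing Instascan option; the only real API used is the standard `TextDecoder`.

```javascript
// Hypothetical sketch (not an Instascan API): decode raw QR payload bytes
// with a caller-chosen encoding instead of a hard-coded one.
function decodePayload(bytes, encoding = 'utf-8') {
  // TextDecoder throws a RangeError if the label is unknown to the runtime.
  const decoder = new TextDecoder(encoding);
  return decoder.decode(bytes);
}

// "Привет" encoded as windows-1251 (cp1251), one byte per letter.
const cp1251Bytes = new Uint8Array([0xcf, 0xf0, 0xe8, 0xe2, 0xe5, 0xf2]);
console.log(decodePayload(cp1251Bytes, 'windows-1251'));

// "für" encoded as UTF-8, decoded with the default.
const utf8Bytes = new Uint8Array([0x66, 0xc3, 0xbc, 0x72]);
console.log(decodePayload(utf8Bytes));
```

Which encoding labels are accepted depends on the runtime: browsers implement the full Encoding Standard list, while Node.js needs a build with full ICU data for legacy code pages such as windows-1251.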
I have trouble decoding the QR code from this PDF (on page 27).
It seems the Umlaut in the last line is not decoded correctly. Screenshot from the live demo:
The last line should read ..."für Gartenarbeit und Entsorgung"...
I can decode the QR Code just fine in Java using ZXing.
If I set the CHARACTER_SET decoding hint to "ISO-8859-1", the decoded result is exactly the same as pictured in the screenshot, so I suspect that ISO-8859-1 is assumed somewhere in InstaScan.
Here's the QR Code I used for easier copy/pasting:
Is there a way to specify the encoding to use, or is this a bug?