Notepad3 does not recognize encoding correctly.

Created on 27 Nov 2019  路  16Comments  路  Source: rizonesoft/Notepad3

Notepad3 does not recognize encoding correctly. The Chinese Character is garbled when opening an txt with English words and Chinese Character.

configuration encoding detection

Most helpful comment

From ver _5.19.1128.2690 on, Notepad3 is pushed to be more UTF-8 friendly (less current system's ANSI code-page bound).
(Fallback prefer UTF-8 (valid) over system's ANSI CP if not forced by the switch shown above )

All 16 comments

Could you please send us a sample file of your problem packed in a zip file. :thinking:

Hello @liwenbus ,
Question: which version of Notepad3 do you use ? ("Help" --> "About...")

Feel free to test the BETA version "Notepad3Portable_5.19.1127.2688_BETA" or higher.
See "Notepad3 BETA-channel access #1129" or here Notepad3Portable_5.19.1127.2688_BETA.

I sometimes get garbled text. This problem has also occurred in Japanese text since Notepad2.

Notepad 3 remembers the character code, so it was only when it was first opened.

@maboroshin : The "_remember encoding in file history (recent files)_" feature changed from "_force this encoding_" to "_suggestion on unreliable encoding detection_". The reason for this: It has been in-transparent to the user, why this encoding has been chosen and also gets lost, if file is dropped from history.

I feel less garbled than before. If I get garbled, I will inform you of the file.

I read about 50 files, but they are not reproduced garble.

Could you please send us a sample file of your problem packed in a zip file. 馃

Hello. The version is v5.19.815.2595. Here are the files.
Test.zip

@hpwamr : please analyze ...

The version is v5.19.815.2595

Hello @liwenbus ,

  • Indeed you are using the latest Notepad3 Release version.
  • In my "Out-of-the-Box" Notepad3 v5.19.815.2595, your file is correctly encoded !
  • Maybe your "Notepad3.ini" is not correctly updated. 馃
    For v5.19.815.2595, it should be: SettingsVersion=4 and encoded as: Unicode (UTF-8) Signature
  • But since then we have completely rewritten the encoding detection.
  • The latest BETA version detects the code-page "GB18030" of your file correctly. 馃憤

test_chinese_chars - UCD=GB18030 (issue #1815).zip

Notepad3 v2689 BETA - 2019-11-28_101239

We will soon release a new Notepad3 version that will integrate all latest changes.
Changes.txt

My advice is that you:

  • try to recode your file to GB18030 with your current Notepad3
  • or use the latest BETA version as Notepad3Portable Apps (to optimize your beta experience, please read the note at #1129).
  • or update your currently installed version with the latest beta files (see #1105)

Feel free to test the BETA version "Notepad3Portable_5.19.1127.2689_BETA" or higher.
See "Notepad3 BETA-channel access #1129" or here Notepad3Portable_5.19.1127.2689_BETA.

In "Encoding" of setting, All checked without "Use as fallback...". Version is 5.19.1120.2681.

Failed with this file: dl.zip

In "Encoding" of setting, All checked without "Use as fallback...". Version is 5.19.1120.2681.

Failed with this file: dl.zip

Hello @maboroshin , @RaiKoHoff

This is a very good example to check/improve our encoding detection.

First findings:

  • Your file is "UTF-8" (No Signature). 馃憤
  • Notepad++ detects it correctly !
  • Notepad3 NO ! 馃槵

I have to analyze more deeply tonight. 馃

The version is v5.19.815.2595

Hello @liwenbus ,

  • Indeed you are using the latest Notepad3 Release version.
  • In my "Out-of-the-Box" Notepad3 v5.19.815.2595, your file is correctly encoded !
  • Maybe your "Notepad3.ini" is not correctly updated. 馃
    For v5.19.815.2595, it should be: SettingsVersion=4 and encoded as: Unicode (UTF-8) Signature
  • But since then we have completely rewritten the encoding detection.
  • The latest BETA version detects the code-page "GB18030" of your file correctly. 馃憤

test_chinese_chars - UCD=GB18030 (issue #1815).zip

Notepad3 v2689 BETA - 2019-11-28_101239

We will soon release a new Notepad3 version that will integrate all latest changes.
Changes.txt

My advice is that you:

  • try to recode your file to GB18030 with your current Notepad3
  • or use the latest BETA version as Notepad3Portable Apps (to optimize your beta experience, please read the note at #1129).
  • or update your currently installed version with the latest beta files (see #1105)

Feel free to test the BETA version "Notepad3Portable_5.19.1127.2689_BETA" or higher.
See "Notepad3 BETA-channel access #1129" or here Notepad3Portable_5.19.1127.2689_BETA.

OK. Thank you very much!!

Feel free to test development beta version _5.19.1128.2690_BETA.

Changed: Encoding Detection Fallback (if encoding analysis result is not reliable):
=> If fallback is not explicitly defined (File->Encoding-Set Default... : "Use as fallback..."),
then use UTF-8 (if valid for document) else use current system's ANSI code-page.

Changed: Encoding Detection Fallback (if encoding analysis result is not reliable):
=> If fallback is not explicitly defined (File->Encoding-Set Default... : "Use as fallback..."),
then use UTF-8 (if valid for document) else use current system's ANSI code-page.

I confirm with "Use as fallback on detection failure" checked, this UTF-8 text is correctly detected. 馃憤

2019-11-28_140534

From ver _5.19.1128.2690 on, Notepad3 is pushed to be more UTF-8 friendly (less current system's ANSI code-page bound).
(Fallback prefer UTF-8 (valid) over system's ANSI CP if not forced by the switch shown above )

OK. Thank you very much!!

Hello @liwenbus ,
Is your problem solved ? 馃
If No, could you give a little more detail? ?

If Yes, as far as I'm concerned, I think you (as requester) can close this issue...

Hello @hpwamr
Yes, my problem is solved! Thank you so much.

Was this page helpful?
0 / 5 - 0 ratings

Related issues

hpwamr picture hpwamr  路  3Comments

craigo- picture craigo-  路  3Comments

rizonesoft picture rizonesoft  路  4Comments

bravo-hero picture bravo-hero  路  3Comments

dlong500 picture dlong500  路  3Comments