encoding - How windows notepad interpret characters -


i wondering how windows interpret characters, instance:

i maked file hexeditor 3 bytes e3 81 81. bytes "ぁ" character encoded utf-8.

i open notepad , display "ぁ"

i dont specified encoding of file, created bytes. , notepad interpret correctly.

is notepad guessing encoding is? or hex editor saving bytes specific encoding.

if file contains these 3 bytes, there no information @ encoding use.

a byte byte, , there no way include encoding information in it. besides, hex editor doesn't know intended decode data text.

notepad uses ansi encoding, if reads file utf-8 has guess encoding based on data in file.

if save file utf-8, notepad put bom (byte order mark) ef bb bf @ beginning of file.


Comments

Popular posts from this blog

linux - Using a Cron Job to check if my mod_wsgi / apache server is running and restart -

actionscript 3 - TweenLite does not work with object -

jQuery Ajax Render Fragments OR Whole Page -