encoding - How windows notepad interpret characters -
i wondering how windows interpret characters, instance:
i maked file hexeditor 3 bytes e3 81 81
. bytes "ぁ"
character encoded utf-8.
i open notepad , display "ぁ"
i dont specified encoding of file, created bytes. , notepad interpret correctly.
is notepad guessing encoding is? or hex editor saving bytes specific encoding.
if file contains these 3 bytes, there no information @ encoding use.
a byte byte, , there no way include encoding information in it. besides, hex editor doesn't know intended decode data text.
notepad uses ansi encoding, if reads file utf-8 has guess encoding based on data in file.
if save file utf-8, notepad put bom (byte order mark) ef bb bf
@ beginning of file.
Comments
Post a Comment