Shift_JIS

The table data is here: Shift_JIS.tar.bz2
unicode.org-mappings/EASTASIA/JIS/JIS0208.TXT (omit the second column)
is JISX0208 in Shift-JIS-Encoding.
unicode.org-mappings/EASTASIA/JIS/SHIFTJIS.TXT
official source. Consists of JISX0201 and JISX0208 in Shift-JIS-Encoding. Lacks the control characters 0x00..0x1F, 0x7F.
unicode.org-mappings/VENDORS/APPLE/JAPANESE.TXT
differs in mapping of 0x7E, adds mapping of 0x80, 0xA0, 0xFD, 0xFE, 0xFF. Differs in mappings of 0x815C, 0x815F. Adds mappings for 0x{85..88}{40..7E,80..FC} up to 0x886D, plus mappings for 0x{EB..ED}{40..7E,80..9C}
unicode.org-mappings/VENDORS/MICSFT/WINDOWS/CP932.TXT
microsoft-2005/CP932.TXT
consists of ASCII (not ISO646-JP!), JISX0201 upper half, and JISX0208 in Shift-JIS-Encoding (with differences at 0x815F, 0x8160, 0x8161, 0x817C, 0x8191, 0x8192, 0x81CA), plus mappings for 0x87{40..7E,80..9C} and 0x{ED..EE,FA..FC}{40..7E,80..FC} up to 0xFC4B,
whatwg/Shift_JIS.TXT
glibc-2.3.2-iconv/CP932.TXT
jdk-1.1.8/MS932.TXT
jdk-1.3.1/MS932.TXT
jdk-1.4.2/MS932.TXT
jdk-1.5.0/MS932.TXT
is CP932, plus private area mappings to U+E000..U+E757.
windows-xp/CP932.TXT
windows-2016/CP932.TXT
Similar to CP932, but adds a mapping of 0x80 and private area mappings to U+E000..U+E757.
unicode.org-mappings/VENDORS/MICSFT/WindowsBestFit/CP932.TXT
Same as windows-2016/CP932.TXT, but adds dummy mappings for 0x8540, 0x8640, 0xEB40, 0xEC40, 0xEF40.
windows-2000/CP932.TXT
A variant of CP932.
windows-2000/CP10001.TXT
Another variant of Shift-JIS.
windows-2016/CP10001.TXT
Similar to windows-2000/CP10001.TXT, but adds mappings of 0xFF, 0x86D3..0x86D6, 0x8740..0x874A, 0x874C, 0x874E..0x874F, 0x8751..0x875D, 0x875F..0x8775, 0x877E, 0x8782..0x8792, 0x8798..0x879E, 0xED40..0xED7E, 0xED80..0xEDFC, 0xEE40..0xEE7E, 0xEE80..0xEEEC, 0xEEEF..0xEEFC, 0xFA40..0xFA54, 0xFA58..0xFA5B.
glibc-2.2-iconv/SJIS.TXT
glibc-2.2-charmaps/SJIS.TXT
consists of JISX0201 and JISX0208 in Shift-JIS-Encoding.
glibc-2.2.2-iconv/SJIS.TXT
glibc-2.3.6-charmaps/SHIFT_JIS.TXT
consists of JISX0201 and JISX0208 in Shift-JIS-Encoding, with a difference at 0x815F.
libiconv-1.0/SHIFT-JIS.TXT
consists of JISX0201 and JISX0208 in Shift-JIS-Encoding, plus user-defined range U+E000..U+E757.
libiconv-1.7/SHIFT-JIS.TXT
consists of JISX0201 and JISX0208 in Shift-JIS-Encoding, plus user-defined range U+E000..U+E757, with a difference at 0x815F.
libiconv-1.1/CP932.TXT
consists of ASCII (not ISO646-JP!), JISX0201 upper half, and JISX0208 in Shift-JIS-Encoding, plus mappings for 0x87{40..7E,80..9C} and 0x{ED..EE,FA..FC}{40..7E,80..FC} up to 0xFC4B, plus user-defined range U+E000..U+E757.

Compared to CP932, it has different mappings of 0x815F, 0x8160, 0x8161, 0x817C, 0x8191, 0x8192, 0x81CA, plus user-defined range U+E000..U+E757.

libiconv-1.7/CP932.TXT
consists of ASCII (not ISO646-JP!), JISX0201 upper half, and JISX0208 in Shift-JIS-Encoding with a difference at 0x815F, plus mappings for 0x87{40..7E,80..9C} and 0x{ED..EE,FA..FC}{40..7E,80..FC} up to 0xFC4B, plus user-defined range U+E000..U+E757.

Compared to CP932, it has different mappings of 0x8160, 0x8161, 0x817C, 0x8191, 0x8192, 0x81CA, plus user-defined range U+E000..U+E757.

solaris-2.7/SJIS.TXT
consists of JISX0201 and JISX0208 in Shift-JIS-Encoding, plus many-to-U+3013 mappings, plus 0x{EB..EC}{40..7E,80..FC} to U+00AE mapping, plus mappings for 0x{ED..EE,FA..FC}{40..7E,80..FC} up to 0xFC4B.
solaris-2.7/SJIS-OTHER.TXT
consists of ASCII (not ISO646-JP!), JISX0201 upper half, and JISX0208 in Shift-JIS-Encoding (with a difference at 0x815F), plus mappings for 0x87{40..7E,80..9C} and 0x{ED..EE,FA..FC}{40..7E,80..FC} up to 0xFC4B, plus user-defined range U+E000..U+E757.
osf1-5.1/SJIS.TXT
consists of JISX0201 and JISX0208 in Shift-JIS-Encoding, plus private area mappings for 0x{EB..FC}{40..7E,80..FC}.
jdk-1.3.1/SJIS.INVERSE.TXT
consists of JISX0201 and JISX0208 in Shift-JIS-Encoding, but differing in the mapping of 0x815F, and with additional irreversible mappings for 0x5C and 0x7E.
jdk-1.4.2/SJIS.INVERSE.TXT
jdk-1.5.0/SJIS.INVERSE.TXT
consists of JISX0201 and JISX0208 in Shift-JIS-Encoding, but differing in the mappings of 0x815C, 0x815F, and with additional irreversible mappings for 0x5C and 0x7E.
jdk-1.1.8/SJIS.TXT
jdk-1.1.8/SJIS.INVERSE.TXT
are both unusable.
jdk-1.4.2/PCK.INVERSE.TXT
consists of JISX0201 and JISX0208 in Shift-JIS-Encoding, but differing in the mappings of 0x815C, 0x815F, with additional mappings for 0x8740..0x8799, 0xFA40..0xFAFC, 0xFB40..0xFBFC, 0xFC40..0xFC4B, and with additional irreversible mappings for 0x5C and 0x7E.
jdk-1.5.0/PCK.INVERSE.TXT
consists of JISX0201 and JISX0208 in Shift-JIS-Encoding, but differing in the mappings of 0x815F, with additional mappings for 0x8740..0x8799, 0xFA40..0xFAFC, 0xFB40..0xFBFC, 0xFC40..0xFC4B, and with additional irreversible mappings for 0x5C and 0x7E.
glibc-2.2.2-iconv/IBM932.TXT
is a variant of CP932, with lots of different mappings.
zos/05028.TXT
is the same with the control characters 0x1A, 0x1C, 0x7F permuted, and with different mappings for 0x815C, 0x8160..0x8161, 0x817C, 0xFA55.
zos/IBM-942.TXT
zos/09124.TXT
zos/25508.TXT
zos/25518.TXT
zos/29614.TXT
zos/33700.TXT
is the same as zos/05028.TXT, with additional mappings of 0x80, 0xA0, 0xFD..0xFF.
zos/IBM-943.TXT
is a variant of CP932, with the control characters 0x1A, 0x1C, 0x7F permuted.
glibc-2.2.2-iconv/IBM943.TXT
jdk-1.3.0beta/CP943C.TXT
is a variant of CP932, with different mappings of 0x815C, 0x815F, 0x8191, 0x8192, 0x81CA, 0xEEFA, 0xFA55.
jdk-1.3.0beta/CP943.TXT
is the same with ASCII replaced with ISO646-JP.
zos/IBM-54191.TXT
is the same with ASCII replaced with ISO646-JP, with the control characters 0x1A, 0x1C, 0x7F permuted, with a different mapping for 0x5E, and with additional mappings of 0x80, 0xA0, 0xFD..0xFF.
zos/IBM-62383.TXT
is the same with ASCII replaced with ISO646-JP, with the control characters 0x1A, 0x1C, 0x7F permuted.
aix-4.3.2/IBM-943.TXT
icu-1.3.1/IBM-943.TXT
icu-1.3.1/IBM-942.TXT
icu-1.7/IBM-942.TXT
icu-2.2/IBM-942_P120-2000.TXT
icu-2.2/IBM-942_P12A-2000.TXT
icu-2.8/IBM-942_P12A-1999.TXT
icu-3.4/IBM-942_P12A-1999.TXT
icu-1.7/IBM-943.TXT
icu-2.2/IBM-943_P130-2000.TXT
icu-2.8/IBM-943_P130-1999.TXT
icu-3.4/IBM-943_P130-1999.TXT
icu-2.2/IBM-943_P14A-2000.TXT
icu-2.8/IBM-943_P15A-2003.TXT
icu-3.4/IBM-943_P15A-2003.TXT
are variants of CP932 with lots of mappings missing.
jdk-1.3.1/CP942C.TXT
jdk-1.4.2/CP942C.TXT
jdk-1.5.0/CP942C.TXT
is a variant of CP932 with lots of different mappings.
jdk-1.3.1/CP942.TXT
jdk-1.4.2/CP942.TXT
jdk-1.5.0/CP942.TXT
is the same with ASCII replaced with ISO646-JP and with the control characters 0x1A, 0x1C, 0x7F permuted.
jdk-1.1.8/CP942C.TXT
is a variant of CP932 with lots of different mappings.
jdk-1.1.8/CP942.TXT
is the same with ASCII replaced with ISO646-JP.
jdk-1.3.1/CP943C.TXT
jdk-1.4.2/CP943C.TXT
jdk-1.5.0/CP943C.TXT
is a buggy version of jdk-1.3.0beta/CP943C.TXT: it maps all undefined combinations to U+0000.
jdk-1.3.1/CP943.TXT
jdk-1.4.2/CP943.TXT
jdk-1.5.0/CP943.TXT
is a buggy version of jdk-1.3.0beta/CP943.TXT: it maps all undefined combinations to U+0000.
jdk-1.1.8/CP943C.TXT
is a variant of CP932: adds mappings for 0x80, 0xA0, 0xFD..0xFF, more user- defined ranges 0xEB9F..0xEC9E (U+E758..U+E813), and differs in the mappings of 0x815C, 0x815F, 0x8191, 0x8192, 0x81CA, 0xEEFA, 0xFA55, 0xFA59.
jdk-1.1.8/CP943.TXT
is the same with ASCII replaced with ISO646-JP.
icu-1.7/IBM-941.TXT
is a variant of CP932 without ASCII.

Comparison of conversion tables
Bruno Haible <bruno@clisp.org>

Last modified: 19 January 2020.