Home | History | Annotate | Download | only in mappings
      1 # Note: A number of encodings are handled with purely algorithmic converters,
      2 # without any mapping tables:
      3 # US-ASCII, ISO 8859-1, UTF-7/8/16/32, SCSU
      4 
      5 # Listed here:
      6 
      7 # * ISO 8859-2..9,10,13,14,15,16
      8 #   - 8859-11 table is not included. It's rather treated as a synonym of
      9 #     Windows-874
     10 # * Windows-125[0-8]
     11 # * Simplified Chinese : GBK(Windows cp936), GB 18030
     12 #   - GB2312 table was removed and 4 aliases for GB2312 were added
     13 #     to GBK in convrtrs.txt to treat GB2312 as a synonym of GBK.
     14 #   - GB-HZ is supported now that it uses the GBK table.
     15 # * Traditional Chinese : Big5 (Windows cp950), Big5HKSCS (no PUA)
     16 # * Japanese : SJIS (shift_jis-html5), EUC-JP (euc-jp_html5)
     17 # * Korean : Windows-949
     18 #  - EUC-KR table was removed. It's different from Windows-949, but
     19 #    in practice EUC-KR and Windows-949 are treated synonymously.
     20 #  - ISO-2022-KR is now supported by with a one-line change
     21 #    in common/ucnv2022.c and other changes in convrtrs.txt to make it
     22 #    use the windows-949 table.
     23 # * Thai : Windows-874
     24 #   - TIS-620 and ISO-8859-11 are treated as synonyms of Windows-874
     25 #     although they're not the same.
     26 # * Mac encodings : MacRoman, MacCyrillic
     27 # * Cyrillic : KOI8-R, KOI8-U, IBM-866
     28 # * Three 'fake' tables to prevent Webkit from falling back to the default
     29 #   encoding when coming across ISO-2022-CN-(Ext).
     30 #
     31 # * Missing
     32 #  - Armenian, Georgian  : extremly rare
     33 #  - Mac encodings (other than Roman and Cyrillic) : extremly rare
     34 
     35 UCM_SOURCE_FILES=
     36 
     37 UCM_SOURCE_CORE=ibm-912_P100-1995.ucm ibm-913_P100-2000.ucm\
     38 ibm-914_P100-1995.ucm ibm-915_P100-1995.ucm\
     39 ibm-1089_P100-1995.ucm ibm-9005_X110-2007.ucm\
     40 ibm-5012_P100-1999.ucm ibm-920_P100-1995.ucm\
     41 iso-8859_10-1998.ucm\
     42 ibm-921_P100-1995.ucm iso-8859_14-1998.ucm ibm-923_P100-1998.ucm\
     43 iso-8859_16-2001.ucm\
     44 ibm-5346_P100-1998.ucm ibm-5347_P100-1998.ucm ibm-5348_P100-1997.ucm\
     45 ibm-5349_P100-1998.ucm ibm-5350_P100-1998.ucm ibm-9447_P100-2002.ucm\
     46 ibm-9448_X100-2005.ucm ibm-9449_P100-2002.ucm ibm-5354_P100-1998.ucm\
     47 windows-936-2000.ucm gb18030.ucm\
     48 windows-950-2000.ucm ibm-1375_P100-2007.ucm\
     49 shift_jis-html5.ucm euc-jp-html5.ucm\
     50 windows-949-2000.ucm\
     51 windows-874-2000.ucm ibm-874_P100-1995.ucm\
     52 macos-0_2-10.2.ucm macos-7_3-10.2.ucm\
     53 ibm-878_P100-1996.ucm ibm-1168_P100-2002.ucm ibm-866_html5-2012.ucm\
     54 noop-cns-11643.ucm noop-gb2312_gl.ucm noop-iso-ir-165.ucm
     55 
     56 
     57 # Do not build EBCDIC converters.
     58 # ibm-37 and ibm-1047 are hardcoded in Makefile.in and
     59 # they're removed by modifying the file. It's also hard-coded in makedata.mak for
     60 # Winwodws, but we don't have to touch it because the data dll is generated out of
     61 # icu*.dat file generated on Linux.
     62 UCM_SOURCE_EBCDIC =
     63