1 # Note: A number of encodings are handled with purely algorithmic converters, 2 # without any mapping tables: 3 # US-ASCII, ISO 8859-1, UTF-7/8/16/32, SCSU 4 5 # Listed here: 6 7 # * ISO 8859-2..9,10,13,14,15,16 8 # - 8859-11 table is not included. It's rather treated as a synonym of 9 # Windows-874 10 # * Windows-125[0-8] 11 # * Simplified Chinese : GBK(Windows cp936), GB 18030 12 # - GB2312 table was removed and 4 aliases for GB2312 were added 13 # to GBK in convrtrs.txt to treat GB2312 as a synonym of GBK. 14 # - GB-HZ is supported now that it uses the GBK table. 15 # * Traditional Chinese : Big5 (Windows cp950), Big5HKSCS (no PUA) 16 # * Japanese : SJIS (Windows cp932), EUC-JP (google-euc_jp_mod) 17 # * Korean : Windows-949 18 # - EUC-KR table was removed. It's different from Windows-949, but 19 # in practice EUC-KR and Windows-949 are treated synonymously. 20 # - ISO-2022-KR is now supported by with a one-line change 21 # in common/ucnv2022.c and other changes in convrtrs.txt to make it 22 # use the windows-949 table. 23 # * Thai : Windows-874 24 # - TIS-620 and ISO-8859-11 are treated as synonyms of Windows-874 25 # although they're not the same. 26 # * Mac encodings : MacRoman, MacCyrillic 27 # * Cyrillic : KOI8-R, KOI8-U 28 # * WebKit wants PC-Arabic (DOS 720 = IBM-864) 29 # * Three 'fake' tables to prevent Webkit from falling back to the default 30 # encoding when coming across ISO-2022-CN-(Ext). 31 # 32 # * Missing 33 # - Armenian, Georgian : extremly rare 34 # - Mac encodings (other than Roman and Cyrillic) : extremly rare 35 36 UCM_SOURCE_FILES= 37 38 UCM_SOURCE_CORE=ibm-912_P100-1995.ucm ibm-913_P100-2000.ucm\ 39 ibm-914_P100-1995.ucm ibm-915_P100-1995.ucm\ 40 ibm-1089_P100-1995.ucm ibm-9005_X110-2007.ucm\ 41 ibm-5012_P100-1999.ucm ibm-920_P100-1995.ucm\ 42 iso-8859_10-1998.ucm\ 43 ibm-921_P100-1995.ucm iso-8859_14-1998.ucm ibm-923_P100-1998.ucm\ 44 iso-8859_16-2001.ucm\ 45 ibm-5346_P100-1998.ucm ibm-5347_P100-1998.ucm ibm-5348_P100-1997.ucm\ 46 ibm-5349_P100-1998.ucm ibm-5350_P100-1998.ucm ibm-9447_P100-2002.ucm\ 47 ibm-9448_X100-2005.ucm ibm-9449_P100-2002.ucm ibm-5354_P100-1998.ucm\ 48 windows-936-2000.ucm gb18030.ucm\ 49 windows-950-2000.ucm ibm-1375_P100-2007.ucm\ 50 ibm-943_P15A-2003.ucm google-euc_jp_mod.ucm\ 51 windows-949-2000.ucm\ 52 windows-874-2000.ucm ibm-874_P100-1995.ucm\ 53 macos-0_2-10.2.ucm macos-7_3-10.2.ucm\ 54 ibm-878_P100-1996.ucm ibm-1168_P100-2002.ucm\ 55 ibm-864_X110-1999.ucm\ 56 noop-cns-11643.ucm\ 57 noop-gb2312_gl.ucm\ 58 noop-iso-ir-165.ucm 59 60 # Do not build EBCDIC converters. 61 # ibm-37 and ibm-1047 are hardcoded in Makefile.in and 62 # they're removed by modifying the file. It's also hard-coded in makedata.mak for 63 # Winwodws, but we don't have to touch it because the data dll is generated out of 64 # icu*.dat file generated on Linux. 65 UCM_SOURCE_EBCDIC = 66