Home | History | Annotate | only in /external/icu4c/data/mappings
Up to higher level directory
NameDateSize
cns-11643-1992.ucm03-Jun-2014318.7K
convrtrs.txt03-Jun-201478.6K
docomo-shift_jis-2012.ucm03-Jun-2014173.8K
ebcdic-xml-us.ucm03-Jun-20146.6K
euc-jp-2007.ucm03-Jun-2014291.1K
gb18030.ucm03-Jun-2014657.6K
gsm-03.38-2000.ucm03-Jun-20144.7K
ibm-1006_P100-1995.ucm03-Jun-20147.5K
ibm-1025_P100-1995.ucm03-Jun-20146.2K
ibm-1026_P100-1995.ucm03-Jun-20146.2K
ibm-1047_P100-1995.ucm03-Jun-20146.2K
ibm-1051_P100-1995.ucm03-Jun-20146.2K
ibm-1089_P100-1995.ucm03-Jun-20147.7K
ibm-1097_P100-1995.ucm03-Jun-20147.5K
ibm-1098_P100-1995.ucm03-Jun-20148.1K
ibm-1112_P100-1995.ucm03-Jun-20146.2K
ibm-1122_P100-1999.ucm03-Jun-20146.2K
ibm-1123_P100-1995.ucm03-Jun-20146.2K
ibm-1124_P100-1996.ucm03-Jun-20146.2K
ibm-1125_P100-1997.ucm03-Jun-20146.8K
ibm-1129_P100-1997.ucm03-Jun-20146.2K
ibm-1130_P100-1997.ucm03-Jun-20146.2K
ibm-1131_P100-1997.ucm03-Jun-20146.8K
ibm-1132_P100-1998.ucm03-Jun-20145.8K
ibm-1133_P100-1997.ucm03-Jun-20145.8K
ibm-1137_P100-1999.ucm03-Jun-20146.1K
ibm-1140_P100-1997.ucm03-Jun-20146.2K
ibm-1141_P100-1997.ucm03-Jun-20146.2K
ibm-1142_P100-1997.ucm03-Jun-20146.2K
ibm-1143_P100-1997.ucm03-Jun-20146.2K
ibm-1144_P100-1997.ucm03-Jun-20146.2K
ibm-1145_P100-1997.ucm03-Jun-20146.2K
ibm-1146_P100-1997.ucm03-Jun-20146.2K
ibm-1147_P100-1997.ucm03-Jun-20146.2K
ibm-1148_P100-1997.ucm03-Jun-20146.2K
ibm-1149_P100-1997.ucm03-Jun-20146.2K
ibm-1153_P100-1999.ucm03-Jun-20146.2K
ibm-1154_P100-1999.ucm03-Jun-20146.2K
ibm-1155_P100-1999.ucm03-Jun-20146.2K
ibm-1156_P100-1999.ucm03-Jun-20146.2K
ibm-1157_P100-1999.ucm03-Jun-20146.2K
ibm-1158_P100-1999.ucm03-Jun-20146.2K
ibm-1160_P100-1999.ucm03-Jun-20146.2K
ibm-1162_P100-1999.ucm03-Jun-20146.1K
ibm-1164_P100-1999.ucm03-Jun-20146.2K
ibm-1168_P100-2002.ucm03-Jun-20146.2K
ibm-1250_P100-1995.ucm03-Jun-20146.2K
ibm-1251_P100-1995.ucm03-Jun-20146.2K
ibm-1252_P100-2000.ucm03-Jun-20146.2K
ibm-1253_P100-1995.ucm03-Jun-20146.2K
ibm-1254_P100-1995.ucm03-Jun-20146.2K
ibm-1255_P100-1995.ucm03-Jun-20145.9K
ibm-1256_P110-1997.ucm03-Jun-20148.6K
ibm-1257_P100-1995.ucm03-Jun-20146.1K
ibm-1258_P100-1997.ucm03-Jun-20146.2K
ibm-12712_P100-1998.ucm03-Jun-20145.7K
ibm-1276_P100-1995.ucm03-Jun-20145.5K
ibm-1363_P110-1997.ucm03-Jun-2014342.3K
ibm-1363_P11B-1998.ucm03-Jun-2014342K
ibm-1364_P110-2007.ucm03-Jun-2014386.3K
ibm-1371_P100-1999.ucm03-Jun-2014400.5K
ibm-1373_P100-2002.ucm03-Jun-2014388.1K
ibm-1375_P100-2007.ucm03-Jun-2014420.9K
ibm-1383_P110-1999.ucm03-Jun-2014177.1K
ibm-1386_P100-2001.ucm03-Jun-2014472.6K
ibm-1388_P103-2001.ucm03-Jun-2014637.4K
ibm-1390_P110-2003.ucm03-Jun-2014437.1K
ibm-1399_P110-2003.ucm03-Jun-2014437.3K
ibm-16684_P110-2003.ucm03-Jun-2014433.4K
ibm-16804_X110-1999.ucm03-Jun-20147.4K
ibm-273_P100-1995.ucm03-Jun-20146.2K
ibm-277_P100-1995.ucm03-Jun-20146.2K
ibm-278_P100-1995.ucm03-Jun-20146.2K
ibm-280_P100-1995.ucm03-Jun-20146.2K
ibm-284_P100-1995.ucm03-Jun-20146.2K
ibm-285_P100-1995.ucm03-Jun-20146.2K
ibm-290_P100-1995.ucm03-Jun-20145.7K
ibm-297_P100-1995.ucm03-Jun-20146.2K
ibm-33722_P120-1999.ucm03-Jun-2014191.3K
ibm-33722_P12A_P12A-2004_U2.ucm03-Jun-2014191.2K
ibm-33722_P12A_P12A-2009_U2.ucm03-Jun-2014192.9K
ibm-37_P100-1995.ucm03-Jun-20146.2K
ibm-420_X120-1999.ucm03-Jun-20147.6K
ibm-424_P100-1995.ucm03-Jun-20145.6K
ibm-437_P100-1995.ucm03-Jun-20146.8K
ibm-4517_P100-2005.ucm03-Jun-20147.7K
ibm-4899_P100-1998.ucm03-Jun-20144.2K
ibm-4909_P100-1999.ucm03-Jun-20146.1K
ibm-4971_P100-1999.ucm03-Jun-20146.1K
ibm-500_P100-1995.ucm03-Jun-20146.2K
ibm-5012_P100-1999.ucm03-Jun-20145.6K
ibm-5123_P100-1999.ucm03-Jun-20145.8K
ibm-5346_P100-1998.ucm03-Jun-20146.2K
ibm-5347_P100-1998.ucm03-Jun-20146.2K
ibm-5348_P100-1997.ucm03-Jun-20146.2K
ibm-5349_P100-1998.ucm03-Jun-20146.2K
ibm-5350_P100-1998.ucm03-Jun-20146.2K
ibm-5351_P100-1998.ucm03-Jun-20145.9K
ibm-5352_P100-1998.ucm03-Jun-20148.6K
ibm-5353_P100-1998.ucm03-Jun-20146.1K
ibm-5354_P100-1998.ucm03-Jun-20146.2K
ibm-5471_P100-2006.ucm03-Jun-2014382.7K
ibm-5478_P100-1995.ucm03-Jun-2014146.2K
ibm-720_P100-1997.ucm03-Jun-20146.1K
ibm-737_P100-1997.ucm03-Jun-20146.8K
ibm-775_P100-1996.ucm03-Jun-20146.8K
ibm-803_P100-1999.ucm03-Jun-20144K
ibm-813_P100-1995.ucm03-Jun-20146.1K
ibm-838_P100-1995.ucm03-Jun-20146.2K
ibm-8482_P100-1999.ucm03-Jun-20145.8K
ibm-850_P100-1995.ucm03-Jun-20146.8K
ibm-851_P100-1995.ucm03-Jun-20146.8K
ibm-852_P100-1995.ucm03-Jun-20146.8K
ibm-855_P100-1995.ucm03-Jun-20146.8K
ibm-856_P100-1995.ucm03-Jun-20146.1K
ibm-857_P100-1995.ucm03-Jun-20146.7K
ibm-858_P100-1997.ucm03-Jun-20146.8K
ibm-860_P100-1995.ucm03-Jun-20146.8K
ibm-861_P100-1995.ucm03-Jun-20146.8K
ibm-862_P100-1995.ucm03-Jun-20146.8K
ibm-863_P100-1995.ucm03-Jun-20146.8K
ibm-864_X110-1999.ucm03-Jun-20148.2K
ibm-865_P100-1995.ucm03-Jun-20146.8K
ibm-866_P100-1995.ucm03-Jun-20146.8K
ibm-867_P100-1998.ucm03-Jun-20146.7K
ibm-868_P100-1995.ucm03-Jun-20148.1K
ibm-869_P100-1995.ucm03-Jun-20146.7K
ibm-870_P100-1995.ucm03-Jun-20146.2K
ibm-871_P100-1995.ucm03-Jun-20146.2K
ibm-874_P100-1995.ucm03-Jun-20145.7K
ibm-875_P100-1995.ucm03-Jun-20146.1K
ibm-878_P100-1996.ucm03-Jun-20146.2K
ibm-9005_X110-2007.ucm03-Jun-20146.2K
ibm-901_P100-1999.ucm03-Jun-20147.2K
ibm-902_P100-1999.ucm03-Jun-20147.3K
ibm-9067_X100-2005.ucm03-Jun-20146.2K
ibm-912_P100-1995.ucm03-Jun-20147.3K
ibm-913_P100-2000.ucm03-Jun-20146.1K
ibm-914_P100-1995.ucm03-Jun-20146.2K
ibm-915_P100-1995.ucm03-Jun-20146.2K
ibm-916_P100-1995.ucm03-Jun-20145.6K
ibm-918_P100-1995.ucm03-Jun-20147.5K
ibm-920_P100-1995.ucm03-Jun-20146.2K
ibm-921_P100-1995.ucm03-Jun-20146.2K
ibm-922_P100-1999.ucm03-Jun-20147.3K
ibm-923_P100-1998.ucm03-Jun-20146.2K
ibm-930_P120-1999.ucm03-Jun-2014234.1K
ibm-933_P110-1995.ucm03-Jun-2014215.3K
ibm-935_P110-1999.ucm03-Jun-2014187.2K
ibm-937_P110-1999.ucm03-Jun-2014400.6K
ibm-939_P120-1999.ucm03-Jun-2014234.2K
ibm-942_P12A-1999.ucm03-Jun-2014185.4K
ibm-943_P130-1999.ucm03-Jun-2014194.3K
ibm-943_P15A-2003.ucm03-Jun-2014192.4K
ibm-9447_P100-2002.ucm03-Jun-20146K
ibm-9448_X100-2005.ucm03-Jun-20149.2K
ibm-9449_P100-2002.ucm03-Jun-20146.1K
ibm-949_P110-1999.ucm03-Jun-2014203K
ibm-949_P11A-1999.ucm03-Jun-2014202.7K
ibm-950_P110-1999.ucm03-Jun-2014401.1K
ibm-954_P101-2007.ucm03-Jun-2014324.9K
ibm-964_P110-1999.ucm03-Jun-2014512.4K
ibm-970_P110_P110-2006_U2.ucm03-Jun-2014169.6K
ibm-971_P100-1995.ucm03-Jun-2014165.2K
icu-internal-25546.ucm03-Jun-2014169K
icu-internal-compound-d1.ucm03-Jun-201471.1K
icu-internal-compound-d2.ucm03-Jun-2014132.6K
icu-internal-compound-d3.ucm03-Jun-201466.4K
icu-internal-compound-d4.ucm03-Jun-201488K
icu-internal-compound-d5.ucm03-Jun-20145K
icu-internal-compound-d6.ucm03-Jun-201467.8K
icu-internal-compound-d7.ucm03-Jun-201441.8K
icu-internal-compound-s1.ucm03-Jun-2014656
icu-internal-compound-s2.ucm03-Jun-20141.7K
icu-internal-compound-s3.ucm03-Jun-20143.5K
icu-internal-compound-t.ucm03-Jun-2014898.8K
iso-8859_10-1998.ucm03-Jun-20145.8K
iso-8859_11-2001.ucm03-Jun-20145.7K
iso-8859_14-1998.ucm03-Jun-20145.8K
iso-ir-165.ucm03-Jun-2014164.8K
jisx-208.ucm03-Jun-2014146.9K
jisx-212.ucm03-Jun-2014119.6K
kddi-jisx-208-2007.ucm03-Jun-2014155.5K
kddi-shift_jis-2012.ucm03-Jun-2014191.2K
lmb-excp.ucm03-Jun-20149.2K
macos-0_2-10.2.ucm03-Jun-20145.9K
macos-29-10.2.ucm03-Jun-20145.6K
macos-35-10.2.ucm03-Jun-20145.8K
macos-6_2-10.4.ucm03-Jun-20145.8K
macos-7_3-10.2.ucm03-Jun-20145.6K
noop-cns-11643.ucm03-Jun-20141.1K
noop-gb2312_gl.ucm03-Jun-20141.2K
noop-iso-ir-165.ucm03-Jun-20141K
README.emoji03-Jun-20145K
softbank-jisx-208-2007.ucm03-Jun-2014153.2K
softbank-shift_jis-2012.ucm03-Jun-2014177K
ucmcore.mk03-Jun-20141.1K
ucmebcdic.mk03-Jun-20141.7K
ucmfiles.mk03-Jun-20143.7K
ucmlocal.mk03-Jun-2014507
windows-874-2000.ucm03-Jun-20147.4K
windows-936-2000.ucm03-Jun-2014479.3K
windows-949-2000.ucm03-Jun-2014347.8K
windows-950-2000.ucm03-Jun-2014397.9K

README.emoji

      1 == Notes on {kddi,docomo,softbank}-*.ucm mappings.
      2 
      3 kddi-jisx-208 is a variant of JIS X 208 used by KDDI, a Japanese cell
      4 phone carrier.
      5 
      6 kddi-shift_jis, docomo-shift_jis, and softbank-shift_jis are variants
      7 of Shift_JIS used by KDDI, DoCoMo and SoftBank.
      8 
      9   - kddi-jisx-208 contains Emoji (emoticon) code points in
     10     0x75xx, 0x76xx, 0x77xx, 0x78xx, 0x79xx, 0x7Axx, 0x7Bxx,
     11     where xx means 21-7E.
     12 
     13   - kddi-shift_jis contains Emoji code points in
     14     0xEBxx, 0xECxx, 0xEDxx, and 0xEExx, 0xF3xx, 0xF4xx, 0xF6xx, 0xF7xx,
     15     where xx means 40-7E, 80-FC.
     16 
     17   - docomo-shift_jis contains Emoji code points in
     18     0xF8xx, and 0xF9xx, where xx means 40-7E, 80-FC.
     19 
     20   - softbank-shift_jis contains Emoji code points in
     21     0xF7xx, 0xF9xx, and 0xFBxx, where xx means 40-7E, 80-FC.
     22 
     23   - softbank-jisx-208 contains Emoji code points in
     24     0x75xx, 0x76xx, 0x77xx, 0x78xx, 0x79xx, 0x7Axx, 0x7Bxx, 0x7Dxx
     25     where xx means 21-7E.
     26 
     27 
     28 == How the -2012.ucm tables were modified in April 2013
     29 
     30 The -2012 versions were created by
     31     http://code.google.com/p/emoji4unicode/source/browse/trunk/src/gen_conversion_files.py
     32 
     33 using each of the older 2012 versions as the base table files
     34 to avoid non-Emoji changes:
     35 
     36 # gen_google_ucm.sh
     37 icu_mappings=/google/src/cloud/mscherer/icubranch/google_vendor_src_branch/icu/source/data/mappings
     38 dest=/home/mscherer/www/no_crawl/emoji
     39 ./gen_conversion_files.py $icu_mappings/docomo-shift_jis-2012.ucm
     40 cp ../generated/docomo-shift_jis-2012.ucm $dest
     41 ./gen_conversion_files.py $icu_mappings/kddi-shift_jis-2012.ucm
     42 cp ../generated/kddi-shift_jis-2012.ucm $dest
     43 ./gen_conversion_files.py $icu_mappings/softbank-shift_jis-2012.ucm
     44 cp ../generated/softbank-shift_jis-2012.ucm $dest
     45 ./gen_conversion_files.py
     46 
     47 The only differences from 2012-sep are in mappings for symbols
     48 that have Unicode Variation Selector (VS) sequences.
     49 
     50 The older tables relied on a hack in the ICU conversion code that
     51 ignored the "use fallback" flag for fallbacks from sequences with VS.
     52 
     53 The new tables rely on a new feature in ICU4C 51:
     54 For the relevant symbols that have roundtrip mappings,
     55 - the mappings with Emoji Variation Selector
     56   use the |0 roundtrip precision
     57 - the other mappings (no VS & text VS)
     58   use the |4 "good one-way" precision
     59 
     60 See http://bugs.icu-project.org/trac/ticket/9602
     61 
     62 == How the -2012.ucm tables were created in September 2012
     63 
     64 The 2012 versions were created by
     65     http://code.google.com/p/emoji4unicode/source/browse/trunk/src/gen_conversion_files.py
     66 
     67 using each of the 2007 versions as the base table files
     68 to avoid non-Emoji changes:
     69 
     70 icu_mappings=~/p4/emoji/google_vendor_src_branch/icu/source/data/mappings
     71 dest=~/www/no_crawl/emoji
     72 ./gen_conversion_files.py $icu_mappings/docomo-shift_jis-2007.ucm
     73 cp ../generated/docomo-shift_jis-2012.ucm $dest
     74 ./gen_conversion_files.py $icu_mappings/kddi-shift_jis-2007.ucm
     75 cp ../generated/kddi-shift_jis-2012.ucm $dest
     76 ./gen_conversion_files.py $icu_mappings/softbank-shift_jis-2007.ucm
     77 cp ../generated/softbank-shift_jis-2012.ucm $dest
     78 ./gen_conversion_files.py
     79 
     80 The emoji4unicode code uses the mappings that were established during the
     81 Unicode Emoji standardization process.
     82 The new conversion tables round-trip carrier Emoji symbol codes
     83 to and from Unicode 6 standard code points
     84 and also include fallback mappings from the Google PUA code points
     85 to the carrier codes.
     86 
     87 The trailing "|0" etc. on the mapping table lines specify the mapping type:
     88     |0  round-trip              Unicode <-> charset
     89     |1  fallback                Unicode  -> charset
     90     |3  "reverse fallback"      Unicode <-  charset
     91 
     92 For details about the .ucm file format see
     93 http://userguide.icu-project.org/conversion/data#TOC-.ucm-File-Format
     94 
     95 == How the -2007.ucm tables were created
     96 
     97 So far, we haven't obtained "official" conversion tables from the cell
     98 phone carriers.  However, we empirically know their clients support
     99 VDCs in MS932, like U2460 (CIRCLED DIGIT ONE), etc.  Hence we use
    100 MS932 as the base table for them.
    101 
    102 kddi-jisx-208-2007.ucm is based on jisx-208.ucm in this directory.
    103 The original table's mappings to codes 0x75xx to 0x7Bxx are excluded
    104 to avoid collisions with emoji.
    105 
    106 kddi-shift_jis-2007.ucm is based on windows-932-2000.ucm.
    107 The original table's mappings to codes 0xEBxx to 0xEExx, and 0xF0xx to
    108 0xF90xx (EUDC block), are excluded to avoid collisions with emoji.
    109 
    110 docomo-shift_jis-2007.ucm is based on windows-932-2000.ucm.
    111 The original table's mappings to codes 0xF0xx to 0xF90xx (EUDC block)
    112 are excluded to avoid collisions with emoji.
    113 
    114 softbank-shift_jis-2007.ucm is based on windows-932-2000.ucm.
    115 The original table's mappings to codes 0xF0xx to 0xF90xx (EUDC block),
    116 and 0xFBxx, are excluded to avoid collisions with emoji.
    117 
    118 softbank-jisx-208-2007.ucm is based on jisx-208.ucm in this directory.
    119 The original table's mappings to codes 0x75xx to 0x7Bxx, and 0x7Dxx
    120 are excluded to avoid collisions with emoji.
    121 
    122 == Google Standard Emoji Unicode Mapping
    123 
    124 The Google standard emoji Unicode mapping can be found at:
    125 
    126   /home/build/google3/i18n/encodings/emoji/emoji_unicode_mapping.txt
    127 
    128 
    129 
    130 TODO(mscherer): Use <icu:base> to share most standard JIS mappings
    131 among *-shift_jis-2007.ucm files.
    132