Home | History | Annotate | Download | only in charsetdet
      1 <?xml version="1.0" encoding="UTF-8"?>
      2 
      3 <!-- Copyright (C) 2016 and later: Unicode, Inc. and others. -->
      4 <!-- License & terms of use: http://www.unicode.org/copyright.html#License -->
      5 <!-- Copyright (c) 2005-2015 IBM Corporation and others. All rights reserved -->
      6 <!-- See individual test cases for their specific copyright. -->
      7 
      8 <charset-detection-tests>
      9     <test-case id="IUC10-ar" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-6/ar windows-1256/ar">
     10     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
     11 
     12     ,   +  :
     13        
     14     IUC10
     15           ,    10-12  1997  ,
     16     .             ,  ,
     17                     
     18     , ,     . 
     19 
     20     Unicode
     21         ,    
     22 
     23     </test-case>
     24 
     25     <test-case id="IUC10-da-Q" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE windows-1252/da">
     26     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
     27 
     28     Europa, Software + Internet:
     29     Bliv global med Unicode
     30     IUC10
     31     Indskriv dig nu til den tiende internationale Unicode-konference, der holdes den 10-12
     32     marts 1997 i Mainz, Tyskland. Konferencen samler eksperter fra hele verden inden for det
     33     globale Internet og Unicode, internationalisering og lokalisering, implementering af
     34     Unicode i styresystemer og programmer, skrifttyper, tekst-layout og flersproget databehandling.
     35 
     36     Unicode
     37     Nr verden vil tale, taler den Unicode.
     38 
     39     </test-case>
     40 
     41     <test-case id="IUC10-da" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-1/da">
     42     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
     43 
     44     Europa, Software + Internet:
     45     Bliv global med Unicode
     46     IUC10
     47     Indskriv dig nu til den tiende internationale Unicode-konference, der holdes den 10-12
     48     marts 1997 i Mainz, Tyskland. Konferencen samler eksperter fra hele verden inden for det
     49     globale Internet og Unicode, internationalisering og lokalisering, implementering af
     50     Unicode i styresystemer og programmer, skrifttyper, tekst-layout og flersproget databehandling.
     51 
     52     Unicode
     53     Nr verden vil tale, taler den Unicode.
     54 
     55     </test-case>
     56 
     57     <test-case id="IUC10-de" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-1/de">
     58     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
     59 
     60     Europa, Software + das Internet:
     61     International mit Unicode
     62     IUC10
     63     Melden Sie sich jetzt fr die 10. Internationale Unicode Konferenz an, die in der Zeit vom 10.-12. Mrz 1997 in
     64     Mainz stattfinden wird. Die Konferenz ist ein Treffpunkt fr Betriebsexperten aus den Bereichen globales
     65     Internet und Unicode, Internationalisierung und Lokalisierung, die Implementierung von Unicode in
     66     Betriebssystemen und Programmen, sowie fr Schriftarten, Textlayout und mehrsprachige Computeranwendungen.
     67 
     68     Unicode
     69     Wenn die Welt miteinander spricht, spricht sie Unicode.
     70 
     71     </test-case>
     72 
     73     <!-- No UTF-8 in this test because there are no non-ASCII characters. -->
     74     <test-case id="IUC10-en" encodings="UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-1/en">
     75     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
     76 
     77     Europe, Software + the Internet:
     78     Going Global with Unicode
     79     IUC10
     80     Register now for the Tenth International Unicode Conference, to be held on March 10-12, 1997,
     81     in Mainz, Germany. The Conference will bring together industry-wide experts on the global Internet and
     82     Unicode, internationalization and localization, implementation of Unicode in operating systems and applications,
     83     fonts, text layout, and multilingual computing.
     84 
     85     Unicode
     86     When the world wants to talk, it speaks Unicode.
     87 
     88     </test-case>
     89 
     90     <test-case id="IUC10-es" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-1/es">
     91     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
     92 
     93     Europa, Software + el Internet:
     94     Mundializando con Unicode
     95     IUC10
     96     Inscrbase ahora para la Dcima Conferencia Internacional Unicode, que tendr lugar del 10 al 12 de marzo de
     97     1997 en Maguncia, Alemania. La Conferencia reunir expertos de los sectores de la mundializacin del Internet y
     98     Unicode, la internacionalizacin y localizacin, implementacin de Unicode en sistemas operativos y aplicaciones,
     99     tipos, composicin de texto e informtica multilinge.
    100 
    101     Unicode
    102     Cuando el mundo quiere conversar, habla Unicode.
    103 
    104     </test-case>
    105 
    106     <test-case id="IUC10-fr" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-1/fr">
    107     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    108 
    109     L'Europe, le logiciel et l'Internet :
    110     la mondialisation avec Unicode
    111     IUC10
    112     Inscrivez-vous ds maintenant  la dixime Confrence internationale sur Unicode, qui se tiendra du 10 au 12
    113     mars 1997  Mayence, en Allemagne. Cette confrence rassemblera des experts de tous les horizons industriels
    114     sur les sujets suivants : l'Internet mondial et Unicode, l'internationalisation et l'adaptation locale,
    115     l'implmentation d'Unicode dans les systmes d'exploitation et les applications, les polices de caractres,
    116     la disposition de texte, l'informatique plurilingue.
    117 
    118     Unicode
    119     Quand le monde veut communiquer, il parle en Unicode.
    120 
    121     </test-case>
    122 
    123     <test-case id="IUC10-he" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-8-I/he">
    124     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    125 
    126     ,  :
    127     Unicode   
    128     IUC10
    129        Unicode  ,    12-10  1997,  . 
    130             -Unicode,    , 
    131     Unicode   , ,    -.
    132 
    133     Unicode
    134        ,   -Unicode
    135 
    136     </test-case>
    137 
    138     <test-case id="IUC10-he-Q" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE windows-1255/he">
    139     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    140 
    141     ,  :
    142     Unicode   
    143     IUC10
    144        Unicode  ,    12-10  1997,  . 
    145             -Unicode,    , 
    146     Unicode   , ,    -.
    147 
    148     Unicode
    149        ,   -Unicode.
    150 
    151     </test-case>
    152 
    153     <test-case id="IUC10-hu" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-2/hu">
    154     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    155 
    156     Eurpa, a Szoftver s az Internet -
    157     Globliss Vltozik a Unicode ltal
    158     IUC10
    159     Iratkozzon mr most a Tizedik Nemzetkzi Unicode Konferencira, amely Mrcius 10-12 1997
    160     lesz megtartva, Meinz-be, Nmetorszgba. Ebben a Konferencin az iparg szerte sok szakrt
    161     fog rszt venni: a globlis Internet s Unicode nemzetkzistse s lokalizlsa, a
    162     Unicode beteljestse a mkd rendszerekben s alkalmazsokban, fontokba, szveg
    163     trbeosztsba s tbbnyelv computerekben.
    164 
    165     Unicode
    166     Ha a vilg beszlni akar, azt Unicode-ul mondja.
    167 
    168     </test-case>
    169 
    170     <test-case id="IUC10-hu-Q" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE windows-1250/hu">
    171     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    172 
    173     Eurpa, a Szoftver s az Internet -
    174     Globliss Vltozik a Unicode ltal
    175     IUC10
    176     Iratkozzon mr most a Tizedik Nemzetkzi Unicode Konferencira, amely Mrcius 10-12 1997
    177     lesz megtartva, Meinz-be, Nmetorszgba. Ebben a Konferencin az iparg szerte sok szakrt
    178     fog rszt venni: a globlis Internet s Unicode nemzetkzistse s lokalizlsa, a
    179     Unicode beteljestse a mkd rendszerekben s alkalmazsokban, fontokba, szveg
    180     trbeosztsba s tbbnyelv computerekben.
    181 
    182     Unicode
    183     Ha a vilg beszlni akar, azt Unicode-ul mondja.
    184 
    185     </test-case>
    186 
    187     <test-case id="IUC10-it" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-1/it">
    188     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    189 
    190     Europa, software e Internet:
    191     Globalizzazione con Unicode
    192     IUC10
    193     Iscrivetevi subito alla X Conferenza Internazionale su Unicode, che si terr dal 10 al 12 marzo 1997 a
    194     Mainz in Germania. Alla Conferenza parteciperanno esperti di tutti i settori per discutere di Internet globale e
    195     Unicode, internazionalizzazione e localizzazione, implementazione di Unicode in sistemi operativi e applicazioni,
    196     caratteri, composizione dei testi ed elaborazione multilingue.
    197 
    198     Unicode
    199     Quando il mondo vuole comunicare, parla Unicode.
    200 
    201     </test-case>
    202 
    203     <!-- No EUC-JP in this test because it detects as GB18030 -->
    204     <test-case id="IUC10-jp" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE Shift_JIS/ja ISO-2022-JP">
    205     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    206 
    207     :
    208     Unicode 
    209     IUC10
    210      10  Unicode  1997  3  10-12
    211     UnicodeOS 
    212     Unicode 
    213 
    214     Unicode
    215     Unicode 
    216 
    217     </test-case>
    218 
    219     <test-case id="IUC10-ko" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE EUC-KR/ko ISO-2022-KR">
    220     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    221 
    222     ,   :
    223       
    224     IUC10
    225     10    1997 3 10 12   .  .
    226               . -  ,  ,
    227          , ,  ,  .
    228 
    229     Unicode
    230       ,  
    231 
    232     </test-case>
    233 
    234     <!-- No UTF-8 in this test because there are no non-ASCII characters. -->
    235     <test-case id="IUC10-nl" encodings="UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-1/nl">
    236     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    237 
    238     Europa, Software + het Internet:
    239     wereldwijd met Unicode
    240     IUC10
    241     Meld u nu aan voor de Tiende Internationale Unicode-conferentie, die van 10 tot 12 maart 1997 in
    242     Mainz (Duitsland) wordt gehouden. De Conferentie is een ontmoetingsplaats voor experts uit de industrie op het
    243     gebied van het wereldwijde Internet en Unicode, internationalisatie en localisatie, implementatie van Unicode in
    244     besturingssystemen en applicaties, lettertypes, tekstopmaak en meertalig computergebruik.
    245 
    246     Unicode
    247     Als de wereld wil praten, spreekt hij Unicode.
    248 
    249     </test-case>
    250 
    251     <!-- No language for ISO-8859-1 in this test because no-NO is recogonized as Danish... -->
    252     <test-case id="IUC10-no-NO" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-1/da">
    253     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    254 
    255     Europa, Programvare og Internet:
    256     Global fokus med Unicode
    257     IUC10
    258     Registrer deg som deltager p den tiende inernasjonale Unicode konferansen i Mainz, Tyskland, fra 10. til 12. mars,
    259     1997. Konferansen vil samle eksperter p Internet, Unicode, internasjonalisering og integrasjon av Unicode i
    260     operativsystemer og programmer, fonter, tekst layout og flersprklig databehandling.
    261 
    262     Unicode
    263     Nr verden vil snakke, snakker den Unicode
    264 
    265     </test-case>
    266 
    267     <test-case id="IUC10-no-NO-NY" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-1/no">
    268     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    269 
    270     Europa, programvare og Internett:
    271     Femn verda med Unicode
    272     IUC10
    273     Meld deg p den 10. internasjonale Unicode-konferansen. Han gr fre seg i Mainz i Tyskland i dagane 10.--12. mars
    274     1997, og samlar fagkunnige innan konferansetemaet fr heile databransjen. Tema: Det globale Internettet og
    275     Unicode, internasjonalisering og nasjonal tilpassing, implementering av Unicode i operativsystem og brukarprogram,
    276     skriftsnitt (fontar), tekstutlegg, og fleirsprkleg databehandling.
    277 
    278     Unicode
    279     Nr verda nskjer  snakke, talar ho Unicode
    280 
    281     </test-case>
    282 
    283     <test-case id="IUC10-pt-BR" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-1/pt">
    284     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    285 
    286     Europa, Software e a Internet:
    287     Globalizao com o Unicode
    288     IUC10
    289     Inscreva-se agora para a Dcima Conferncia Internacional Sobre O Unicode, realizada entre os dias 10 e 12 de
    290     maro de 1997 em Mainz na Alemanha. A Conferncia reunir peritos de todas as reas da indstria especializados
    291     em assuntos relacionados com a Internet global e o Unicode, internacionalizao e localizao de software,
    292     implementao do Unicode em sistemas operacionais e aplicativos, fontes, layout de texto e informtica multilnge.
    293 
    294     Unicode
    295     Quando o mundo quer falar, fala Unicode.
    296 
    297     </test-case>
    298 
    299     <test-case id="IUC10-pt-PT" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-1/pt">
    300     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    301 
    302     Europa, Software e a Internet:
    303     Globalizao com o Unicode
    304     IUC10
    305     Inscreva-se agora para a Dcima Conferncia Internacional Sobre O Unicode, a ser realizada entre os dias 10 e 12
    306     de Maro de 1997 em Mainz na Alemanha. A Conferncia reunir peritos de todas as reas da indstria
    307     especializados em assuntos relacionados com a Internet global e o Unicode, internacionalizao e localizao de
    308     software, implementao do Unicode em sistemas operativos e aplicaes, tipos de letra, esquematizao de
    309     texto e informtica multilngue.
    310 
    311     Unicode
    312     Quando o mundo quer falar, fala Unicode.
    313 
    314     </test-case>
    315 
    316     <test-case id="IUC10-ro" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-2/ro">
    317     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    318 
    319     Europa, Software i Internet:
    320     Globalizarea cu Unicode
    321     IUC10
    322     Inscriei-v acum la a Zecea Conferin Internaional "Unicode" ce va avea loc in
    323     perioada de 10-12 martie, 1997 n Mainz, Germania. Conferina va ntruni experi din
    324     variate domenii: Internet global i Unicode, internaionalizare i localizare,
    325     implementarede Unicode n sisteme de operare i aplicaii, fonturi, aranjare de text n
    326     pagin, computerizare multilingual.
    327 
    328     Unicode
    329     Cnd lumea vrea s comunice, vorbete Unicode.
    330 
    331     </test-case>
    332 
    333     <test-case id="IUC10-ru" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-5/ru windows-1251/ru KOI8-R/ru">
    334     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    335 
    336     ,   + :
    337     Unicode   
    338     IUC10
    339            Unicode,  
    340     10-12  1997     .       
    341         Unicode,   , 
    342      Unicode       ,
    343     ,     .
    344 
    345     Unicode
    346        ,    Unicode.
    347 
    348     </test-case>
    349 
    350     <test-case id="IUC10-sv" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-1/sv">
    351     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    352 
    353     Europa, programvara och Internet:
    354     globalisera med Unicode
    355     IUC10
    356     Anml Dig till den tionde internationella Unicode-konferensen, som hlls den 10-12 mars 1997 i Mainz,
    357     Tyskland. Vid konferensen kommer experter inom fljande omrden att delta: det globala Internet och Unicode,
    358     internationalisering och lokalisering, implementering av Unicode i operativsystem, tillmpningar, typsnitt,
    359     textlayout och mngsprklig datoranvndning.
    360 
    361     Unicode
    362     Nr vrlden vill tala, s talar den Unicode.
    363 
    364     </test-case>
    365 
    366     <test-case id="IUC10-yi" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE">
    367     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    368 
    369     :    :
    370         
    371     IUC10
    372            -,    
    373     10  12 , 1997,  , .       ,
    374      ,     ,    - 
    375     , , -,   .
    376 
    377     Unicode
    378         ,   
    379 
    380     </test-case>
    381 
    382     <test-case id="IUC10-zh-Hant" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE Big5/zh">
    383     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    384 
    385     
    386     Unicode
    387     IUC10
    388     Mainz
    389     
    390     
    391 
    392     Unicode
    393     Unicode
    394 
    395     </test-case>
    396 
    397     <test-case id="IUC10-zh-Hans" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-2022-CN//noroundtrip GB18030/zh">
    398     <bytes encoding="ISO-2022-CN">\n\ \ \ \ \n\ \ \ \ \n\n\ \ \ \ \x1b$)A\x0eE7V^#,Hm\x3c~#+;%A*Mx\x0f\n\ \ \ \ \x1b$)A\x0eSCM3R;Bk\x0f\ (Unicode)\ \x0eW_1iJ@=g\x0f\n\ \ \ \ IUC10\n\ \ \ \ \x1b$)A\x0e=+SZ\x0f1997\x0eDj\x0f\ 3\ \x0eTB\x0f10\x0eHU#-\x0f12\x0eHUTZ5B9z\x0f\ Mainz\ \x0eJP\x3eYPP5D5ZJ.=lM3R;Bk9z\x3cJQPLV;aOVTZ?*J\x3cW"2a!#\x0f\n\ \ \ \ \x1b$)A\x0e1\x3e4N;aRi=+;c\x3c/8w7=Cf5DW(\x3cR!#If\x3c05DAlSr0|@(#:9z\x3cJ;%A*Mx:MM3R;Bk#,9z\x3cJ;/:M1\x3e5X;/#,\x0f\n\ \ \ \ \x1b$)A\x0eM3R;BkTZ2YWwO5M3:MS\x26SCHm\x3c~VP5DJ5OV#,WVPM#,ND1\x3e8qJ=RT\x3c06`NDVV\x3cFKc5H!#\x0f\n\n\ \ \ \ Unicode\n\ \ \ \ \x1b$)A\x0e51J@=gPhR*95M(J1#,GkSC\x0fUnicode\x0e#!\x0f\n\nConference\ Program\n\ \ \ \ </bytes>
    399     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    400 
    401     
    402      (Unicode) 
    403     IUC10
    404     1997 3 1012 Mainz 
    405     
    406     
    407 
    408     Unicode
    409     Unicode
    410 
    411 Conference Program
    412     </test-case>
    413 
    414     <test-case id="WIU-cz" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-2/cs">
    415     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    416 
    417     Co je Unicode?
    418 
    419     Unicode piazuje kadmu znaku jedinen slo,
    420     nezvisle na platform,
    421     nezvisle na programu,
    422     nezvisle na jazyce.
    423 
    424     Potae, ze sv podstaty, pracuj pouze s sly. Psmena a dal znaky ukldaj tak, e kadmu z nich
    425     piad slo. Ped vznikem Unicode existovaly stovky rozdlnch kdovacch systm pro piazovn tchto
    426     sel. dn z tchto kdovn nemohlo obsahovat dostatek znak: napklad Evropsk unie sama potebuje
    427     nkolik rznch kdovn, aby pokryla vechny sv jazyky. Dokonce i pro jeden jedin jazyk, jako je anglitina,
    428     nevyhovovalo dn kdovn pro vechny psmena, interpunkci a bn pouvan technick symboly.
    429 
    430     Tyto kdovac systmy tak byly v konfliktu jeden s druhm. To znamen, e dv kdovn mohou pouvat
    431     stejn slo pro dva rzn znaky, nebo pouvat rzn sla pro stejn znak. Jakkoli pota (zvlt servery)
    432     mus podporovat mnoho rznch kdovn; pesto, kdykoli jsou data pedvna mezi rznmi kdovnmi nebo
    433     platformami, hroz, e tato data budou pokozena.
    434 
    435     </test-case>
    436 
    437     <test-case id="WIU-el" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-7/el">
    438     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    439 
    440        Unicode;
    441 
    442       Unicode        ,
    443         ,
    444        ,
    445        .
    446 
    447       ,   ,   .   
    448              (   
    449     ).     Unicode,    . 
    450       ,       :  ,
    451                  
    452     - .       ,  ..  ,   
    453           ,       .
    454 
    455      ,      . ,     
    456            ,      
    457       .   (    )     
    458               
    459      ,      .
    460 
    461     </test-case>
    462 
    463     <test-case id="WIU-el-Q" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE windows-1253/el">
    464     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    465 
    466        Unicode;
    467 
    468       Unicode        ,
    469         ,
    470        ,
    471        .
    472 
    473       ,   ,   .   
    474              (   
    475     ).     Unicode,    . 
    476       ,       :  ,
    477                  
    478     - .       ,  ..  ,   
    479           ,       .
    480 
    481      ,      . ,     
    482            ,      
    483       .   (    )     
    484               
    485      ,      .
    486 
    487     </test-case>
    488 
    489     <test-case id="WIU-pl" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-2/pl">
    490     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    491 
    492     Czym jest Unikod ?
    493 
    494     Unikod przypisuje unikalny numer kademu znakowi, niezaleny od uywanej platformy, programu czy jzyka.
    495 
    496     Zasadniczo, komputery rozumiej tylko liczby. Zapisuj litery i inne znaki przypisujc kademu z nich liczb.
    497     Nim powsta Unikod, byo wiele rnych systemw kodowania przypisujcych te liczby. Brakowao jednego,
    498     ktry mgby pomieci wystarczajco du liczb znakw. Przykadowo, sama Unia Europejska potrzebowaa
    499     kilku rnych kodowa, by mc uywa wszystkich uywanych w niej jzykw. Nawet dla pojedynczego jzyka
    500     takiego jak np. angielski brakowao jednego kodowania, ktre byoby odpowiednie dla zaprezentowania
    501     wszystkich liter, znakw przestankowych i popularnych symboli technicznych.
    502 
    503     Innym problemem byo, e kodowania te kolidoway ze sob. Dwa, rne kodowania uyway jednej liczby dla dwu
    504     rnych znakw lub rnych liczb dla tego samego znaku. Wszystkie komputery (midzy innymi serwery) musz
    505     wspiera wszystkie te kodowania, gdy dane przesyane midzy rnymi systemami operacyjnymi zawsze
    506     naraone s na uszkodzenie.
    507 
    508     </test-case>
    509 
    510     <test-case id="WIU-tr" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE ISO-8859-9/tr">
    511     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    512 
    513     Evrensel Kod Nedir?
    514 
    515     Evrensel Kod her yaz karakteri iin bir ve yalnz bir say art koar,
    516     hangi altyap,
    517     hangi yazlm,
    518     hangi dil olursa olsun.
    519 
    520     lke olarak, bilgisayarlar sadece saylarla ilem yaparlar. Kelimelerin ve yaz karakterlerinin her biri iin
    521     birer say atarlar ve byle saklarlar. Evrensel Kod kefedilmeden nce, bu saylar atamak iin birok ifreleme
    522     yntemi vard. Ancak, tm bu dilleri gsterebilecek, rnein; Avrupa Topluluu bnyesindeki tm lkelerin dillerini
    523     kapsayacak bir tek ifreleme yntemi yoktu. Bunun yansra, sadece ngilizcedeki harfleri, noktalama
    524     iaretlerini ve teknik sembolleri kapsayan tek bir ifreleme yntemi de bulunmamaktayd.
    525 
    526     Bu ifreleme yntemleri kendi aralarnda elimektedir. ki farkl ifreleme, ayn sayy iki farkl karaktere
    527     vermi olabilir ya da farkl saylar ayn karekteri kodlayabilir. Bilgisayarlar, zellikle sunucular, birok
    528     ifrelemeyi desteklemek zorundadrlar; veriler, farkl ifreleme ve altyaplardan geerken bozulma riski tarlar.
    529 
    530     </test-case>
    531 
    532     <test-case id="WIU-tr-Q" encodings="UTF-8 UTF-16LE UTF-16BE UTF-32BE UTF-32LE windows-1254/tr">
    533     <!-- Copyright  1991-2005 Unicode, Inc. All rights reserved. -->
    534 
    535     Evrensel Kod Nedir?
    536 
    537     Evrensel Kod her yaz karakteri iin bir ve yalnz bir say art koar,
    538     hangi altyap,
    539     hangi yazlm,
    540     hangi dil olursa olsun.
    541 
    542     lke olarak, bilgisayarlar sadece saylarla ilem yaparlar. Kelimelerin ve yaz karakterlerinin her biri iin
    543     birer say atarlar ve byle saklarlar. Evrensel Kod kefedilmeden nce, bu saylar atamak iin birok ifreleme
    544     yntemi vard. Ancak, tm bu dilleri gsterebilecek, rnein; Avrupa Topluluu bnyesindeki tm lkelerin dillerini
    545     kapsayacak bir tek ifreleme yntemi yoktu. Bunun yansra, sadece ngilizcedeki harfleri, noktalama
    546     iaretlerini ve teknik sembolleri kapsayan tek bir ifreleme yntemi de bulunmamaktayd.
    547 
    548     Bu ifreleme yntemleri kendi aralarnda elimektedir. ki farkl ifreleme, ayn sayy iki farkl karaktere
    549     vermi olabilir ya da farkl saylar ayn karekteri kodlayabilir. Bilgisayarlar, zellikle sunucular, birok
    550     ifrelemeyi desteklemek zorundadrlar; veriler, farkl ifreleme ve altyaplardan geerken bozulma riski tarlar.
    551 
    552     </test-case>
    553     
    554     
    555     <test-case id="bug-10532-utf-16" encodings="UTF-8 UTF-16BE UTF-16LE UTF-32BE UTF-32LE">
    556     foo 
    557     </test-case>
    558     
    559     <test-case id="bug-10532-ASCII" encodings="UTF-8 UTF-16BE UTF-16LE UTF-32BE UTF-32LE">
    560     <!--  Note that plain 7 bit ASCII is detected as UTF-8 -->
    561     ,1,,,5
    562     </test-case>
    563 </charset-detection-tests>
    564