Lines Matching full:code

41  * upper/lower-casing and string comparison in code point order.
43 * ICU uses 16-bit Unicode (UTF-16) in the form of arrays of UChar code units.
44 * UTF-16 encodes each Unicode code point with either one or two UChar code units.
49 * Some APIs accept a 32-bit UChar32 value for a single code point.
53 * Code-point-related functions treat unpaired surrogates as surrogate code points,
57 * it is much more efficient even for random access because the code unit values
59 * This means that it is easy to determine character (code point) boundaries from
64 * (which use pairs of lead/trail code units called "surrogates"),
66 * a single UChar code unit (i.e., their code point values are <=0xffff).
82 * @return The number of UChars in <code>chars</code>, minus the terminator.
90 * Count Unicode code points in the length UChar code units of the string.
91 * A code point may occupy either one or two UChar code units.
92 * Counting code points involves reading all code units.
97 * @param length The number of UChar code units to be checked, or -1 to count all
98 * code points before the first NUL (U+0000).
99 * @return The number of code points in the specified code units.
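
The counting rule described in lines 90-99 can be sketched as follows. This is a minimal illustration, not ICU's implementation: `count_code_points` is a hypothetical name, and `uint16_t` stands in for UChar. A lead unit followed by a trail unit counts as one code point; an unpaired surrogate counts as one code point by itself.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not an ICU function): counts the code points in
 * `length` UTF-16 code units, or up to the first NUL when length is -1. */
static int32_t count_code_points(const uint16_t *s, int32_t length) {
    assert(s != NULL || length == 0);
    int32_t count = 0;
    if (length >= 0) {
        int32_t i = 0;
        while (i < length) {
            uint16_t unit = s[i++];
            /* A lead surrogate directly followed by a trail surrogate
             * forms a single supplementary code point. */
            if (unit >= 0xD800 && unit <= 0xDBFF && i < length &&
                s[i] >= 0xDC00 && s[i] <= 0xDFFF) {
                ++i;
            }
            ++count;
        }
    } else {
        while (*s != 0) {
            uint16_t unit = *s++;
            /* Reading *s is safe here: at worst it is the terminator. */
            if (unit >= 0xD800 && unit <= 0xDBFF &&
                *s >= 0xDC00 && *s <= 0xDFFF) {
                ++s;
            }
            ++count;
        }
    }
    return count;
}
```

Every code unit is read once, which is why counting is linear in the string length.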
106 * Check if the string contains more Unicode code points than a certain number.
107 * This is more efficient than counting all code points in the entire string
111 * never needs to count more than 'number+1' code points.
113 * A Unicode code point may occupy either one or two UChar code units.
117 * @param number The number of code points in the string is compared against
119 * @return Boolean value for whether the string contains more Unicode code points
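
The early-exit idea in lines 106-119 can be sketched like this. The helper name `has_more_code_points_than` and the use of `uint16_t` for UChar are assumptions made for illustration; the length-based shortcuts are one plausible way to avoid scanning and are not taken from the listing.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not an ICU function): true if the string holds
 * more than `number` code points. Stops after at most number+1 of them. */
static bool has_more_code_points_than(const uint16_t *s, int32_t length,
                                      int32_t number) {
    assert(s != NULL || length == 0);
    if (number < 0) {
        return true; /* any string has at least 0 code points */
    }
    if (length >= 0) {
        /* n code units hold at least ceil(n/2) and at most n code points. */
        if (length - length / 2 > number) return true;
        if (length <= number) return false;
        int32_t i = 0;
        while (i < length) {
            if (number == 0) return true; /* one code point too many */
            uint16_t unit = s[i++];
            if (unit >= 0xD800 && unit <= 0xDBFF && i < length &&
                s[i] >= 0xDC00 && s[i] <= 0xDFFF) {
                ++i; /* skip the trail unit of a surrogate pair */
            }
            --number;
        }
        return false;
    }
    while (*s != 0) {
        if (number == 0) return true;
        uint16_t unit = *s++;
        if (unit >= 0xD800 && unit <= 0xDBFF &&
            *s >= 0xDC00 && *s <= 0xDFFF) {
            ++s;
        }
        --number;
    }
    return false;
}
```

For a NUL-terminated string the length is unknown, so only the counting loop applies; it still returns as soon as the limit is exceeded.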
127 * Concatenate two ustrings. Appends a copy of <code>src</code>,
128 * including the null terminator, to <code>dst</code>. The initial copied
129 * character from <code>src</code> overwrites the null terminator in <code>dst</code>.
133 * @return A pointer to <code>dst</code>.
142 * Appends at most <code>n</code> characters from <code>src</code> to <code>dst</code>.
144 * If src is too long, then only <code>n-1</code> characters will be copied
146 * If <code>n&lt;=0</code> then dst is not modified.
151 * @return A pointer to <code>dst</code>.
161 * The substring is found at code point boundaries.
170 * @return A pointer to the first occurrence of <code>substring</code> in <code>s</code>,
171 * or <code>s</code> itself if the <code>substring</code> is empty,
172 * or <code>NULL</code> if <code>substring</code> is not in <code>s</code>.
184 * The substring is found at code point boundaries.
195 * @return A pointer to the first occurrence of <code>substring</code> in <code>s</code>,
196 * or <code>s</code> itself if the <code>substring</code> is empty,
197 * or <code>NULL</code> if <code>substring</code> is not in <code>s</code>.
207 * Find the first occurrence of a BMP code point in a string.
208 * A surrogate code point is found only if its match in the text is not
213 * @param c The BMP code point to find.
214 * @return A pointer to the first occurrence of <code>c</code> in <code>s</code>
215 * or <code>NULL</code> if <code>c</code> is not in <code>s</code>.
227 * Find the first occurrence of a code point in a string.
228 * A surrogate code point is found only if its match in the text is not
233 * @param c The code point to find.
234 * @return A pointer to the first occurrence of <code>c</code> in <code>s</code>
235 * or <code>NULL</code> if <code>c</code> is not in <code>s</code>.
248 * The substring is found at code point boundaries.
257 * @return A pointer to the last occurrence of <code>substring</code> in <code>s</code>,
258 * or <code>s</code> itself if the <code>substring</code> is empty,
259 * or <code>NULL</code> if <code>substring</code> is not in <code>s</code>.
271 * The substring is found at code point boundaries.
282 * @return A pointer to the last occurrence of <code>substring</code> in <code>s</code>,
283 * or <code>s</code> itself if the <code>substring</code> is empty,
284 * or <code>NULL</code> if <code>substring</code> is not in <code>s</code>.
294 * Find the last occurrence of a BMP code point in a string.
295 * A surrogate code point is found only if its match in the text is not
300 * @param c The BMP code point to find.
301 * @return A pointer to the last occurrence of <code>c</code> in <code>s</code>
302 * or <code>NULL</code> if <code>c</code> is not in <code>s</code>.
314 * Find the last occurrence of a code point in a string.
315 * A surrogate code point is found only if its match in the text is not
320 * @param c The code point to find.
321 * @return A pointer to the last occurrence of <code>c</code> in <code>s</code>
322 * or <code>NULL</code> if <code>c</code> is not in <code>s</code>.
334 * Locates the first occurrence in the string <code>string</code> of any of the characters
335 * in the string <code>matchSet</code>.
339 * @param matchSet A NUL-terminated string defining a set of code points
341 * @return A pointer to the character in <code>string</code> that matches one of the
342 * characters in <code>matchSet</code>, or NULL if no such character is found.
349 * Returns the number of consecutive characters in <code>string</code>,
350 * beginning with the first, that do not occur somewhere in <code>matchSet</code>.
354 * @param matchSet A NUL-terminated string defining a set of code points
356 * @return The number of initial characters in <code>string</code> that do not
357 * occur in <code>matchSet</code>.
365 * Returns the number of consecutive characters in <code>string</code>,
366 * beginning with the first, that occur somewhere in <code>matchSet</code>.
370 * @param matchSet A NUL-terminated string defining a set of code points
372 * @return The number of initial characters in <code>string</code> that do
373 * occur in <code>matchSet</code>.
395 * @param delim Set of delimiter characters (Unicode code points).
411 * Compare two Unicode strings for bitwise equality (code unit order).
415 * @return 0 if <code>s1</code> and <code>s2</code> are bitwise equal; a negative
416 * value if <code>s1</code> is bitwise less than <code>s2</code>; a positive
417 * value if <code>s1</code> is bitwise greater than <code>s2</code>.
425 * Compare two Unicode strings in code point order.
432 * in code point order
441 * The comparison can be done in code unit order or in code point order.
443 * comparing supplementary code points (U+10000..U+10ffff)
444 * to BMP code points near the end of the BMP (i.e., U+e000..U+ffff).
445 * In code unit order, high BMP code points sort after supplementary code points
458 * @param codePointOrder Choose between code unit order (FALSE)
459 * and code point order (TRUE).
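
The difference between the two orders described in lines 441-459 can be shown with a small sketch. The function `compare_utf16`, its helpers, and the `uint16_t` stand-in for UChar are hypothetical; the fix-up (moving BMP units at or above 0xD800 below the surrogate range when they are not part of a pair) is one known technique and is presented here as an assumption, not as the library's code.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

static bool is_lead(uint16_t u)  { return u >= 0xD800 && u <= 0xDBFF; }
static bool is_trail(uint16_t u) { return u >= 0xDC00 && u <= 0xDFFF; }

/* Maps the unit at p (known to be >= 0xD800) to a sort key: units inside
 * a surrogate pair keep their value, all other units drop below 0xD800. */
static int32_t fix_up(const uint16_t *start, const uint16_t *p) {
    uint16_t u = *p;
    bool in_pair = (is_lead(u) && is_trail(p[1])) ||
                   (is_trail(u) && p != start && is_lead(p[-1]));
    return in_pair ? (int32_t)u : (int32_t)u - 0x2800;
}

/* Hypothetical helper: compares two NUL-terminated UTF-16 strings in
 * code unit order (false) or code point order (true). */
static int32_t compare_utf16(const uint16_t *s1, const uint16_t *s2,
                             bool code_point_order) {
    assert(s1 != NULL && s2 != NULL);
    const uint16_t *p1 = s1, *p2 = s2;
    while (*p1 == *p2) {
        if (*p1 == 0) return 0; /* both strings ended: equal */
        ++p1;
        ++p2;
    }
    int32_t c1 = *p1, c2 = *p2;
    /* The orders differ only when both differing units are >= 0xD800. */
    if (code_point_order && c1 >= 0xD800 && c2 >= 0xD800) {
        c1 = fix_up(s1, p1);
        c2 = fix_up(s2, p2);
    }
    return c1 - c2;
}
```

With U+FF61 (one unit, 0xFF61) and U+10000 (units 0xD800 0xDC00), code unit order puts U+FF61 last while code point order puts U+10000 last.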
481 * @param codePointOrder Choose between code unit order (FALSE)
482 * and code point order (TRUE).
497 * Compare strings in code point order instead of code unit order.
510 * The comparison can be done in UTF-16 code unit order or in code point order.
511 * They differ only when comparing supplementary code points (U+10000..U+10ffff)
512 * to BMP code points near the end of the BMP (i.e., U+e000..U+ffff).
513 * In code unit order, high BMP code points sort after supplementary code points
528 * Comparison in code unit order with default case folding.
531 * Set to choose code point order instead of code unit order
536 * @param pErrorCode Must be a valid pointer to an error code value,
551 * Compares at most <code>n</code> characters.
556 * @return 0 if <code>s1</code> and <code>s2</code> are bitwise equal; a negative
557 * value if <code>s1</code> is bitwise less than <code>s2</code>; a positive
558 * value if <code>s1</code> is bitwise greater than <code>s2</code>.
567 * Compare two Unicode strings in code point order.
576 * in code point order
590 * Comparison in code unit order with default case folding.
593 * Set to choose code point order instead of code unit order
614 * Comparison in code unit order with default case folding.
617 * Set to choose code point order instead of code unit order
638 * Comparison in code unit order with default case folding.
641 * Set to choose code point order instead of code unit order
657 * @return A pointer to <code>dst</code>.
666 * Copies at most <code>n</code> characters. The result will be null terminated
667 * if the length of <code>src</code> is less than <code>n</code>.
672 * @return A pointer to <code>dst</code>.
689 * @return A pointer to <code>dst</code>.
697 * Copies at most <code>n</code> characters. The result will be null terminated
698 * if the length of <code>src</code> is less than <code>n</code>.
704 * @return A pointer to <code>dst</code>.
718 * @return A pointer to <code>dst</code>.
726 * Copies at most <code>n</code> characters. The result will be null terminated
727 * if the length of <code>src</code> is less than <code>n</code>.
733 * @return A pointer to <code>dst</code>.
747 * @return A pointer to <code>dest</code>
758 * @return A pointer to <code>dest</code>
765 * Initialize <code>count</code> characters of <code>dest</code> to <code>c</code>.
770 * @return A pointer to <code>dest</code>.
777 * Compare the first <code>count</code> UChars of each buffer.
791 * Compare two Unicode strings in code point order.
800 * in code point order
807 * Find the first occurrence of a BMP code point in a string.
808 * A surrogate code point is found only if its match in the text is not
812 * @param s The string to search (contains <code>count</code> UChars).
813 * @param c The BMP code point to find.
815 * @return A pointer to the first occurrence of <code>c</code> in <code>s</code>
816 * or <code>NULL</code> if <code>c</code> is not in <code>s</code>.
827 * Find the first occurrence of a code point in a string.
828 * A surrogate code point is found only if its match in the text is not
832 * @param s The string to search (contains <code>count</code> UChars).
833 * @param c The code point to find.
835 * @return A pointer to the first occurrence of <code>c</code> in <code>s</code>
836 * or <code>NULL</code> if <code>c</code> is not in <code>s</code>.
847 * Find the last occurrence of a BMP code point in a string.
848 * A surrogate code point is found only if its match in the text is not
852 * @param s The string to search (contains <code>count</code> UChars).
853 * @param c The BMP code point to find.
855 * @return A pointer to the last occurrence of <code>c</code> in <code>s</code>
856 * or <code>NULL</code> if <code>c</code> is not in <code>s</code>.
867 * Find the last occurrence of a code point in a string.
868 * A surrogate code point is found only if its match in the text is not
872 * @param s The string to search (contains <code>count</code> UChars).
873 * @param c The code point to find.
875 * @return A pointer to the last occurrence of <code>c</code> in <code>s</code>
876 * or <code>NULL</code> if <code>c</code> is not in <code>s</code>.
901 * <code>NUL</code>, must be specified as a constant.
1069 * @param pErrorCode Must be a valid pointer to an error code value,
1095 * @param pErrorCode Must be a valid pointer to an error code value,
1141 * @param pErrorCode Must be a valid pointer to an error code value,
1172 * @param pErrorCode Must be a valid pointer to an error code value,
1202 * @param pErrorCode Must be a valid pointer to an error code value,
1231 * @param pErrorCode Must be a valid pointer to an error code value,
1247 code is set.
1260 * @param pErrorCode Must be a valid pointer to an error code value,
1277 * If the input string is not well-formed, then the U_INVALID_CHAR_FOUND error code is set.
1290 * @param pErrorCode Must be a valid pointer to an error code value,
1307 * If the input string is not well-formed, then the U_INVALID_CHAR_FOUND error code is set.
1310 * illegal input sequences, instead of stopping with the U_INVALID_CHAR_FOUND error code.
1326 * A substitution character can be any valid Unicode code point (up to U+10FFFF)
1327 * except for surrogate code points (U+D800..U+DFFF).
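
The constraint in lines 1326-1327 amounts to a range check, sketched below together with the one-or-two-unit UTF-16 encoding from the overview. Both `is_valid_subchar` and `append_utf16` are hypothetical names, and `uint16_t` stands in for UChar; any special sentinel value the real API may accept is not covered.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical check: any code point up to U+10FFFF except the
 * surrogate code points U+D800..U+DFFF. */
static bool is_valid_subchar(int32_t c) {
    return c >= 0 && c <= 0x10FFFF && !(c >= 0xD800 && c <= 0xDFFF);
}

/* Writes c as UTF-16 into dest (room for 2 units) and returns the
 * number of code units written: 1 for BMP, 2 for supplementary. */
static int32_t append_utf16(uint16_t *dest, int32_t c) {
    assert(is_valid_subchar(c));
    if (c <= 0xFFFF) {
        dest[0] = (uint16_t)c;
        return 1;
    }
    c -= 0x10000;
    dest[0] = (uint16_t)(0xD800 + (c >> 10));   /* lead surrogate */
    dest[1] = (uint16_t)(0xDC00 + (c & 0x3FF)); /* trail surrogate */
    return 2;
}
```

U+FFFD, the usual replacement character, passes the check and encodes as a single unit.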
1332 * @param pErrorCode Pointer to a standard ICU error code. Its input value must
1352 * If the input string is not well-formed, then the U_INVALID_CHAR_FOUND error code is set.
1355 * illegal input sequences, instead of stopping with the U_INVALID_CHAR_FOUND error code.
1371 * A substitution character can be any valid Unicode code point (up to U+10FFFF)
1372 * except for surrogate code points (U+D800..U+DFFF).
1377 * @param pErrorCode Pointer to a standard ICU error code. Its input value must
1410 * The function will resynchronize to valid code point boundaries
1411 * within a small number of code points after an illegal sequence.
1437 * @param pErrorCode Pointer to a standard ICU error code. Its input value must
1457 * If the input string is not well-formed, then the U_INVALID_CHAR_FOUND error code is set.
1470 * @param pErrorCode Must be a valid pointer to an error code value,
1487 * If the input string is not well-formed, then the U_INVALID_CHAR_FOUND error code is set.
1500 * @param pErrorCode Must be a valid pointer to an error code value,
1517 * If the input string is not well-formed, then the U_INVALID_CHAR_FOUND error code is set.
1520 * illegal input sequences, instead of stopping with the U_INVALID_CHAR_FOUND error code.
1536 * A substitution character can be any valid Unicode code point (up to U+10FFFF)
1537 * except for surrogate code points (U+D800..U+DFFF).
1542 * @param pErrorCode Pointer to a standard ICU error code. Its input value must
1562 * If the input string is not well-formed, then the U_INVALID_CHAR_FOUND error code is set.
1565 * illegal input sequences, instead of stopping with the U_INVALID_CHAR_FOUND error code.
1581 * A substitution character can be any valid Unicode code point (up to U+10FFFF)
1582 * except for surrogate code points (U+D800..U+DFFF).
1587 * @param pErrorCode Pointer to a standard ICU error code. Its input value must