Home | History | Annotate | Download | only in python2.7

Lines Matching full:unicode

8 Unicode implementation based on original code by Fredrik Lundh,
10 Unicode Integration Proposal (see file Misc/unicode.txt).
18 * Yet another Unicode string type for Python. This type supports the
30 * This Unicode String Type is
61 /* --- Internal Unicode Format -------------------------------------------- */
78 /* Setting Py_UNICODE_WIDE enables UCS-4 storage. Otherwise, Unicode
133 /* Py_UNICODE is the native Unicode storage format (code unit) used by
134 Python and represents a single Unicode element in the Unicode
141 /* Unicode API names are mangled to assure that UCS-2 and UCS-4 builds
144 Unicode width assumptions are combined. */
323 /* --- Internal Unicode Operations ---------------------------------------- */
413 /* --- Unicode Type ------------------------------------------------------- */
417 Py_ssize_t length; /* Length of raw Unicode data in buffer */
418 Py_UNICODE *str; /* Raw Unicode buffer */
443 /* This Unicode character will be used as replacement character during
445 Unicode character U+FFFD is the official REPLACEMENT CHARACTER in
446 Unicode 3.0. */
454 /* Create a Unicode Object from the Py_UNICODE buffer u of the given
459 that modifying the Unicode object contents after construction is
465 const Py_UNICODE *u, /* Unicode buffer */
481 /* Return a read-only pointer to the Unicode object's internal
485 PyObject *unicode /* Unicode object */
488 /* Get the length of the Unicode object. */
491 PyObject *unicode /* Unicode object */
494 /* Get the maximum ordinal for a Unicode character. */
497 /* Resize an already allocated Unicode object to the new size length.
499 *unicode is modified to point to the new (resized) object and 0
503 Unicode constructor. The refcount on the object must be 1. Otherwise,
507 is returned and *unicode left untouched.
512 PyObject **unicode, /* Pointer to the Unicode object */
516 /* Coerce obj to an Unicode object and return a reference with
525 2. All other objects (including Unicode objects) raise an
539 /* Coerce obj to an Unicode object and return a reference with
542 Unicode objects are passed back as-is (subclasses are converted to
543 true Unicode objects), all other objects are delegated to
569 /* Create a Unicode Object from the whcar_t buffer w of the given
579 /* Copies the Unicode Object contents into the wchar_t buffer w. At
592 PyUnicodeObject *unicode, /* Unicode object */
599 /* --- Unicode ordinals --------------------------------------------------- */
601 /* Create a Unicode Object from the given Unicode code point ordinal.
613 /* Clear the free list used by the Unicode implementation.
626 of the builtin unicode() API.
643 Unicode object.
645 The resulting string is cached in the Unicode object for subsequent
648 the Unicode object itself.
682 /* Create a Unicode object by decoding the encoded string s of the
696 const Py_UNICODE *s, /* Unicode char buffer */
702 /* Encodes a Unicode object and returns the result as Python
706 PyObject *unicode, /* Unicode object */
711 /* Encodes a Unicode object and returns the result as Python string
715 PyObject *unicode, /* Unicode object */
741 const Py_UNICODE *data, /* Unicode char buffer */
764 PyObject *unicode /* Unicode object */
768 const Py_UNICODE *data, /* Unicode char buffer */
776 the corresponding Unicode object.
821 PyObject *unicode /* Unicode object */
825 the Unicode data.
835 Unicode BOM mark (U+FEFF). In the other two modes, no BOM mark is
841 Unicode char buffer */
850 the corresponding Unicode object.
895 PyObject *unicode /* Unicode object */
899 the Unicode data.
909 Unicode BOM mark (U+FEFF). In the other two modes, no BOM mark is
919 const Py_UNICODE *data, /* Unicode char buffer */
925 /* --- Unicode-Escape Codecs ---------------------------------------------- */
928 const char *string, /* Unicode-Escape encoded string */
934 PyObject *unicode /* Unicode object */
938 const Py_UNICODE *data, /* Unicode char buffer */
942 /* --- Raw-Unicode-Escape Codecs ------------------------------------------ */
945 const char *string, /* Raw-Unicode-Escape encoded string */
951 PyObject *unicode /* Unicode object */
955 const Py_UNICODE *data, /* Unicode char buffer */
959 /* --- Unicode Internal Codec ---------------------------------------------
971 Note: Latin-1 corresponds to the first 256 Unicode ordinals.
982 PyObject *unicode /* Unicode object */
986 const Py_UNICODE *data, /* Unicode char buffer */
1004 PyObject *unicode /* Unicode object */
1008 const Py_UNICODE *data, /* Unicode char buffer */
1018 Unicode characters, integers (which are then interpreted as Unicode
1022 Encoding mappings must map single Unicode characters to single
1029 Unicode or Latin-1 ordinal resp. Because of this mappings only need
1039 (char ordinal -> unicode ordinal) */
1044 PyObject *unicode, /* Unicode object */
1046 (unicode ordinal -> char ordinal) */
1050 const Py_UNICODE *data, /* Unicode char buffer */
1053 (unicode ordinal -> char ordinal) */
1058 character mapping table to it and return the resulting Unicode
1061 The mapping table must map Unicode ordinal integers to Unicode
1071 const Py_UNICODE *data, /* Unicode char buffer */
1095 PyObject *unicode /* Unicode object */
1099 const Py_UNICODE *data, /* Unicode char buffer */
1108 /* Takes a Unicode string holding a decimal value and writes it into
1116 \0 as-is. Characters outside this range (Unicode ordinals 1-256)
1131 Py_UNICODE *s, /* Unicode buffer */
1139 These are capable of handling Unicode objects and strings on input
1141 Unicode objects or integers as apporpriate. */
1143 /* Concat two strings giving a new Unicode string. */
1150 /* Split a string giving a list of Unicode strings.
1192 /* Split a string giving a list of Unicode strings.
1212 return the resulting Unicode object.
1214 The mapping table must map Unicode ordinal integers to Unicode
1230 the resulting Unicode string. */
1270 and return the resulting Unicode object. */
1295 case the conversion of the arguments to Unicode
1311 the resulting Unicode string. */
1321 element has to coerce to an one element Unicode string. -1 is
1329 /* Externally visible for str.strip(unicode) */
1350 Py_UNICODE ch /* Unicode character */
1354 Py_UNICODE ch /* Unicode character */
1358 Py_UNICODE ch /* Unicode character */
1362 const Py_UNICODE ch /* Unicode character */
1366 const Py_UNICODE ch /* Unicode character */
1370 Py_UNICODE ch /* Unicode character */
1374 Py_UNICODE ch /* Unicode character */
1378 Py_UNICODE ch /* Unicode character */
1382 Py_UNICODE ch /* Unicode character */
1386 Py_UNICODE ch /* Unicode character */
1390 Py_UNICODE ch /* Unicode character */
1394 Py_UNICODE ch /* Unicode character */
1398 Py_UNICODE ch /* Unicode character */
1402 Py_UNICODE ch /* Unicode character */
1406 Py_UNICODE ch /* Unicode character */