UTF-8 String Functions
The UTF-8 string functions handle string operations on UTF-8 encoded strings. More...Defines | |
| #define | rtxUTF8StrToInt32 rtxUTF8StrToInt |
| #define | rtxUTF8StrToUInt32 rtxUTF8StrToUInt |
| #define | RTUTF8STRCMPL(name, lstr) rtxUTF8Strcmp(name,(const OSUTF8CHAR*)lstr) |
| Compare UTF-8 string to a string literal. | |
Functions | |
| EXTERNRT long | rtxUTF8ToUnicode (OSCTXT *pctxt, const OSUTF8CHAR *inbuf, OSUNICHAR *outbuf, size_t outbufsiz) |
| This function converts a UTF-8 string to a Unicode string (UTF-16). | |
| EXTERNRT int | rtxValidateUTF8 (OSCTXT *pctxt, const OSUTF8CHAR *inbuf) |
| This function will validate a UTF-8 encoded string to ensure that it is encoded correctly. | |
| EXTERNRT size_t | rtxUTF8Len (const OSUTF8CHAR *inbuf) |
| This function will return the length (in characters) of a null-terminated UTF-8 encoded string. | |
| EXTERNRT size_t | rtxUTF8LenBytes (const OSUTF8CHAR *inbuf) |
| This function will return the length (in bytes) of a null-terminated UTF-8 encoded string. | |
| EXTERNRT int | rtxUTF8CharSize (OS32BITCHAR wc) |
| This function will return the number of bytes needed to encode the given 32-bit universal character value as a UTF-8 character. | |
| EXTERNRT int | rtxUTF8EncodeChar (OS32BITCHAR wc, OSOCTET *buf, size_t bufsiz) |
| This function will convert a wide character into an encoded UTF-8 character byte string. | |
| EXTERNRT int | rtxUTF8DecodeChar (OSCTXT *pctxt, const OSUTF8CHAR *pinbuf, int *pInsize) |
| This function will convert an encoded UTF-8 character byte string into a wide character value. | |
| EXTERNRT OS32BITCHAR | rtxUTF8CharToWC (const OSUTF8CHAR *buf, OSUINT32 *len) |
| Thia function will convert a UTF-8 encoded character value into a wide character. | |
| EXTERNRT OSUTF8CHAR * | rtxUTF8StrChr (OSUTF8CHAR *utf8str, OS32BITCHAR utf8char) |
| This function finds a character in the given UTF-8 character string. | |
| EXTERNRT OSUTF8CHAR * | rtxUTF8Strdup (OSCTXT *pctxt, const OSUTF8CHAR *utf8str) |
| This function creates a duplicate copy of the given UTF-8 character string. | |
| EXTERNRT OSUTF8CHAR * | rtxUTF8Strndup (OSCTXT *pctxt, const OSUTF8CHAR *utf8str, size_t nbytes) |
| This function creates a duplicate copy of the given UTF-8 character string. | |
| EXTERNRT OSUTF8CHAR * | rtxUTF8StrRefOrDup (OSCTXT *pctxt, const OSUTF8CHAR *utf8str) |
| This function check to see if the given UTF8 string pointer exists on the memory heap. | |
| EXTERNRT OSBOOL | rtxUTF8StrEqual (const OSUTF8CHAR *utf8str1, const OSUTF8CHAR *utf8str2) |
| This function compares two UTF-8 string values for equality. | |
| EXTERNRT OSBOOL | rtxUTF8StrnEqual (const OSUTF8CHAR *utf8str1, const OSUTF8CHAR *utf8str2, size_t count) |
| This function compares two UTF-8 string values for equality. | |
| EXTERNRT int | rtxUTF8Strcmp (const OSUTF8CHAR *utf8str1, const OSUTF8CHAR *utf8str2) |
| This function compares two UTF-8 character strings and returns a trinary result (equal, less than, greater than). | |
| EXTERNRT int | rtxUTF8Strncmp (const OSUTF8CHAR *utf8str1, const OSUTF8CHAR *utf8str2, size_t count) |
| This function compares two UTF-8 character strings and returns a trinary result (equal, less than, greater than). | |
| EXTERNRT OSUTF8CHAR * | rtxUTF8Strcpy (OSUTF8CHAR *dest, size_t bufsiz, const OSUTF8CHAR *src) |
| This function copies a null-terminated UTF-8 string to a target buffer. | |
| EXTERNRT OSUTF8CHAR * | rtxUTF8Strncpy (OSUTF8CHAR *dest, size_t bufsiz, const OSUTF8CHAR *src, size_t nchars) |
| This function copies the given number of characters from a UTF-8 string to a target buffer. | |
| EXTERNRT OSUINT32 | rtxUTF8StrHash (const OSUTF8CHAR *str) |
| This function computes a hash code for the given string value. | |
| EXTERNRT const OSUTF8CHAR * | rtxUTF8StrJoin (OSCTXT *pctxt, const OSUTF8CHAR *str1, const OSUTF8CHAR *str2, const OSUTF8CHAR *str3, const OSUTF8CHAR *str4, const OSUTF8CHAR *str5) |
| This function concatanates up to five substrings together into a single string. | |
| EXTERNRT int | rtxUTF8StrToBool (const OSUTF8CHAR *utf8str, OSBOOL *pvalue) |
| This function converts the given null-terminated UTF-8 string to a boolean (true/false) value. | |
| EXTERNRT int | rtxUTF8StrnToBool (const OSUTF8CHAR *utf8str, size_t nbytes, OSBOOL *pvalue) |
| This function converts the given part of UTF-8 string to a boolean (true/false) value. | |
| EXTERNRT int | rtxUTF8StrToDouble (const OSUTF8CHAR *utf8str, OSREAL *pvalue) |
| This function converts the given null-terminated UTF-8 string to a floating point (C/C++ double) value. | |
| EXTERNRT int | rtxUTF8StrnToDouble (const OSUTF8CHAR *utf8str, size_t nbytes, OSREAL *pvalue) |
| This function converts the given part of UTF-8 string to a double value. | |
| EXTERNRT int | rtxUTF8StrToInt (const OSUTF8CHAR *utf8str, OSINT32 *pvalue) |
| This function converts the given null-terminated UTF-8 string to an integer value. | |
| EXTERNRT int | rtxUTF8StrnToInt (const OSUTF8CHAR *utf8str, size_t nbytes, OSINT32 *pvalue) |
| This function converts the given part of UTF-8 string to an integer value. | |
| EXTERNRT int | rtxUTF8StrToUInt (const OSUTF8CHAR *utf8str, OSUINT32 *pvalue) |
| This function converts the given null-terminated UTF-8 string to an unsigned integer value. | |
| EXTERNRT int | rtxUTF8StrnToUInt (const OSUTF8CHAR *utf8str, size_t nbytes, OSUINT32 *pvalue) |
| This function converts the given part of UTF-8 string to an unsigned integer value. | |
| EXTERNRT int | rtxUTF8StrToInt64 (const OSUTF8CHAR *utf8str, OSINT64 *pvalue) |
| This function converts the given null-terminated UTF-8 string to a 64-bit integer value. | |
| EXTERNRT int | rtxUTF8StrnToInt64 (const OSUTF8CHAR *utf8str, size_t nbytes, OSINT64 *pvalue) |
| This function converts the given part of UTF-8 string to a 64-bit integer value. | |
| EXTERNRT int | rtxUTF8StrToUInt64 (const OSUTF8CHAR *utf8str, OSUINT64 *pvalue) |
| This function converts the given null-terminated UTF-8 string to an unsigned 64-bit integer value. | |
| EXTERNRT int | rtxUTF8StrnToUInt64 (const OSUTF8CHAR *utf8str, size_t nbytes, OSUINT64 *pvalue) |
| This function converts the given part of UTF-8 string to an unsigned 64-bit integer value. | |
| EXTERNRT int | rtxUTF8ToDynUniStr (OSCTXT *pctxt, const OSUTF8CHAR *utf8str, const OSUNICHAR **ppdata, OSUINT32 *pnchars) |
| This function converts the given UTF-8 string to a Unicode string. | |
| EXTERNRT int | rtxUTF8RemoveWhiteSpace (const OSUTF8CHAR *utf8instr, size_t nbytes, const OSUTF8CHAR **putf8outstr) |
| This function removes leading and trailing whitespace from a string. | |
| EXTERNRT int | rtxUTF8StrToDynHexStr (OSCTXT *pctxt, const OSUTF8CHAR *utf8str, OSDynOctStr *pvalue) |
| This function converts the given null-terminated UTF-8 string to a octet string value. | |
| EXTERNRT int | rtxUTF8StrnToDynHexStr (OSCTXT *pctxt, const OSUTF8CHAR *utf8str, size_t nbytes, OSDynOctStr *pvalue) |
| This function converts the given part of UTF-8 string to a octet string value. | |
| EXTERNRT int | rtxUTF8StrToNamedBits (OSCTXT *pctxt, const OSUTF8CHAR *utf8str, const OSBitMapItem *pBitMap, OSOCTET *pvalue, OSUINT32 *pnbits, OSUINT32 bufsize) |
| This function converts the given null-terminated UTF-8 string to named bit items. | |
| EXTERNRT const OSUTF8CHAR * | rtxUTF8StrNextTok (OSUTF8CHAR *utf8str, OSUTF8CHAR **ppNext) |
| This function returns the next whitespace-separated token from the input string. | |
Detailed Description
The UTF-8 string functions handle string operations on UTF-8 encoded strings.This is the default character string data type used for encoded XML data. UTF-8 strings are represented in C as strings of unsigned characters (bytes) to cover the full range of possible single character encodings.
Define Documentation
|
|
Compare UTF-8 string to a string literal.
|
Function Documentation
|
|
This function will return the number of bytes needed to encode the given 32-bit universal character value as a UTF-8 character.
|
|
||||||||||||
|
Thia function will convert a UTF-8 encoded character value into a wide character.
|
|
||||||||||||||||
|
This function will convert an encoded UTF-8 character byte string into a wide character value.
|
|
||||||||||||||||
|
This function will convert a wide character into an encoded UTF-8 character byte string.
|
|
|
This function will return the length (in characters) of a null-terminated UTF-8 encoded string.
|
|
|
This function will return the length (in bytes) of a null-terminated UTF-8 encoded string.
|
|
||||||||||||||||
|
This function removes leading and trailing whitespace from a string.
|
|
||||||||||||
|
This function finds a character in the given UTF-8 character string.
It is similar to the C
|
|
||||||||||||
|
This function compares two UTF-8 character strings and returns a trinary result (equal, less than, greater than).
It is similar to the C
|
|
||||||||||||||||
|
This function copies a null-terminated UTF-8 string to a target buffer.
It is similar to the C
|
|
||||||||||||
|
This function creates a duplicate copy of the given UTF-8 character string.
It is similar to the C
|
|
||||||||||||
|
This function compares two UTF-8 string values for equality.
|
|
|
This function computes a hash code for the given string value.
|
|
||||||||||||||||||||||||||||
|
This function concatanates up to five substrings together into a single string.
|
|
||||||||||||||||
|
This function compares two UTF-8 character strings and returns a trinary result (equal, less than, greater than).
In this case, a maximum count of the number of bytes to compare can be specified. It is similar to the C
|
|
||||||||||||||||||||
|
This function copies the given number of characters from a UTF-8 string to a target buffer.
It is similar to the C
|
|
||||||||||||||||
|
This function creates a duplicate copy of the given UTF-8 character string.
It is similar to the
|
|
||||||||||||||||
|
This function compares two UTF-8 string values for equality.
It is similar to the
|
|
||||||||||||
|
This function returns the next whitespace-separated token from the input string. It also returns a pointer to the first non-whitespace chracter after the parsed token. Note that the input string is altered in the operation as null-terminators are insterted to mark the token boundaries.
|
|
||||||||||||||||
|
This function converts the given part of UTF-8 string to a boolean (true/false) value. It is assumed the string contains only the tokens 'true', 'false', '1', or '0'.
|
|
||||||||||||||||
|
This function converts the given part of UTF-8 string to a double value. It is assumed the string contains only numeric digits, whitespace, and other special floating point characters. It is similar to the C atof function except that the result is returned as a separate argument and an error status value is returned if the conversion cannot be performed successfully.
|
|
||||||||||||||||||||
|
This function converts the given part of UTF-8 string to a octet string value. The string consists of a series of hex digits. This is the dynamic version in which memory is allocated for the returned octet string variable.
|
|
||||||||||||||||
|
This function converts the given part of UTF-8 string to an integer value. It is assumed the string contains only numeric digits and whitespace. It is similar to the C atoi function except that the result is returned as a separate argument and an error status value is returned if the conversion cannot be performed successfully.
|
|
||||||||||||||||
|
This function converts the given part of UTF-8 string to a 64-bit integer value. It is assumed the string contains only numeric digits and whitespace.
|
|
||||||||||||||||
|
This function converts the given part of UTF-8 string to an unsigned integer value. It is assumed the string contains only numeric digits and whitespace.
|
|
||||||||||||||||
|
This function converts the given part of UTF-8 string to an unsigned 64-bit integer value. It is assumed the string contains only numeric digits and whitespace.
|
|
||||||||||||
|
This function check to see if the given UTF8 string pointer exists on the memory heap. If it does, its reference count is incremented; otherwise, a duplicate copy is made.
|
|
||||||||||||
|
This function converts the given null-terminated UTF-8 string to a boolean (true/false) value. It is assumed the string contains only the tokens 'true', 'false', '1', or '0'.
|
|
||||||||||||
|
This function converts the given null-terminated UTF-8 string to a floating point (C/C++ double) value. It is assumed the string contains only numeric digits, special floating point characters (+,-,E,.), and whitespace. It is similar to the C atof function except that the result is returned as a separate argument and an error status value is returned if the conversion cannot be performed successfully.
|
|
||||||||||||||||
|
This function converts the given null-terminated UTF-8 string to a octet string value. The string consists of a series of hex digits. This is the dynamic version in which memory is allocated for the returned octet string variable.
|
|
||||||||||||
|
This function converts the given null-terminated UTF-8 string to an integer value. It is assumed the string contains only numeric digits and whitespace. It is similar to the C atoi function except that the result is returned as a separate argument and an error status value is returned if the conversion cannot be performed successfully.
|
|
||||||||||||
|
This function converts the given null-terminated UTF-8 string to a 64-bit integer value. It is assumed the string contains only numeric digits and whitespace.
|
|
||||||||||||||||||||||||||||
|
This function converts the given null-terminated UTF-8 string to named bit items. The token-to-bit mappings are defined by a bit map table that is passed into the function. It is assumed the string contains a space-separated list of named bit token values.
|
|
||||||||||||
|
This function converts the given null-terminated UTF-8 string to an unsigned integer value. It is assumed the string contains only numeric digits and whitespace.
|
|
||||||||||||
|
This function converts the given null-terminated UTF-8 string to an unsigned 64-bit integer value. It is assumed the string contains only numeric digits and whitespace.
|
|
||||||||||||||||||||
|
This function converts the given UTF-8 string to a Unicode string. Memory is allocated for the Unicode string using the rtxMemAlloc function. This memory will be freed when the context is freed (rtxFreeContext) or it can be freed using rtxMemFreePtr.
|
|
||||||||||||||||||||
|
This function converts a UTF-8 string to a Unicode string (UTF-16). The Unicode string is stored as an array of 16-bit characters (unsigned short integers).
|
|
||||||||||||
|
This function will validate a UTF-8 encoded string to ensure that it is encoded correctly.
|
