Onix Text Retrieval Toolkit
API Reference

API
Function List
Topical List

Main Index

Introduction
Calling Sequences
Query Processing
Relevancy Ranking
Data Types
Error Handling
   
Support
   
Lextek Products
Onix Text Retrieval Engine
Lextek Document Profiler & Categorizer
Brevity Document Summarizer
RouteX Routing Engine
Language Identifier

ucTableNormalizeChar

Name

ucTableNormalizeChar

Synopsis

void ucTableNormalizeChar( UnicodeCharT *TableBuffer, UnicodeCharT *UnicodeChar, size_t MaxChars)

Arguments

TableBuffer: Table of characters of type UnicodeCharT which has been initialized by ucInitializeNormalizationTable.

UnicodeChar: A pointer to a buffer which contains a unicode character.

MaxChars: The maximum number of characters in the TableBuffer. (So as to prevent overruns.)

Returns

Nothing.

Description

Many European languages have characters with many different accents. ucNormalizeChar helps by normalizing these characters to their form without their accents to either their upper or lower case form. ucTableNormalizeChar does this for all the Unicode characters from the Latin (ASCII) and European Latin code pages. ucTableNormalizeChar is designed to work with Unicode and does not work with single byte or other character sets.

ucTableNormalizeChar is almost identical to ucNormalizeChar except since it is table driven, it is much faster. ucTableNormalizeChar normalizes the character to either upper case or lower case depending on the initialization of the TableBuffer by ucInitializeNormalizationTable.

See Also

Unicode
ucNormalizeChar, ixUnicodeCharToHex, ixUnicodeHexToChar