Onix Text Retrieval Toolkit
API Reference

API
Function List
Topical List

Main Index

Introduction
Calling Sequences
Query Processing
Relevancy Ranking
Data Types
Error Handling
   
Support
   
Lextek Products
Onix Text Retrieval Engine
Lextek Document Profiler & Categorizer
Brevity Document Summarizer
RouteX Routing Engine
Language Identifier

ixAddStopWord

Name

ixAddStopWord

Synopsis

void ixAddStopWord(OnixStopWordListT StopWordList, UCharT *Word, size_t WordLength, StatusCodeT *Status)

Arguments

StopWordList: A stop word list which was created by a call to ixCreateStopWordList

Word:A pointer to the word's data

WordLength: The length of the word in bytes.

Status: A pointer to a value of type StatusCodeT.  If an error occurs, it will be reported in *Status.

Returns

Nothing.

If an error occurred, Status will be set to the error number.

Description

ixAddStopWord adds a new stop word to the list of stop words in the stop word list. You check whether a word is a stop word with a call to ixIsStopWord. Stop words are words like "a", "the" or other frequently occurring words that typically add little or no information to a sentence. By eliminating stop words you can reduce your index size by as much as 30%.

Generally before calling ixIndexWord you will call ixIsStopWord to see if you should index the word. Exactly how you utilize the information is up to you. Be aware that if you aren't careful how you eliminate stop words you can affect phrase searching. i.e. if you don't index "the" then searching for "the first time" as a phrase won't match. Typically you will thus have to eliminate stop words from both the index and any queries.

See Also

Dealing with Stop Words, Stop Word List 1, Stop Word List 2
ixCreateStopWordList, ixDeleteStopWordList, ixAddStopWordList, ixIsStopWord, ixNumStopWords