Ascert Support Website - Knowledgebase

How does the Knowledgebase search function work?

The search function performs a free text search of the Knowledgebase. It searches the current displayed category and all its subcategories.

The default search algorithm regards any sequence of true word characters (letters, digits, and underscores) as a word. That sequence may also contain apostrophes ('), but not more than one in a row. This means that aaa'bbb is regarded as one word, but aaa''bbb is regarded as two words. Apostrophes at the beginning or the end of a word are stripped by the FULLTEXT parser; 'aaa'bbb' would be parsed as aaa'bbb.

The parser determines where words start and end by looking for certain delimiter characters; for example, " " (space), "," (comma), and "." (period).

Some words are ignored in full-text searches:

Any word that is too short is ignored. The minimum length of words that are found is three characters.
Words in the stopword list are ignored. A stopword is a word such as "the" or "some" that is so common that it is considered to have zero semantic value.

Every correct word in the collection and in the query is weighted according to its significance in the collection or query. Consequently, a word that is present in many documents has a lower weight (and may even have a zero weight), because it has lower semantic value in this particular collection. Conversely, if the word is rare, it receives a higher weight. The weights of the words are combined to compute the relevance of the row.

Words that are present in at least 50% of the rows are effectively treated as stopwords. This is because a word that matches half of the rows in a table is less likely to locate relevant documents. In fact, it most likely finds plenty of irrelevant documents.

User Comments

None found

Article Information

Created:

27-Feb-2006

Last Updated:

27-Feb-2006

•	Why am I locked out of the website?
•	What if I can't find an answer in the Knowledgebase?
•	Do you have any RSS feeds on the website?

All times are GMT -8. The time now is 08:07 pm.

ascert Support

Support