Table 9 Supported character sets

Supported character set   Description

ISO-8859-1                Western European, extended ASCII
WINDOWS-1252              (Code pages supported by Windows) Latin 1
US-ASCII                  7-bit American Standard Code for Information Interchange
UTF-8                     Universal (all languages)
ISO-8859-2                Eastern European
KOI8-R                    Cyrillic (Russian and Bulgarian)
ISO-8859-5                Cyrillic (Bulgarian, Belarusian, Russian)
WINDOWS-1254              (Code pages supported by Windows) Turkish
GB18030                   Chinese (Mainland)
BIG5                      Chinese (Taiwan)
GB2312                    Chinese (Mainland)
EUC-KR                    Korean
KS_C-5601-1987            Korean
ISO-2022-JP               Japanese
EUC-JP                    Japanese
SHIFT-JIS                 Japanese

Matching words

Matching words is not case-sensitive: cat, Cat, cAt, and CAT all match. Corresponding uppercase and lowercase letters, such as A and a, are treated the same in all respects.
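The case rule can be illustrated with a minimal Python sketch. The helper name words_match is hypothetical, and the use of str.casefold() is an assumption; the product's actual case-folding rules are not described here.

```python
def words_match(query_word: str, document_word: str) -> bool:
    """Hypothetical helper: compare two literal words, ignoring case.

    Assumption: str.casefold() stands in for whatever case folding
    the search engine really applies.
    """
    return query_word.casefold() == document_word.casefold()


# All four spellings from the text match the query word "cat".
for candidate in ["cat", "Cat", "cAt", "CAT"]:
    print(candidate, words_match("cat", candidate))
```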

There are two kinds of query words:

Literal words that do not contain wildcards

Words containing occurrences of one or both wildcard characters * and ?

A literal word in a query expression matches the same word, character for character (case ignored), in an archived document. A word with wildcard characters (* or ?) matches a document word in the same way, character by character, except for the following:

A ? matches any single character in a document word. For example, b??t matches beat, beet, boat, blot, best, bust, bout, and so on.

An * matches any sequence of characters in a document word, including a sequence of no characters. For example, f*t matches the document words foot, feet, fit, fault, and ft; and f* matches any document word beginning with f.

You can use any number of wildcard characters (* or ?) in a query word, but a query word cannot begin with a wildcard; a leading wildcard produces an error message. For example, *ion is not a valid query word.
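The wildcard rules above can be sketched in Python by translating a query word into a regular expression. This is only an illustration under stated assumptions, not the product's implementation: the function names compile_query_word and matches are hypothetical, and raising ValueError stands in for the error message the product reports.

```python
import re


def compile_query_word(query_word: str) -> re.Pattern:
    """Hypothetical helper: turn a query word into a compiled pattern."""
    # Rule from the text: a wildcard cannot begin a query word.
    if query_word[:1] in ("*", "?"):
        raise ValueError(f"query word cannot begin with a wildcard: {query_word!r}")
    parts = []
    for ch in query_word:
        if ch == "*":
            parts.append(".*")           # any sequence, including an empty one
        elif ch == "?":
            parts.append(".")            # exactly one character
        else:
            parts.append(re.escape(ch))  # literal character
    # IGNORECASE mirrors the rule that matching is not case-sensitive.
    return re.compile("".join(parts), re.IGNORECASE | re.DOTALL)


def matches(query_word: str, document_word: str) -> bool:
    """Return True if the whole document word matches the query word."""
    return compile_query_word(query_word).fullmatch(document_word) is not None


print(matches("b??t", "boat"))  # ? stands for one character each
print(matches("f*t", "ft"))     # * may stand for no characters
print(matches("f*", "Fault"))   # case is ignored
```

Using fullmatch rather than search keeps the comparison character for character across the entire document word, so b??t does not match a longer word such as boats.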

Matching similar words

Topics include:

Fuzzy words, page 36

Measuring word similarity, page 36
