Table 9 Supported character sets

Supported character

Description

set

 

 

 

ISO-8859-1

Western European, extended ASCII

 

 

WINDOWS-1252

(Code pages supported by Windows) Latin 1

 

 

US-ASCII

7-bit American Standard Code for Information Interchange

 

 

UTF-8

Universal (all languages)

 

 

ISO-8859-2

Eastern European

 

 

KOI8-R

Cyrillic (Russian and Bulgarian)

 

 

ISO-8859-5

Cyrillic (Bulgarian, Belarusian, Russian)

 

 

WINDOWS-1254

(Code pages supported by Windows) Turkish

 

 

GB18030

Chinese (Mainland)

 

 

BIG5

Chinese (Taiwan)

 

 

GB2312

Chinese (Mainland)

 

 

EUC-KR

Korean

 

 

KS_C-5601-1987

Korean

 

 

ISO-2022-JP

Japanese

 

 

EUC-JP

Japanese

 

 

SHIFT-JIS

Japanese

 

 

Matching words

Matching words is not case-sensitive: cat, Cat, cAt, and CAT all match. Corresponding uppercase and lowercase letters, such as A and a, are treated the same in all respects.

There are two kinds of query words: words that contain occurrences of one or both of the wildcard characters * and ?, and literal words that do not contain wildcards.

Literal words that do not contain wildcards

Words containing occurrences of one or both wildcard characters * and ?

A literal word in a query expression matches the same word, character for character (case ignored), in an archived document. A word with wildcard characters (* or ?) matches a document word in the same way, character by character, except for the following:

A ? matches any single character in a document word. For example, b??t matches beat, beet, boat, blot, best, bust, bout, and so on.

An * matches any sequence of characters in a document word, including a sequence of no characters. For example, f*t matches the document words foot, feet, fit, fault, and ft; and f* matches any document word beginning with f.

You can use any number of wildcard characters (* or ?) in a query word, but you cannot use a wildcard at the beginning of a query word. An error message results. For example, *ion is not a valid query.

Matching similar words

Topics include:

Fuzzy words, page 36

Measuring word similarity, page 36

User Guide

35

Page 35
Image 35
HP RISS Components Matching words, Matching similar words, Supported character sets, Supported character Description Set