HP Integrated Archive Platform manual Matching words, Supported character sets

Page 39

Table 12 Supported character sets

Supported character set   Description

ISO-8859-1                Western European, extended ASCII
WINDOWS-1252              (Code pages supported by Windows) Latin 1
US-ASCII                  7-bit American Standard Code for Information Interchange
UTF-8                     Universal (all languages)
ISO-8859-2                Eastern European
KOI8-R                    Cyrillic (Russian and Bulgarian)
ISO-8859-5                Cyrillic (Bulgarian, Belarusian, Russian)
WINDOWS-1251              Cyrillic
WINDOWS-1254              (Code pages supported by Windows) Turkish
ISO-8859-9                Turkish
GB18030                   Chinese (Mainland)
BIG5                      Chinese (Taiwan)
GB2312                    Chinese (Mainland)
EUC-KR                    Korean
KS_C-5601-1987            Korean
ISO-2022-JP               Japanese
EUC-JP                    Japanese
SHIFT-JIS                 Japanese

Matching words

Word matching is not case-sensitive: cat, Cat, cAt, and CAT all match one another. Corresponding uppercase and lowercase letters, such as A and a, are treated as identical.

There are two kinds of query words: words that contain occurrences of one or both of the wildcard characters * and ?, and literal words that do not contain wildcards.

A literal word in a query expression matches the same word, character for character (case ignored), in an archived document. A word with wildcard characters (* or ?) matches a document word in the same way, character by character, except for the following:

A ? matches any single character in a document word. For example, b??t matches beat, beet, boat, blot, best, bust, bout, and so on.

An * matches any sequence of characters in a document word, including a sequence of no characters. For example, f*t matches the document words foot, feet, fit, fault, and ft; and f* matches any document word beginning with f.

You can use any number of wildcard characters (* or ?) in a query word, but a query word cannot begin with a wildcard; doing so results in an error message. For example, *ion is not a valid query.
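The matching rules above can be sketched in a few lines of Python. This is only an illustration of the described behavior, not the IAP implementation: the helper names compile_query_word and matches are hypothetical, and translating the query word into a regular expression is an assumption made for the sketch.

```python
import re


def compile_query_word(word: str) -> re.Pattern:
    """Translate a query word into a regex (hypothetical helper, not an IAP API)."""
    # A query word may not begin with a wildcard; the guide says this is an error.
    if word[:1] in ("*", "?"):
        raise ValueError(f"query word may not begin with a wildcard: {word!r}")
    parts = []
    for ch in word:
        if ch == "?":
            parts.append(".")    # ? matches exactly one character
        elif ch == "*":
            parts.append(".*")   # * matches any sequence, including none
        else:
            parts.append(re.escape(ch))  # literal character
    # IGNORECASE makes cat, Cat, cAt, and CAT equivalent.
    return re.compile("".join(parts), re.IGNORECASE | re.DOTALL)


def matches(query_word: str, document_word: str) -> bool:
    """Return True if the whole document word matches the query word."""
    return compile_query_word(query_word).fullmatch(document_word) is not None


if __name__ == "__main__":
    print(matches("cat", "CAT"))     # literal word, case ignored
    print(matches("b??t", "boat"))   # two single-character wildcards
    print(matches("f*t", "ft"))      # * may match no characters
```

Note that the sketch matches the entire document word (fullmatch), so b??t matches only four-letter words, consistent with the examples above.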
