HP Integrated Archive Platform manual Query expressions, Word characters

Page 37

3 Query expression syntax and matching

Query expression syntax and matching describes the IAP Web Interface syntax to use to search and retrieve archived documents (files or email messages), and explains how queries are matched against documents.

Major topics include:

Query expressions, page 37

Word characters, page 37

Letters and digits in different character sets, page 38

Matching words, page 39

Matching similar words, page 40

Matching word sequences, page 40

Boolean query expressions, page 43

Nested Boolean query expressions, page 44

Query expression examples, page 44

Query expressions

Query expressions can be as simple or as complex as needed. The essential idea behind document retrieval is that query words are compared with document words to find a match. You can also:

Look for document words that are textually similar, but not necessarily identical, to query words. See Matching similar words” on page 40.

Look for word sequences in a document: words that are near each other, and in a particular order. See Matching word sequences” on page 40.

Combine query words using logical (Boolean) operators (AND, OR, NOT). See Boolean query expressions” on page 43.

Together, these query constructs provide considerable power to find what you need, provided you learn to use them well.

The way query expressions are interpreted is similar to the way documents are indexed when archived. Text is parsed (broken down) into words. Remaining characters are considered separators and ignored. Query expressions are fundamentally composed of words, no matter how complex the expression.

For indexing and searching, a word need not belong to a natural language, such as English. For example, wt6_ht3 is a valid document word or query word. Query words can contain wildcards, such as in f??t.

Word characters

When the system examines a query expression to determine its words, some characters are not included in query words, but are treated as word separators. When a document is archived, indexing determines which document words are available for searching in the same way.

Learning the rules of creating query words means also learning the rules of document indexing and, therefore, what words you can search for.

User Guide

37

Image 37
Contents HP Integrated Archive Platform User Guide Page Contents Index Figures Tables Intended audience Document conventions and symbolsPrerequisites Related documentationHP technical support Subscription serviceOther web sites TIPUser Guide About this guide EAs applications Understanding document archivingApplication What You Can Do Indexed document types Understanding searching and document indexingMessage Mime types advanced users Office 2007 supported file extensions and Mime types Type Property Microsoft Word, PowerPoint Excel Office 2007 supported featuresOffice 2007 supported properties Modified Forward to Logging in and out Using the toolbarUnderstanding the user interface Search basics Common tasksIAP Web Interface tasks Completing simple searchesTask Reference Simple Search Completing advanced searchesAdvanced Search page email content type Query Field Matches in the Document Additional advanced search query fieldsAs path c\abc\xyz FolderDisplaying query or search results Query Results page email content typeQuery results navigation bar Bars Saving query or search criteriaSave Criteria Saving query or search resultsSave Results Sending query or search resultsAccessing saved results Accessing saved criteriaExporting query or search results Deleting quarantine repositories Copying saved results to a quarantine repositoryTo search for multiple items, use the advanced search form Searching audit log repositoriesAdvanced Search page document content type Logged Action Description Logged actions and descriptionsQuery Field Matches Changing your password TroubleshootingTroubleshooting topics include Changing your languageUnable to display saved results Problems exporting resultsIAP Web Interface Word characters Query expressionsWord characters and separators Letters and digits in different character setsRegular expression definition of English word characters Letters and digits definedSupported character sets Matching wordsSupported character Description Set Matching word sequences Matching similar wordsFuzzy words Measuring word similarityMatching word sequences in attachments Proximity word sequencesExample 1. Separators are ignored Example 2. Sequence is not intuitiveExcel spreadsheet Boolean query expressions Boolean query expressionsSyntax Matches Query expression examples Nested Boolean query expressionsFollowing are examples of query expressions Query expression Finds documents with Query expression examplesQuery expression syntax and matching Index See IAP User Guide