Appendix 1. About Searchable PDF Files

(1) What is a searchable PDF file?

When a paper document is scanned and saved as a PDF file, everything in the document, including text, pictures, and charts, is stored as a single image.

In this case, the characters in the scanned document cannot be searched, because they are only part of the image and carry no textual information.

[Figure: An image-only PDF containing the characters "ABCDEFABCDEF". This image cannot be searched for character strings.]

To make the document searchable, the image data is processed with OCR (text recognition) software, which extracts textual information from the image and superimposes it transparently onto the image data. The image data can then be stored as a PDF file that contains textual information.

Although such a PDF file still looks like image data, the superimposed textual information makes it searchable, just like a text file.
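The superimposing described above can be illustrated with a small sketch. This is not the ScanSnap software's actual implementation, which this guide does not describe. It only shows one way the PDF format allows text to sit invisibly on top of an image: the page content paints the scanned image first, then draws the recognized words in text rendering mode 3 (invisible text). The names OcrWord, build_page_content, /Im0, and /F1 are hypothetical, and the word positions stand in for whatever an OCR engine would report.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class OcrWord:
    """One recognized word and its position (hypothetical OCR output)."""
    text: str
    x: float     # left edge, in PDF points from the left of the page
    y: float     # baseline, in PDF points from the bottom of the page
    size: float  # font size in points


def escape_pdf_string(text: str) -> str:
    """Escape the characters that are special inside a PDF literal string."""
    return text.replace("\\", "\\\\").replace("(", "\\(").replace(")", "\\)")


def build_page_content(page_width: float, page_height: float,
                       words: list[OcrWord]) -> str:
    """Return a page content stream: scanned image first, invisible text on top."""
    lines = [
        "q",
        f"{page_width:g} 0 0 {page_height:g} 0 0 cm",  # scale image to the page
        "/Im0 Do",                                     # paint the scanned image
        "Q",
        "BT",
        "3 Tr",                                        # rendering mode 3: invisible
    ]
    for word in words:
        lines.append(f"/F1 {word.size:g} Tf")                   # assumed font name
        lines.append(f"1 0 0 1 {word.x:g} {word.y:g} Tm")       # absolute position
        lines.append(f"({escape_pdf_string(word.text)}) Tj")    # the searchable text
    lines.append("ET")
    return "\n".join(lines)


if __name__ == "__main__":
    # A4 page (595 x 842 points) with one recognized word placed over the image.
    print(build_page_content(595, 842, [OcrWord("ABCDEF", 72, 700, 12)]))
```

Because the text is drawn in invisible mode, the page still looks like the scanned image, while a viewer's search function can find the string "ABCDEF" in the text layer. A complete PDF file would also need the image and font objects that /Im0 and /F1 refer to; they are omitted here to keep the sketch short.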

Only files scanned with the ScanSnap can be converted.

For details on how to convert PDF files into searchable PDF files, see “3.3. Converting PDF Files into Searchable Ones” (page 37).
