Scanned PDFs must be run through Optical Character Recognition (OCR)
software at some stage before they can be indexed. This could be
done during the scan (some scanners/copiers provide for this) or after
the scan. If you have to do it after the scan, we've found that Adobe
Acrobat Professional works very well (Document -> Recognize Text using
OCR). There are also more elaborate solutions that can automate this for
you. OCR accuracy will vary with the software, the quality and resolution
of the scan, whether the page was scanned straight (not skewed), the font
used, and the colour of the paper of the original.
Once the document has been processed with the OCR software and saved,
both the image and the text are stored in the PDF document. The image is
what is displayed when the document is opened using Acrobat Reader, and
the text is stored as a hidden layer behind the image. It is always this
hidden text, not the image, that is indexed by the IFilter.
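
As an illustration of the automated route mentioned above, here is a minimal Python sketch that batch-processes a folder of scanned PDFs. It assumes the open-source OCRmyPDF command-line tool is installed and on the PATH; that tool is our choice for the example and is not one of the products discussed in this article. The function names and folder names are hypothetical. By default the script only builds the commands (a dry run) so you can inspect them before anything is executed.

```python
from __future__ import annotations

import subprocess
from pathlib import Path


def build_ocr_command(src: Path, dst: Path) -> list[str]:
    """Return an ocrmypdf command line that adds a text layer to one PDF."""
    return [
        "ocrmypdf",     # assumption: OCRmyPDF is installed and on the PATH
        "--deskew",     # straighten pages that were scanned at a slight angle
        "--skip-text",  # leave pages that already contain text untouched
        str(src),
        str(dst),
    ]


def ocr_folder(in_dir: Path, out_dir: Path, dry_run: bool = True) -> list[list[str]]:
    """Build (and, unless dry_run is set, run) one OCR command per PDF in in_dir."""
    if not dry_run:
        out_dir.mkdir(parents=True, exist_ok=True)
    commands = []
    for src in sorted(in_dir.glob("*.pdf")):
        cmd = build_ocr_command(src, out_dir / src.name)
        commands.append(cmd)
        if not dry_run:
            # check=True raises CalledProcessError if OCR fails for a file
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    # Hypothetical folder names; the dry run only prints the commands.
    for command in ocr_folder(Path("scans"), Path("searchable")):
        print(" ".join(command))
```

Calling ocr_folder(..., dry_run=False) would actually run the tool and write the searchable copies to the output folder. Whichever tool is used, the result should be the same kind of PDF described above: the scanned image plus a hidden text layer that the IFilter can index.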