Sharepoint Forum

WSS Search, PDF images

  Asked By: Jordan    Date: Mar 25    Category: Sharepoint    Views: 1952

How would I search Adobe PDF scanned images without an OCR solution?
We are about to install WSS with SQL 2005. I was hoping we could use
a "workaround" solution: searching on filenames, and/or perhaps
searching on categories in a document list. Any advice would
be appreciated.

8 Answers Found

 
Answer #1    Answered By: Cory Brooks     Answered On: Mar 25

How would I search Adobe PDF scanned images, without an OCR solution?
We are about to install WSS with SQL 2005. I was hoping we could use
a "workaround" solution: - searching off of filenames, and/or perhaps
by - searching off of categories in a document list.

 
Answer #2    Answered By: Ruth George     Answered On: Mar 25

Without OCR, your only option would be to add some extra metadata columns to the
document library and enter the search keywords into those manually. OCR is what
turns the image of the scanned page into text, and text is the only type of
information that can be searched against.

 
Answer #3    Answered By: Peter Peterson     Answered On: Mar 25

I'm ok with having users add metadata, as long as I can verify that
WSS site-level SQL FREETEXT search will find the PDF scanned images
based on this metadata (and/or filename - since we do have a logical
naming convention as well). Does anyone know if WSS search can do
this?

 
Answer #4    Answered By: Kalyan Pujari     Answered On: Mar 25

Metadata columns and filenames in the document library are searchable. Metadata
inside the PDF file itself should also be, assuming you install the PDF iFilter.
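
To make the idea concrete, here is a minimal sketch in Python of why filenames and manually entered keywords are enough to find a scanned PDF. This is not the WSS search API; the `ScannedDoc` class, the `search` function, and the sample filenames are all hypothetical and only model the matching logic.

```python
from __future__ import annotations

import re
from dataclasses import dataclass, field


@dataclass
class ScannedDoc:
    """Hypothetical stand-in for a document library item (not a WSS type)."""
    filename: str
    keywords: list[str] = field(default_factory=list)  # manually entered metadata


def tokens(doc: ScannedDoc) -> set[str]:
    """Collect lowercase words from the filename and the keyword metadata."""
    text = " ".join([doc.filename, *doc.keywords]).lower()
    return {t for t in re.split(r"[^a-z0-9]+", text) if t}


def search(docs: list[ScannedDoc], query: str) -> list[str]:
    """Return filenames of documents whose tokens contain every query term."""
    terms = query.lower().split()
    return [d.filename for d in docs if all(t in tokens(d) for t in terms)]
```

The scanned page image itself never takes part in the match; only the text that a person typed (the filename and the keywords) does, which is why a consistent naming convention helps.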

 
Answer #5    Answered By: Isidro Berger     Answered On: Mar 25

It should be possible to install an iFilter, which will then index your
PDF files.

I've never tried it though; it was just something mentioned in a course.

See www.ifilter.org, or search for "ifilter" in any search engine.

 
Answer #6    Answered By: Schuyler Le     Answered On: Mar 25

The Adobe iFilter doesn't handle scanned PDFs, only text PDFs.

 
Answer #7    Answered By: Kristina Cox     Answered On: Mar 25

Yes it does do this. The table below shows the SQL tables and columns
that are indexed by WSS search.

Table     Indexed Columns               Description

UserInfo  tp_Login, tp_Title, tp_Email  Details of users that have visited a site collection
Lists     tp_Title, tp_Description      Title and description of lists
UserData  nvarchar1 to nvarchar64,      Text based custom columns for items in a list or library
          ntext1 to ntext32
Docs      LeafId, Content               Filename of document and its content
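
For anyone who wants to check a column against that list, here is a small Python sketch that encodes the table as a lookup. The table and column names are copied from the table above; the `INDEXED_COLUMNS` dictionary and the `is_indexed` helper are hypothetical names for illustration and do not query SQL Server or WSS.

```python
# Indexed columns per table, as listed in the table above.
# The numbered ranges (nvarchar1 to nvarchar64, ntext1 to ntext32) are expanded.
INDEXED_COLUMNS = {
    "UserInfo": ["tp_Login", "tp_Title", "tp_Email"],
    "Lists": ["tp_Title", "tp_Description"],
    "UserData": [f"nvarchar{i}" for i in range(1, 65)]
    + [f"ntext{i}" for i in range(1, 33)],
    "Docs": ["LeafId", "Content"],
}


def is_indexed(table: str, column: str) -> bool:
    """Return True if the column appears in the list for that table."""
    return column in INDEXED_COLUMNS.get(table, [])
```

Going by the table, text-based custom metadata columns fall under UserData, so keywords entered there are covered by the search even when the PDF content itself is only an image.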

 
Answer #8    Answered By: Buyi Wen     Answered On: Dec 03

I found a free online OCR tool, http://www.online-code.net/ocr.html, that converts images to text.

 