Sharepoint Forum

PDF IFilter

  Asked By: Akhila    Date: Jul 24    Category: Sharepoint    Views: 1217

I installed the PDF IFilter, but I still cannot search inside PDF documents;
only metadata searches work.

I have a scanned document and a scanned map containing text,
and the search does not find that text.


11 Answers Found

 
Answer #1    Answered By: Trinity Scott     Answered On: Jul 24

To crawl inside PDFs, you must install the Adobe PDF IFilter.

 
Answer #2    Answered By: Constance Guerrero     Answered On: Jul 24

I did that, and it still doesn't work.
I checked the registry and everything.

 
Answer #3    Answered By: Chandrabhan Konwar     Answered On: Jul 24
 
Answer #4    Answered By: Tina Owens     Answered On: Jul 24

PDF files created by scanning are treated as images, not text. Unless you
put them through an Optical Character Recognition (OCR) process, there is no
text to be indexed, only metadata.
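
One way to check whether a given PDF actually carries a text layer is to try
extracting text from each page. The following is only a minimal sketch,
assuming the third-party pypdf package is installed; the helper names and the
20-character threshold are illustrative choices, not anything from this thread.

```python
from typing import Iterable, List


def pages_without_text(page_texts: Iterable[str], min_chars: int = 20) -> List[int]:
    """Return 1-based page numbers whose extracted text is shorter than min_chars.

    The threshold is an arbitrary assumption: a page with almost no
    extractable text is probably a scanned image without an OCR layer.
    """
    return [
        number
        for number, text in enumerate(page_texts, start=1)
        if len((text or "").strip()) < min_chars
    ]


def extract_page_texts(path: str) -> List[str]:
    """Extract the text of each page using pypdf (assumed: pip install pypdf)."""
    # Imported lazily so pages_without_text() works without pypdf installed.
    from pypdf import PdfReader

    reader = PdfReader(path)
    return [page.extract_text() or "" for page in reader.pages]
```

Calling pages_without_text(extract_page_texts("scanned_map.pdf")) (the file
name is hypothetical) lists the pages that look image-only. If every page is
listed, the IFilter has nothing to hand to the indexer, however it is
configured.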

 
Answer #5    Answered By: Tiana Whitaker     Answered On: Jul 24

It still does not work.
I even OCR'd everything.
I can find the text inside the PDF document itself,
but a SharePoint search does not return the document containing that text.

 
Answer #6    Answered By: Alice Chandler     Answered On: Jul 24

Does the document show as crawled successfully in the crawl log?

The ifilttst.exe program can be used to confirm that the IFilter is
installed correctly (although it will not confirm that it is configured
in SharePoint correctly).
msdn2.microsoft.com/en-us/library/ms692580.aspx

Watch out for slightly different results when searching within Adobe
Reader or Acrobat versus the final output of a crawl and what can
ultimately be searched in SharePoint. The word breaker/stemmer can give
you slightly different results between the two.
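
To illustrate why the two can disagree, here is a toy sketch contrasting a
plain substring search with a search over an index of whole tokens. It is
not how Adobe Reader or the SharePoint word breaker actually work; the
tokenizing rule and the sample text are assumptions made up for this example.

```python
import re


def substring_match(text: str, query: str) -> bool:
    """Viewer-style search (assumed): the query may match anywhere in the text."""
    return query.lower() in text.lower()


def token_match(text: str, query: str) -> bool:
    """Index-style search (assumed): the query must equal a whole token.

    The toy word breaker splits on anything that is not a letter or digit.
    """
    tokens = set(re.findall(r"[a-z0-9]+", text.lower()))
    return query.lower() in tokens


sample = "Parcel map of Lot 12-B, re-indexed in 2007"

for query in ("index", "12-B", "parcel"):
    print(query, substring_match(sample, query), token_match(sample, query))
```

In this toy, "index" and "12-B" are found by the substring search but not by
the token search, while "parcel" is found by both. When testing whether a
crawl worked, it is safer to search SharePoint for a distinctive whole word
from the document.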

 
Answer #7    Answered By: Lynette Sawyer     Answered On: Jul 24

You might find the answer in one of these 2 URLs:

www.sharepointblogs.com/.../13812.aspx
geekswithblogs.net/.../96045.aspx

 
Answer #8    Answered By: Alisha Itagi     Answered On: Jul 24

The links you provided work for any content added after you have installed
and registered the PDF filter properly. For any older content, you have to
change a metadata value in the document library list so that the item gets
re-indexed. So you could always write a quick-and-dirty script that touches
every file and changes the owner through the API, which would then force the
indexer to go out and read through the documents again.

I believe that Microsoft/Adobe are looking at how to fix this. In SPS 2003
it was a piece of cake with SQL Server, because you could just tell it to
rebuild the full-text search (FTS) catalogs. Unfortunately, since '07 no
longer works with the FTS concept, you're SOL. Likewise with WSS v2 you could
force the FTS update, but not in WSS v3, since its search is just a crippled
version of MOSS search.

 
Answer #9    Answered By: Gretchen Stokes     Answered On: Jul 24

You can, of course, just run a full index and it will index the PDFs.
Which route is easier just depends on how much content you have.
If you do carry out a full re-index, turn off alerting beforehand and
turn it on again afterwards.

 
Answer #10    Answered By: Angarika Shroff     Answered On: Jul 24

True, but how do you set it to do a full index if you're just using good ol'
reliable WSS and not MOSS?

 
Answer #11    Answered By: Eliza Hutchinson     Answered On: Jul 24

I don't have a server close, but if my memory is right:

stsadm.exe -o spsearch -action fullcrawlstart
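
If you want to script that, a small wrapper could look like the sketch below.
The command line is the one quoted above (from memory, so check it on your
server first); the stsadm.exe path is an assumed default for a WSS 3.0
install, and the function names are made up for this example.

```python
import subprocess
from typing import List

# Assumed default location of stsadm.exe on a WSS 3.0 server; adjust as needed.
DEFAULT_STSADM = (
    r"C:\Program Files\Common Files\Microsoft Shared"
    r"\web server extensions\12\BIN\stsadm.exe"
)


def full_crawl_command(stsadm: str = DEFAULT_STSADM) -> List[str]:
    """Build the argument list for the full-crawl command quoted above."""
    return [stsadm, "-o", "spsearch", "-action", "fullcrawlstart"]


def start_full_crawl(stsadm: str = DEFAULT_STSADM) -> int:
    """Run the command on the server and return its exit code."""
    result = subprocess.run(full_crawl_command(stsadm), check=False)
    return result.returncode
```

Run start_full_crawl() from an account with farm administrator rights on the
server itself. Per the earlier answer, consider turning off alerting before
the crawl and turning it back on afterwards.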

 