MOSS Forum

 
Home » Forum » MOSS       Ask a questionRSS Feeds

Promote PDF doc properties to MOSS2007 doc library w/ Content Types

  Asked By: Eliezer Preston         Date: Sep 28, 2007      Category: MOSS      Views: 954
 

My organization distributes information to its membership primarily via
PDFs. The PDFs are mostly created in non-Microsoft apps
programmatically. I have been able to figure out how to set properties
for the PDFs using Acrobat javascript and other methods.

Here is my problem: I have a requirement that the metadata (doc
properties, etc.) be promoted from the PDF document to the MOSS2007
document library upon upload or very soon thereafter. The resulting
document in SharePoint also needs to be associated with a Content Type.
Furthermore, I need this to work for single file uploads as well as bulk
file uploads.

I've looked for a solution out on the web and have not found one yet
that directly addresses this need. I have found a potential solution
that can promote the metadata if it resides in a separate text file (see
this article
<www.sharepointblogs.com/.../sharepoint-20\
07-upload-a-document-with-its-meta-data.aspx> ), which my developer
buddy is looking into.

Given the level of technical knowledge in this group, I figure one of
you may have come across a solution to this issue. If you have, I would
love to know about it.

Tagged:                    

 

9 Answers Found

 
Answer #1       Answered By: Larry Gonzales          Answered On: Sep 28, 2007       

Does anyone have any idea how to do this? Even something close might
help.

 
Answer #2       Answered By: Lamont Tran          Answered On: Sep 28, 2007       

In the absence of any response from someone with more definite
information, I believe there is a 3rd party iFilter for pdf  files
that you can buy. One of its advantages over the Adobe filter is
that it can extract metadata  as properties  that you can extract for
search.

I guess though you would still need to write custom code to promote
these properties into document library  columns.

 
Answer #3       Answered By: Orlando Silva          Answered On: Sep 28, 2007       

I'll reseach that more. I was under the
impression that the iFilters were primarily for Search and indexing.
I'll keep you posted on what I find out.

 
Answer #4       Answered By: Dana Hodges          Answered On: Sep 28, 2007       

iFilters are used for indexing because they understand the underlying
structure of a binary document  and how to retrieve metadata  from within
that structure, something the indexor needs.

 
Answer #5       Answered By: Kent Harmon          Answered On: Sep 28, 2007       

I'm finding that there is a very limited
amount of information  about how to promote  PDF properties  up to a
document library's metadata  columns. Even the PDFs I create/publish
in Word 2007 don't propagate the properties to the doc  library.
There has got to be a way to do this.

 
Answer #6       Answered By: Alexis Pearson          Answered On: Sep 28, 2007       

I've never read of anyone successfully doing it. It doesn't behoove
Microsoft to promote  the PDF standard.

 
Answer #7       Answered By: Hugo Park          Answered On: Sep 28, 2007       

Here's a code sample for a 2003 web  part that claimed it would autocreate
site columns based on EXIF metadata  from JPEG images, so it's conceivable to
do the same with pdf  metadata.
www.gotdotnet.com/.../Details.aspx\
dd48-48c7-a503-b85b404cf6db

Another thought is that with pdf metadata iFilters if you just need to
display the metadata for library  of PDF documents could you make a web part
page that connects a list view of the doc  library and a search result part?
So when you click on the doc title in the list view part it passes that
information to the search result part to display the metadata as a custom
search result on the same page?

 
Answer #8       Answered By: Sterling Allison          Answered On: Sep 28, 2007       

I have my developer buddy looking into the custom web  part route. It
probably will not do what we're looking for, but there may be a way
using an offshoot to your suggestion.

As far as Microsoft's support for pdf's, I understand the reasoning
for them not to come out and endorse the standard. That is their
way. However, the partner ecosystem is missing out on a large
opportunity with MOSS2007 exploding in popularity for enterprise
content management. I guarantee you that there are plenty of
organizations that are storing pdfs  in their doc  libraries and would
like to see the properties  they've defined for them automatically
exposed.

 
Answer #9       Answered By: Jessie Burton          Answered On: Sep 28, 2007       
 


Your Answer
  • Answer should be atleast 30 Characters.
  • Please put code inside [Code] your code [/Code].