Logo 
Search:

MOSS Forum

Ask Question   UnAnswered
Home » Forum » MOSS       RSS Feeds

Promote PDF doc properties to MOSS2007 doc library w/ Content Types

  Asked By: Eliezer    Date: Sep 28    Category: MOSS    Views: 2671

My organization distributes information to its membership primarily via
PDFs. The PDFs are mostly created in non-Microsoft apps
programmatically. I have been able to figure out how to set properties
for the PDFs using Acrobat javascript and other methods.

Here is my problem: I have a requirement that the metadata (doc
properties, etc.) be promoted from the PDF document to the MOSS2007
document library upon upload or very soon thereafter. The resulting
document in SharePoint also needs to be associated with a Content Type.
Furthermore, I need this to work for single file uploads as well as bulk
file uploads.

I've looked for a solution out on the web and have not found one yet
that directly addresses this need. I have found a potential solution
that can promote the metadata if it resides in a separate text file (see
this article
<www.sharepointblogs.com/.../sharepoint-20\
07-upload-a-document-with-its-meta-data.aspx> ), which my developer
buddy is looking into.

Given the level of technical knowledge in this group, I figure one of
you may have come across a solution to this issue. If you have, I would
love to know about it.

Share: 

 

9 Answers Found

 
Answer #1    Answered By: Larry Gonzales     Answered On: Sep 28

Does anyone have any idea how to do this? Even something close might
help.

 
Answer #2    Answered By: Lamont Tran     Answered On: Sep 28

In the absence of any response from someone with more definite
information, I believe there is a 3rd party iFilter for pdf  files
that you can buy. One of its advantages over the Adobe filter is
that it can extract metadata  as properties  that you can extract for
search.

I guess though you would still need to write custom code to promote
these properties into document library  columns.

 
Answer #3    Answered By: Orlando Silva     Answered On: Sep 28

I'll reseach that more. I was under the
impression that the iFilters were primarily for Search and indexing.
I'll keep you posted on what I find out.

 
Answer #4    Answered By: Dana Hodges     Answered On: Sep 28

iFilters are used for indexing because they understand the underlying
structure of a binary document  and how to retrieve metadata  from within
that structure, something the indexor needs.

 
Answer #5    Answered By: Kent Harmon     Answered On: Sep 28

I'm finding that there is a very limited
amount of information  about how to promote  PDF properties  up to a
document library's metadata  columns. Even the PDFs I create/publish
in Word 2007 don't propagate the properties to the doc  library.
There has got to be a way to do this.

 
Answer #6    Answered By: Alexis Pearson     Answered On: Sep 28

I've never read of anyone successfully doing it. It doesn't behoove
Microsoft to promote  the PDF standard.

 
Answer #7    Answered By: Hugo Park     Answered On: Sep 28

Here's a code sample for a 2003 web  part that claimed it would autocreate
site columns based on EXIF metadata  from JPEG images, so it's conceivable to
do the same with pdf  metadata.
www.gotdotnet.com/.../Details.aspx\
dd48-48c7-a503-b85b404cf6db

Another thought is that with pdf metadata iFilters if you just need to
display the metadata for library  of PDF documents could you make a web part
page that connects a list view of the doc  library and a search result part?
So when you click on the doc title in the list view part it passes that
information to the search result part to display the metadata as a custom
search result on the same page?

 
Answer #8    Answered By: Sterling Allison     Answered On: Sep 28

I have my developer buddy looking into the custom web  part route. It
probably will not do what we're looking for, but there may be a way
using an offshoot to your suggestion.

As far as Microsoft's support for pdf's, I understand the reasoning
for them not to come out and endorse the standard. That is their
way. However, the partner ecosystem is missing out on a large
opportunity with MOSS2007 exploding in popularity for enterprise
content management. I guarantee you that there are plenty of
organizations that are storing pdfs  in their doc  libraries and would
like to see the properties  they've defined for them automatically
exposed.

 
Answer #9    Answered By: Jessie Burton     Answered On: Sep 28
 




Tagged: