Extract content from a PDF file, how?

The M-Files indexer and a number of IML services can read the text content of text-based file types, so methods for this must exist. I just can't find a way to extract that content in VBScript (the API documentation is down at the moment, which makes it hard to look anything up). Do any of you clever folks remember a method that would let me get the text content of a PDF file and run it through a regex to extract metadata from that content?

Thank you, Karl
