• dawnbrown243 (5/7/2013)


    OCR means Optical character recognition, it is the mechanical or electronic conversion of scanned images of handwritten, typewritten or printed text into machine-encoded text. Some PDFs are scans, so OCR recongnition[/url] would be required, PDF format is well-documented, PDF have multiple columns and the extraction of pdf text needs to use a mature and structure pdf reading app.

    Even though this thread is getting old, it was never fully resolved and remains interesting.

    Are you able to suggest how to "use a mature and structure (sic.) pdf reading app" in SSIS to solve this problem?

    If you haven't even tried to resolve your issue, please don't expect the hard-working volunteers here to waste their time providing links to answers which you could easily have found yourself.