Insert data from .pdf files
Posted Sunday, March 23, 2014 1:27 AM
SSC Veteran


Group: General Forum Members
Last Login: Friday, September 12, 2014 2:32 PM
Points: 209, Visits: 528
Hi everyone,

We are having trouble loading data from .pdf files supplied by a vendor.
The .pdf files contain data in tabular format, and we would like to insert those fields into a SQL table.
We do not want to store the physical location of the file; we need to insert the data contained in the file.

How can we read a PDF file?

Thanks & Regards
Post #1553798
Posted Monday, March 24, 2014 12:51 AM
SSC-Addicted


Group: General Forum Members
Last Login: Tuesday, September 16, 2014 2:39 AM
Points: 470, Visits: 483
The SSC experts will surely have an answer to this. My wild guess, though, is to first convert the PDF to Excel and then read the Excel file, or to write some code in a language such as Java, or to use third-party software to extract the text from the PDF.
Please see if the following helps:
http://stackoverflow.com/questions/4784825/how-to-read-pdf-files-using-java
http://www.a-pdf.com/data-extractor/
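
For readers who would rather script the "write some code" route, here is a minimal sketch. It swaps in Python and the third-party pypdf package in place of the Java approach linked above, so treat it as one possible illustration, not what the poster had in mind. The helper names extract_page_texts and split_columns are hypothetical, and the rule of splitting columns on runs of two or more spaces is an assumption that may not hold for a given vendor's PDFs.

```python
import re


def extract_page_texts(pdf_path):
    """Return the extracted text of each page, one string per page.

    Assumption: the third-party pypdf package is installed (pip install pypdf).
    The import is local so the rest of this module works without it.
    """
    from pypdf import PdfReader

    reader = PdfReader(pdf_path)
    # extract_text() can come back empty for scanned (image-only) pages.
    return [page.extract_text() or "" for page in reader.pages]


def split_columns(line):
    """Split one extracted line into fields on runs of two or more spaces.

    Assumption: columns are separated by wide gaps in the extracted text;
    real files may need a different rule.
    """
    return [field for field in re.split(r"\s{2,}", line.strip()) if field]
```

Text extraction only works when the PDF contains real text; a scanned image would need OCR first, which is outside this sketch.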
Post #1553877
Posted Tuesday, March 25, 2014 7:02 AM
Forum Newbie


Group: General Forum Members
Last Login: Tuesday, March 25, 2014 7:01 AM
Points: 3, Visits: 15
Maybe with iText:

http://itextpdf.com/

Regards
Post #1554438
Posted Tuesday, March 25, 2014 9:04 PM
Forum Newbie


Group: General Forum Members
Last Login: Wednesday, March 26, 2014 2:18 AM
Points: 6, Visits: 11
Confusing Queries (3/23/2014)
Hi everyone,

We are facing a problem with loading data from .pdf files from vendor.
.pdf files have data in tabular format and we would like to insert those fields into a SQL table.
We do not want to insert the physical location of the file but, we need to insert the data within the file.

How can we read a pdf file?

Thanks & Regards


If you want to read a PDF file, you could use a PDF reading utility. You may find an answer to that question in this post:

http://www.sqlservercentral.com/Forums/Topic1339455-148-1.aspx

As for inserting data fields that are in tabular format into a SQL table, you could check this post:

http://social.msdn.microsoft.com/Forums/sqlserver/en-US/01bf1171-6165-4c29-9242-d7f11f9662d3/insert-pdf-fields-into-sql-table?forum=sqlintegrationservices

Hope it offers some useful help.
Post #1554753
Posted Thursday, March 27, 2014 6:47 AM
Forum Newbie


Group: General Forum Members
Last Login: Wednesday, November 5, 2014 7:40 AM
Points: 4, Visits: 62
In Adobe:
File > Save As > Text

The PDF table will be converted like this:

Arizona

5

Alabama

4

Kansas

9

Missouri

3

Montana

2

Read the text file and parse it out.
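
A rough sketch of that parsing step in Python, assuming the saved text really does alternate between a name line and a number line as in the sample above. The function names, the state_counts table, and its columns are made up for illustration, and an in-memory SQLite database stands in for SQL Server so the example runs on its own.

```python
import sqlite3


def parse_pairs(text):
    """Pair alternating non-blank lines into (name, count) tuples."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if len(lines) % 2 != 0:
        raise ValueError("expected name/value pairs, got an odd number of lines")
    return [(lines[i], int(lines[i + 1])) for i in range(0, len(lines), 2)]


def load_rows(conn, rows):
    """Insert parsed rows into the hypothetical state_counts table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS state_counts (state TEXT, total INTEGER)"
    )
    conn.executemany(
        "INSERT INTO state_counts (state, total) VALUES (?, ?)", rows
    )
    conn.commit()


# Same shape as the text produced by File > Save As > Text.
sample = (
    "Arizona\n\n5\n\nAlabama\n\n4\n\nKansas\n\n9\n\n"
    "Missouri\n\n3\n\nMontana\n\n2\n"
)
rows = parse_pairs(sample)

conn = sqlite3.connect(":memory:")  # stand-in for a real SQL Server connection
load_rows(conn, rows)
stored = conn.execute(
    "SELECT state, total FROM state_counts ORDER BY rowid"
).fetchall()
```

Against a real SQL Server the same parameterized insert pattern would apply through whatever driver you already use; only the connection and the placeholder style would change.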

Or you might look at this application: WinAutomation. It is macro software that can read from and write to a SQL database. It's an excellent application; I've used it to do some web scraping and store the results in SQL.
Post #1555397
Posted Thursday, March 27, 2014 11:20 AM


SSCommitted


Group: General Forum Members
Last Login: Yesterday @ 3:15 PM
Points: 1,754, Visits: 4,965
This is just a shot in the dark, but try Googling or Binging the following:
+"sql server" +iFilter +PDF +text filestream semantic
Post #1555579
Posted Friday, March 28, 2014 11:45 AM
SSC Eights!


Group: General Forum Members
Last Login: Wednesday, November 19, 2014 12:06 PM
Points: 887, Visits: 2,453
On a non-technical level, has anyone asked the vendor what other formats they can send the data in? PDF is a print format for humans; having a computer pull data out of it is less than ideal compared to getting a fixed width text file, a delimited text file, or a variety of other formats.
Post #1556086