SQL Clone
SQLServerCentral is supported by Redgate
 
Log in  ::  Register  ::  Not logged in
 
 
 


Insert data from .pdf files


Insert data from .pdf files

Author
Message
Confusing Queries
Confusing Queries
Old Hand
Old Hand (345 reputation)Old Hand (345 reputation)Old Hand (345 reputation)Old Hand (345 reputation)Old Hand (345 reputation)Old Hand (345 reputation)Old Hand (345 reputation)Old Hand (345 reputation)

Group: General Forum Members
Points: 345 Visits: 528
Hi everyone,

We are facing a problem with loading data from .pdf files from vendor.
.pdf files have data in tabular format and we would like to insert those fields into a SQL table.
We do not want to insert the physical location of the file but, we need to insert the data within the file.

How can we read a pdf file?

Thanks & Regards
SQLCJ
SQLCJ
Say Hey Kid
Say Hey Kid (693 reputation)Say Hey Kid (693 reputation)Say Hey Kid (693 reputation)Say Hey Kid (693 reputation)Say Hey Kid (693 reputation)Say Hey Kid (693 reputation)Say Hey Kid (693 reputation)Say Hey Kid (693 reputation)

Group: General Forum Members
Points: 693 Visits: 576
SSC experts would definitely have an answer to this .. my wild guess though is Unsure ....probably firstly converting it to excel and then reading that excel... or writing some code in some language eg. java .. or using some third party software to read text from PDF...
Please see if following helps
http://stackoverflow.com/questions/4784825/how-to-read-pdf-files-using-java
http://www.a-pdf.com/data-extractor/
Virgilio Licovali Bustos...
Virgilio Licovali Bustos Ramirez
Forum Newbie
Forum Newbie (3 reputation)Forum Newbie (3 reputation)Forum Newbie (3 reputation)Forum Newbie (3 reputation)Forum Newbie (3 reputation)Forum Newbie (3 reputation)Forum Newbie (3 reputation)Forum Newbie (3 reputation)

Group: General Forum Members
Points: 3 Visits: 18
May be with iText:

http://itextpdf.com/

Regards
cathyhill345
cathyhill345
SSC Rookie
SSC Rookie (26 reputation)SSC Rookie (26 reputation)SSC Rookie (26 reputation)SSC Rookie (26 reputation)SSC Rookie (26 reputation)SSC Rookie (26 reputation)SSC Rookie (26 reputation)SSC Rookie (26 reputation)

Group: General Forum Members
Points: 26 Visits: 11
Confusing Queries (3/23/2014)
Hi everyone,

We are facing a problem with
loading data text from .pdf files from vendor.
.pdf files have data in tabular format and we would like to insert those fields into a SQL table.
We do not want to insert the physical location of the file but, we need to insert the data within the file.

How can we
read a pdf file?

Thanks & Regards


If you want to read a pdf file, I think you might use some PDF reading utility. And as for this question, I think you can find answer in this post.

http://www.sqlservercentral.com/Forums/Topic1339455-148-1.aspx

As for "you want to insert data filed that is in tabular format into SQL table", maybe you can check this post

http://social.msdn.microsoft.com/Forums/sqlserver/en-US/01bf1171-6165-4c29-9242-d7f11f9662d3/insert-pdf-fields-into-sql-table?forum=sqlintegrationservices

Hope it offers some useful help.:-D
SQL2219
SQL2219
Grasshopper
Grasshopper (13 reputation)Grasshopper (13 reputation)Grasshopper (13 reputation)Grasshopper (13 reputation)Grasshopper (13 reputation)Grasshopper (13 reputation)Grasshopper (13 reputation)Grasshopper (13 reputation)

Group: General Forum Members
Points: 13 Visits: 94
In Adobe:
File>Save As>Text

PDF table will convert like this:

Arizona

5

Alabama

4

Kansas

9

Missouri

3

Montana

2

Read Text file, parse it out.

Or, you might look at this application: Winautomation. Macro software that can read and write to sql db. it's an excellent application, I've used to to do some web scraping to store in SQL.
Eric M Russell
Eric M Russell
One Orange Chip
One Orange Chip (28K reputation)One Orange Chip (28K reputation)One Orange Chip (28K reputation)One Orange Chip (28K reputation)One Orange Chip (28K reputation)One Orange Chip (28K reputation)One Orange Chip (28K reputation)One Orange Chip (28K reputation)

Group: General Forum Members
Points: 28518 Visits: 11495
This is just a shot in the dark, but try Googling or Binging the following:
+"sql server" +iFilter +PDF +text filestream semantic


"The universe is complicated and for the most part beyond your control, but your life is only as complicated as you choose it to be."
Nadrek
Nadrek
SSCarpal Tunnel
SSCarpal Tunnel (4.4K reputation)SSCarpal Tunnel (4.4K reputation)SSCarpal Tunnel (4.4K reputation)SSCarpal Tunnel (4.4K reputation)SSCarpal Tunnel (4.4K reputation)SSCarpal Tunnel (4.4K reputation)SSCarpal Tunnel (4.4K reputation)SSCarpal Tunnel (4.4K reputation)

Group: General Forum Members
Points: 4444 Visits: 2741
On a non-technical level, has anyone asked the vendor what other formats they can send the data in? PDF is a print format for humans; having a computer pull data out of it is less than ideal compared to getting a fixed width text file, a delimited text file, or a variety of other formats.
Go


Permissions

You can't post new topics.
You can't post topic replies.
You can't post new polls.
You can't post replies to polls.
You can't edit your own topics.
You can't delete your own topics.
You can't edit other topics.
You can't delete other topics.
You can't edit your own posts.
You can't edit other posts.
You can't delete your own posts.
You can't delete other posts.
You can't post events.
You can't edit your own events.
You can't edit other events.
You can't delete your own events.
You can't delete other events.
You can't send private messages.
You can't send emails.
You can read topics.
You can't vote in polls.
You can't upload attachments.
You can download attachments.
You can't post HTML code.
You can't edit HTML code.
You can't post IFCode.
You can't post JavaScript.
You can post emoticons.
You can't post or upload images.

Select a forum

































































































































































SQLServerCentral


Search