December 4, 2023 at 9:14 pm
If I just use a PowerQuery inside my data flow, I can just read the table and tell PowerQuery to ignore the rest. Then it's stupid easy. No need to use Excel at all.
December 4, 2023 at 10:30 pm
First convert the PDF to excel format. Remove any unwanted lines, rows or any characters. Verify the data is correctly formatted, then save it as .csv file. Use import/export wizard to import the data to SQL Server table.
I'll try that and see what happens. The challenge will be getting the 776 pages done in my lifetime!
Rick
Disaster Recovery = Backup ( Backup ( Your Backup ) )
December 5, 2023 at 1:46 am
If your data is in tables in the PDF file, you can tell PowerQuery to return just the table. Is this a publicly available PDF file? (likely not)... I wanted to see if I could get PowerQuery to read it and then maybe dump that into a table.
(I have sooo wished for a PowerQuery transform in SSIS since forever.. so I could read from a PDF or whatever, and then send the data anywhere I want... Oh right, PYTHON.
Viewing 3 posts - 16 through 18 (of 18 total)
You must be logged in to reply to this topic. Login to reply
This website stores cookies on your computer.
These cookies are used to improve your website experience and provide more personalized services to you, both on this website and through other media.
To find out more about the cookies we use, see our Privacy Policy