Converting PDF files to Excel XLS

0 votes
Two hours ago, a friend of mine who is doing an internship asked whether I could save him from manually converting 462 PDF files to .xls with a free online tool.

My first thought was a shell script around unoconv, but I'm not sure how to use it properly, or whether it can handle this at all, since it mostly converts files to PDF rather than the other way around.
Oct 30, 2022 in Others by Kithuzzz
• 38,010 points
541 views

1 answer to this question.

0 votes
Converting from PDF to any structured format is not always possible and typically not advised. That said, there are quite a few files (462) and this does appear to be a one-off job.

It is worth pursuing if you can dependably extract text from the majority of the files and that text is reasonably structured. Start by dumping a sample of the PDFs to plain text and checking whether you can reliably parse the output into a table structure.
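One way to judge a sample, as a rough sketch: count how many lines of the extracted text split into the number of columns you expect. This assumes the extractor keeps columns apart with runs of two or more spaces, which will not hold for every PDF; `table_line_ratio` and the sample text are made up for illustration.

```python
import re

# Assumption: columns in the extracted text are separated by 2+ whitespace characters.
COLUMN_GAP = re.compile(r"\s{2,}")

def table_line_ratio(text, expected_columns):
    """Return the fraction of non-blank lines that split into expected_columns cells."""
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    if not lines:
        return 0.0
    matching = sum(
        1 for line in lines if len(COLUMN_GAP.split(line)) == expected_columns
    )
    return matching / len(lines)

# Hypothetical sample: two table lines and one line of free text.
sample = "Item    Qty    Price\nWidget    2    3.50\nTotal due on receipt\n"
ratio = table_line_ratio(sample, expected_columns=3)
print(f"{ratio:.2f}")  # prints 0.67
```

If the ratio is low across the sample, the files are probably not worth parsing automatically.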

A quick search will turn up plenty of tools for direct or OCR-based text extraction.

One that I like using is pstotext, which runs on top of Ghostscript. With the -bboxes option I can get the coordinates of each word and reassemble the structure on my own. Despite its name, it also works with PDF input. The drawback is that it can be a little erratic: it works on some PDFs but not on others.
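To give an idea of what reassembling the structure from word coordinates can look like, here is a minimal sketch. It does not parse pstotext output; it assumes you have already turned the boxes into `(x0, y0, x1, y1, text)` tuples with y increasing down the page (flip the sort if your coordinates run the other way). The function name, the tuple layout, and both thresholds are my own choices and would need tuning per document.

```python
def words_to_rows(words, line_tolerance=3.0, column_gap=15.0):
    """Group (x0, y0, x1, y1, text) word boxes into rows of table cells.

    Words whose y0 lies within line_tolerance of the first word on a line
    share that line; a horizontal gap wider than column_gap starts a new cell.
    """
    lines = []
    # Walk the words top to bottom (assumed: smaller y0 is higher on the page).
    for word in sorted(words, key=lambda w: (w[1], w[0])):
        if lines and abs(word[1] - lines[-1][0][1]) <= line_tolerance:
            lines[-1].append(word)
        else:
            lines.append([word])

    rows = []
    for line in lines:
        line.sort(key=lambda w: w[0])  # left to right within the line
        cells = [line[0][4]]
        prev_x1 = line[0][2]
        for x0, _y0, x1, _y1, text in line[1:]:
            if x0 - prev_x1 > column_gap:
                cells.append(text)        # wide gap: new column
            else:
                cells[-1] += " " + text   # small gap: same cell
            prev_x1 = x1
        rows.append(cells)
    return rows

# Hypothetical word boxes for a two-line table.
words = [
    (10, 100, 40, 110, "Widget"),
    (45, 100, 60, 110, "A"),
    (120, 101, 130, 111, "2"),
    (200, 100, 230, 110, "3.50"),
    (10, 80, 40, 90, "Item"),
    (120, 80, 140, 90, "Qty"),
    (200, 81, 230, 91, "Price"),
]
rows = words_to_rows(words)
```

With these sample boxes, `rows` holds a header row and one data row, with "Widget" and "A" merged into a single cell because the gap between them is small.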

If you get that far, you will probably need to write a shell script or a small program to turn the extracted text into CSV. From there you have two options: open the CSV directly in a spreadsheet, or find a tool that converts it to XLS.
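The CSV step could look something like the sketch below. It assumes each PDF has already been dumped to a `.txt` file and that columns are separated by runs of two or more spaces; the function names and directory layout are hypothetical, and real files will likely need extra cleanup (headers, page footers, wrapped cells).

```python
import csv
import io
import re
from pathlib import Path

# Assumption: columns in the extracted text are separated by 2+ whitespace characters.
COLUMN_GAP = re.compile(r"\s{2,}")

def text_to_rows(text):
    """Split each non-blank line of extracted text into a list of cells."""
    return [
        COLUMN_GAP.split(line.strip())
        for line in text.splitlines()
        if line.strip()
    ]

def text_to_csv(text):
    """Return the extracted text as a CSV string (cells with commas get quoted)."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(text_to_rows(text))
    return buffer.getvalue()

def convert_directory(text_dir, csv_dir):
    """Write one .csv per .txt file in text_dir; return the paths written."""
    text_dir, csv_dir = Path(text_dir), Path(csv_dir)
    csv_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for txt_path in sorted(text_dir.glob("*.txt")):
        out_path = csv_dir / (txt_path.stem + ".csv")
        csv_text = text_to_csv(txt_path.read_text(encoding="utf-8"))
        # newline="" keeps the line endings exactly as text_to_csv produced them.
        with out_path.open("w", newline="", encoding="utf-8") as handle:
            handle.write(csv_text)
        written.append(out_path)
    return written
```

Calling `convert_directory("extracted_text", "csv_out")` (both directory names are placeholders) would process the whole batch in one go, and the resulting CSV files open directly in Excel or LibreOffice Calc.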
answered Oct 30, 2022 by narikkadan
• 63,420 points
