Reading the pdf file using selenium webdriver

0 votes
I m clicking on a link on a web page.When clicking on a link, a pdf file opens up in new window. I had to read that pdf file to validate some data against the transactions done. How can I validate it without downloading it?
Apr 25, 2018 in Selenium by Martin
• 4,220 points
2,561 views

1 answer to this question.

0 votes

Use should use PDFBox and FontBox.

    public String readPDFInURL() throws EmptyFileException, IOException {

        WebDriver driver = new FirefoxDriver();

        // page with example pdf document

        driver.get("file:///C:/Users/admin/Downloads/theleader.pdf");

        URL url = new URL(driver.getCurrentUrl());

        InputStream is = url.openStream();

        BufferedInputStream fileToParse = new BufferedInputStream(is);

        PDDocument document = null;

        try {

            document = PDDocument.load(fileToParse);

            String output = new PDFTextStripper().getText(document);

        } finally {

            if (document != null) {

                document.close();

            }

            fileToParse.close();

            is.close();

        }

        return output;

    }

Some of the functions from the older versions of PDFBox have been deprecated, so we need to use another FontBox along with PDFBox. I have used PDFBox (2.0.3) and FontBox (2.0.3) and it is working fine. It won't read images though.

answered Apr 25, 2018 by Vardy
• 2,250 points

Related Questions In Selenium

0 votes
1 answer

How can I download the *.jar file from http:// seleniumhq.org using selenium WebDriver?

For Selenium Standalone Server use this: profile.setPreference("browser.helperApps.neverAsk.saveToDisk", "application/java-archive"); and ...READ MORE

answered Apr 9, 2018 in Selenium by Martin
• 4,220 points
563 views
0 votes
1 answer

How to scroll the Page up or down in Selenium WebDriver? (using java)

Its pretty simple. Check out the below ...READ MORE

answered Apr 5, 2018 in Selenium by nsv999
• 5,090 points
7,874 views
0 votes
1 answer

How do i change the location where my file gets downloaded in Selenium Webdriver and firefox driver?

There are a couple of errors there. ...READ MORE

answered Apr 12, 2018 in Selenium by nsv999
• 5,090 points
1,614 views
0 votes
1 answer
0 votes
1 answer

Finding WebDriver element with Class Name in java

The better way to handle this element ...READ MORE

answered Apr 10, 2018 in Selenium by nsv999
• 5,090 points
780 views
0 votes
1 answer
0 votes
1 answer

Geo-location microphone camera pop up

To Allow or Block the notification, access using Selenium and you have to ...READ MORE

answered May 11, 2018 in Selenium by Samarpit
• 5,130 points
1,025 views
0 votes
1 answer

How to use such xpath to find web elements

It's much simpler to use: #MyDiv123 > div.super or ...READ MORE

answered Jun 14, 2018 in Selenium by Samarpit
• 5,130 points
726 views
0 votes
1 answer

Using gettext() method for the specific element using Selenium WebDriver

Mistake is that u r printing the ...READ MORE

answered Apr 10, 2018 in Selenium by Vardy
• 2,250 points
6,669 views
0 votes
1 answer

Select an item from a dropdown list using Selenium WebDriver

Use this then it will work - new ...READ MORE

answered Apr 9, 2018 in Selenium by Vardy
• 2,250 points
2,417 views