Extract multiple pdf pages with specific criteria

+1 vote
Hi All,

Can someone help me with PDF page extraction in UiPath? I have a PDF with 500 pages, and its data falls under multiple criteria.

For example, pages 1 to 21 have one criterion, pages 22 to 26 have another, and pages 27 to 30 have another. I want to extract specific pages according to the criteria. For example, I would like to extract the first 21 pages as one PDF, then pages 22 to 26 as one PDF, and then pages 27 to 30 as one PDF.

The criteria appear as a header on each page of the PDF. I want to name the extracted PDFs after the criteria name.

Thanks
Aug 24, 2020 in RPA by Moin
• 150 points
780 views

1 answer to this question.

0 votes

You have to provide anchors and ensure the correct page boundaries are identified - not just the keywords.

Here's the idea:

  1. Read the PDF page by page with the Read PDF Text activity.
  2. Search the page text for the criteria header with the IsMatch activity.
  3. If a match is found (IsMatch returns a Boolean), add a data row containing the search text and the starting page number.
  4. Increment the page number.
  5. Repeat from step 1 for the next page.
  6. When a later page starts a new criterion, update the previous data row with its ending page number.
  7. When the last page is read, update the final data row with its ending page number.
  8. Finally, use Extract PDF Page Range to extract the pages for each data row.
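
To make the bookkeeping in steps 2 to 7 concrete, here is a minimal sketch in Python rather than UiPath activities. It assumes the page texts have already been read into a list of strings (one per page) and that the header looks like "Criteria: <name>"; the names `find_page_ranges`, `PageRange` and the header pattern are hypothetical, chosen only for illustration. Reading the PDF and writing the extracted files are left out, since in UiPath those are handled by Read PDF Text and Extract PDF Page Range.

```python
import re
from typing import List, NamedTuple, Optional


class PageRange(NamedTuple):
    criteria: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive


def find_page_ranges(page_texts: List[str], header_pattern: str) -> List[PageRange]:
    """Group consecutive pages that share the same header value.

    header_pattern must contain one capture group holding the criteria name.
    """
    regex = re.compile(header_pattern)
    ranges: List[PageRange] = []
    current: Optional[str] = None  # criteria of the range being built
    start = 0                      # first page of the range being built

    for page_no, text in enumerate(page_texts, start=1):
        match = regex.search(text)
        # A page with no header is treated as a continuation of the current range.
        criteria = match.group(1).strip() if match else current
        if criteria is None:
            continue  # no header has been seen yet
        if criteria != current:
            # A new criteria starts here, so close the previous range first.
            if current is not None:
                ranges.append(PageRange(current, start, page_no - 1))
            current, start = criteria, page_no

    # The last page closes whatever range is still open.
    if current is not None:
        ranges.append(PageRange(current, start, len(page_texts)))
    return ranges


if __name__ == "__main__":
    # Hypothetical page texts standing in for the output of a PDF reader.
    pages = ["Criteria: Alpha\n..."] * 3 + ["Criteria: Beta\n..."] * 2
    for r in find_page_ranges(pages, r"Criteria:\s*(.+)"):
        # The range string uses the "start-end" form; the file name comes from the header.
        print(f"{r.criteria}.pdf <- pages {r.start}-{r.end}")
```

Each resulting row gives you the criteria name for the output file and the page range to pass to the extraction step.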

You could use the Assign Regex Pattern activity to replace each space with \s so that the regular expression matches correctly. You will need to adjust the pattern for the text you are searching for.
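
As an illustration of the space-to-\s idea, here is a small Python sketch (not a UiPath activity). The helper name `whitespace_tolerant_pattern` is hypothetical, and it uses `\s+` instead of a single `\s` as a slight variation, so that line breaks or repeated spaces in the extracted PDF text still match.

```python
import re


def whitespace_tolerant_pattern(search_text: str) -> str:
    """Build a regex that matches search_text with any whitespace between words."""
    words = search_text.split()
    # Escape each word so characters such as '.' or '(' are matched literally.
    return r"\s+".join(re.escape(word) for word in words)


if __name__ == "__main__":
    pattern = whitespace_tolerant_pattern("Invoice Type A")
    # PDF text extraction often breaks a header across lines or pads it with spaces.
    print(bool(re.search(pattern, "Invoice\n  Type A")))
```

The same pattern string should be usable in IsMatch, since `\s` has the same meaning in .NET regular expressions.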

answered Aug 25, 2020 by Karan
• 19,610 points
