Support > Forums > OpenBots Studio > PDF Text Extraction

PDF Text Extraction

Bhavin Suthar

  • Reporter
  • Calender Icon May 04,2023 at 11:20 AM

How to Extract specific Text from PDF which has content in paragraph type ??

This forum has 314 topics, 681 replies, and was last updated 22 days ago by Support Agent

Rushabh Arbat

  • Participant
  • Calender Icon May 05,2023 at 2:25 PM

Hi Bhavin Suthar,

To extract information or text from a pdf we have a following commands which is located under Commands -> Data Manipulation Command -> PDF

Get PDF Text Command - this command extracts all the text from PDF file. Get PDF Form Field Value - this command gets the value of specified form PDF field. Get PDF Images - this command reads all the images from PDF file.

After text is extracted, you can use string manipulation commands or C# code snippets (like Regex pattern or substring) to get the specific text.

We have a command called Get Regex Matches that retrieves all matches in the given text depending on the regex pattern.

Thanks

You are not authorized to reply, please click here to login and add your reply to this topic.