
I'm currently working on a project that scrapes data from a website. The data is not text or a numeric value: the phone number is rendered as an image in a canvas element. My first approach was to use OCR text recognition in Power Automate Desktop (PAD), which collects all the text within a given set of coordinates on the web page. I am able to get the number, but the OCR also picks up unnecessary numbers and text from those coordinates. So I filter the result with the Recognize entities in text action in PAD, which lets me pull out the phone number.

The problem is that when I run the action, it adds each new phone number to a new row, but it also produces an unnecessary number, 000000, in the list rows. I tried to get rid of the unnecessary number with the Remove item from list action, and that worked. But it also removes my other item, NULLED, which I use to mark which products in the list do not have a phone number.


I want to write the resulting list to an Excel file, removing only 00000 from the list and keeping NULLED. The list should then be transferred to Excel row by row.
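To make the desired result concrete, here is a rough Python sketch of the filtering I am after. It is not Power Automate Desktop code, only an illustration of the logic. It assumes NULLED is stored as the literal text "NULLED" and that the unwanted entries consist only of zeros; the sample values and the names `clean_items` and `write_rows` are made up for the example. It writes a CSV file, which Excel can open, rather than a native workbook.

```python
import csv
import re

# Matches entries made up entirely of zeros, e.g. "00000" or "000000".
ZERO_ONLY = re.compile(r"0+")


def clean_items(items):
    """Drop zero-only entries; keep phone numbers and the NULLED marker."""
    cleaned = []
    for item in items:
        text = item.strip()
        if ZERO_ONLY.fullmatch(text):
            continue  # unwanted OCR residue
        cleaned.append(text)
    return cleaned


def write_rows(items, path):
    """Write one list item per row to a CSV file."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        for item in items:
            writer.writerow([item])


# Hypothetical sample data standing in for the OCR result list.
sample = ["0812345678", "000000", "NULLED", "0898765432", "00000"]
result = clean_items(sample)
```

Calling `write_rows(result, "phones.csv")` would then put each remaining item, including NULLED, on its own row.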

  • When you say NULLED, do you mean it shows the text NULLED or that it is a blank row? If you use 'Recognise entities in text' then NULL will not be selected as it is not a phone number. You will have to use a regular expression to extract the items as needed. – CobyC Dec 04 '22 at 20:47
