
I am working on a data extraction project that requires a web scraper written in Python, using Selenium with the PhantomJS headless WebKit browser, to collect public information such as friend lists on Facebook. The program starts out fairly fast, but after about a day of running it gets slower and slower, and I cannot figure out why. Can anyone suggest why it slows down? I am running it on a local machine with decent specs: 4 GB of RAM and a quad-core processor. Also, does Facebook provide an API for finding friends of friends?

Soumya

1 Answer


We faced the same issue and resolved it by automatically closing the browser after a set time interval: clear the temporary cache, open a new browser instance, and continue the process.
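
A minimal sketch of that restart loop, in case it helps. The names `scrape_with_restarts`, `make_driver`, `scrape_page`, `max_age_seconds` and `max_pages` are made up for illustration, and the limits are arbitrary starting points rather than tuned values. It assumes `make_driver` returns any Selenium-style driver with `get()` and `quit()`. It does not touch any on-disk cache, so if your setup writes one you would still need to clear it yourself.

```python
import time


def scrape_with_restarts(urls, make_driver, scrape_page,
                         max_age_seconds=1800, max_pages=200,
                         clock=time.monotonic):
    """Visit each URL, replacing the browser once it exceeds a time or page budget.

    make_driver  -- hypothetical factory returning a driver with get() and quit()
    scrape_page  -- hypothetical callback that extracts data from the loaded page
    """
    results = []
    driver = make_driver()
    started = clock()
    pages = 0
    try:
        for url in urls:
            # Recycle the browser when it is too old or has loaded too many pages.
            if pages >= max_pages or clock() - started >= max_age_seconds:
                driver.quit()           # end the old browser session
                driver = None           # avoid a second quit() if the factory fails
                driver = make_driver()  # start a fresh instance
                started = clock()
                pages = 0
            driver.get(url)
            results.append(scrape_page(driver))
            pages += 1
    finally:
        # Always shut down the last browser, even if scraping raised an error.
        if driver is not None:
            driver.quit()
    return results
```

With the Selenium versions that still shipped PhantomJS support, the factory could be something like `lambda: webdriver.PhantomJS()`; with newer Selenium you would pass a factory for a headless Chrome or Firefox driver instead. Taking the factory as a parameter also makes the loop easy to try out with a stub driver before pointing it at a real browser.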

QAMate.com
  • Does quitting a PhantomJS WebDriver instance automatically delete the cached content? – Soumya Aug 19 '14 at 04:06
  • It is supposed to delete the cache, but it is open-source software, so we cannot expect quality all the time. WebDriver's core job is browser automation; the rest we have to take care of ourselves. – QAMate.com Aug 22 '14 at 03:45