I need to scrape a website that loads data dynamically (e.g. through "read more" links) and that also has content I can only reach by parsing the response fetched with curl. Is it possible to render the website with RSelenium (or a similar package) and then parse it with the curl package instead of the getPageSource command?

berkorbay
  • Why can't you use RSelenium to dynamically load the data and then pull it down with getPageSource? Can you provide the actual site address and the data you want to scrape? – Chris Mar 31 '17 at 01:32
  • With curl, I can somehow get the tooltip data as JSON embedded in span tags. getPageSource only gives you the HTML source, not the JSON responses. I can't give you the URL, but I will try to find an MWE. – berkorbay Mar 31 '17 at 08:19
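
To make the last comment concrete, here is a minimal sketch of pulling JSON out of span tags once the HTML is available as a string. It is written in Python with only the standard library rather than R, purely as an illustration. It assumes the JSON is the text content of the span (the thread does not say whether it sits in the text or in an attribute). The `extract_span_json` helper, the `SpanJSONExtractor` class and the sample markup are all hypothetical.

```python
import json
from html.parser import HTMLParser


class SpanJSONExtractor(HTMLParser):
    """Collect JSON objects/arrays found as the text content of <span> tags.

    Assumption: the JSON is the span's text, not an attribute value.
    """

    def __init__(self):
        super().__init__()
        self._depth = 0      # number of <span> tags currently open
        self._buffer = []    # text seen since the outermost span opened
        self.payloads = []   # decoded JSON objects and arrays

    def handle_starttag(self, tag, attrs):
        if tag == "span":
            self._depth += 1

    def handle_data(self, data):
        if self._depth > 0:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "span" and self._depth > 0:
            self._depth -= 1
            if self._depth == 0:
                text = "".join(self._buffer).strip()
                self._buffer = []
                try:
                    value = json.loads(text)
                except ValueError:
                    return  # ordinary span text, not JSON
                if isinstance(value, (dict, list)):
                    self.payloads.append(value)


def extract_span_json(html):
    """Return every JSON object or array embedded as span text in `html`."""
    parser = SpanJSONExtractor()
    parser.feed(html)
    parser.close()
    return parser.payloads


# Made-up markup standing in for the real (undisclosed) page.
sample_html = (
    '<p>Price <span class="tip">{"label": "Price", "value": 42}</span> '
    "<span>plain text</span></p>"
)
print(extract_span_json(sample_html))  # → [{'label': 'Price', 'value': 42}]
```

The function only needs a string, so it would work the same way on a raw HTTP response body or on page source returned by a browser driver. Whether the rendered page source still contains the JSON is exactly what is unresolved in this thread.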

0 Answers