0

I would like to extract the complete html source of website as we get it on chrome inspect element.

I am trying to the scrap the comments of disqus from http://bossip.com/1190557/cops-kill-people-mississippi-cop-under-investigation-for-strangling-unarmed-black-man-jonathan-sanders-to-death-43081/

Initially I tried with java and Jsoup but I am unable to extract the complete html source.

Then as suggested in How to get html that I see in inspect element? I tried using HTML Unit but unfortunately HTML unit also did not extract as i expected.

Is there any solution around for this problem?

Community
  • 1
  • 1
Arun Kishore
  • 119
  • 3
  • 12

1 Answers1

-1

In Chrome,

  1. in normal browser document: right-click on element, select "Inspect Element" from pop-up menu. You will go to the "developer tools" window.
  2. in developer tools window: right-click on the HTML element and select "Edit as HTML" from pop-up menu.
  3. select all HTML text in the editor (press CTRL-A) and copy to clipboard (press CTRL-C)
geert3
  • 7,086
  • 1
  • 33
  • 49
  • 1
    not sure why this is downvoted. It works as described. – geert3 Jul 30 '15 at 12:41
  • since @Arun Kishore asked how to get the html of one very specific page your solution works great, but the down vote was likely due to this being such a manual task. It can't be automated. – Ben Harrison Jun 29 '16 at 19:17