Questions tagged [boilerpipe]

The boilerpipe library for Java provides algorithms to detect and remove the surplus "clutter" (boilerplate, templates) around the main textual content of a web page.

The boilerpipe library for Java provides algorithms to detect and remove the surplus "clutter" (boilerplate, templates) around the main textual content of a web page.

77 questions
0
votes
2 answers

How to extract news content from a web page using Boilerpipe?

I need to extract main news content from a web page.I searched on internet and found an api named Boilerpipe freely available for that purpose http://boilerpipe-web.appspot.com/ But I'm not abled to find any implementations in java that make use of…
dark_shadow
  • 3,503
  • 11
  • 56
  • 81
-2
votes
1 answer

Stuck on Boilerpipe

Fairly excited about this utility, but running into some issues with implementation. Installed it but getting no result with execution of HTML file on server. Syntax error on the URL code.