I'm trying to use the boilerpipe API to extract text from news articles, but it doesn't fully work. For example, see this link. Although boilerpipe captures all of the main text, it also picks up some unimportant text, such as "Chat with us on Facebook Messenger." Are there any viable alternatives to boilerpipe, or is there a way to configure boilerpipe so that it finds the main article text more accurately?

Tdonut

0 Answers