7

I've currently written an algorithm in Ruby based on the arc90 readability code to extract an article from a web page.

Now that I have the article, I want to extract keywords and specific information from it (names, author, etc)

I heard Alchemy was a great ruby gem for doing this though it consumes a lot of resources. Are there any better gems I can use for this?

Nakilon
  • 34,866
  • 14
  • 107
  • 142
dpigera
  • 3,339
  • 5
  • 39
  • 60
  • i think the 'alchemy' keyword refers to Adobe's Alchemy project, not the Ruby Gem. See the faq: http://stackoverflow.com/tags/alchemy/info – paleozogt Sep 01 '10 at 17:16

2 Answers2

9

fast, leightweight and easy-to-use gem for extracting keywords from longer content:

https://rubygems.org/gems/highscore

i use it in production, works like a charm.

The question is a bit older, but i'll leave this here for others who will come from google to see this question.

Maximilian Stroh
  • 1,086
  • 1
  • 10
  • 26
2

There is an OpenCalais gem which provides similar capability. In addition to entity extraction it can also detect events and relations between entities. It's not lightweight, though I couldn't tell if it's better or worse than Alchemy as I haven't used the Alchemy gem. Hope this helps.

RedFred
  • 999
  • 9
  • 20