In particular, I would like to extract entities such as people, places, films, and music, and have them linked to widely used linked data identifiers such as DBpedia, Freebase, or OpenCyc.
1 Answer
2
Extractiv is a similar service which combines a web crawler from 80legs with natural language processing from Language Computer Corporation (LCC). This service currently provides more than 150 entity types, such as the ones you list, and links them to DBpedia.
While not yet deployed to Extractiv as a web service, LCC's CiceroLite named entity tagger supports both Chinese and Japanese, and it can be purchased as a standalone application. Another company offering this kind of tagger is Basis Technology, although I do not know whether its entities are linked to any of these datasets.
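To make "linked to DBpedia" a bit more concrete, here is a minimal Python sketch of what a linked entity record might look like. It is not Extractiv's or CiceroLite's API; `LinkedEntity`, `dbpedia_uri_for`, `link_entities`, and the type labels are hypothetical names used for illustration only. It also assumes, naively, that the extracted text matches the Wikipedia article title exactly, whereas a real linker has to disambiguate.

```python
from dataclasses import dataclass
from urllib.parse import quote

# DBpedia resource URIs are built from Wikipedia article titles.
DBPEDIA_RESOURCE_PREFIX = "http://dbpedia.org/resource/"


@dataclass
class LinkedEntity:
    """Hypothetical record: an extracted mention plus its linked data ID."""
    text: str
    entity_type: str
    dbpedia_uri: str


def dbpedia_uri_for(label: str) -> str:
    """Naive mapping: assumes the label is exactly the Wikipedia title.

    Spaces become underscores; other unsafe characters are percent-encoded.
    """
    name = label.strip().replace(" ", "_")
    return DBPEDIA_RESOURCE_PREFIX + quote(name, safe="_(),'")


def link_entities(mentions):
    """Turn (text, type) pairs from some tagger into linked records."""
    return [LinkedEntity(text, etype, dbpedia_uri_for(text))
            for text, etype in mentions]


# Hypothetical tagger output, hard-coded here for illustration.
entities = link_entities([("Akira Kurosawa", "PERSON"), ("Tokyo", "CITY")])
for entity in entities:
    print(entity.entity_type, entity.text, entity.dbpedia_uri)
```

The point of having a shared ID like this is that the same URI can then be used to join the extracted entity against other datasets that also reference DBpedia.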

John Lehmann