I know it's possible to extract Wikipedia content from a dump. However, is it also possible to extract the search aliases?
For instance, that "obama" is an alias of "Barack Obama"?
Yes. What you call search aliases are stored in Wikipedia as redirects, and you can find them (in RDF format) in the redirects datasets that DBpedia extracted from Wikipedia.
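As a rough illustration, here is a minimal Python sketch that turns redirect triples into an alias-to-title mapping. It assumes the dataset is in N-Triples form, one triple per line, and that redirects use the `http://dbpedia.org/ontology/wikiPageRedirects` predicate; check both against the file you actually download. The helper names (`parse_redirects`, `title_from_uri`) are made up for this example.

```python
import re
from urllib.parse import unquote

# Assumed predicate and resource prefix; verify against your copy of the dump.
REDIRECT_PREDICATE = "http://dbpedia.org/ontology/wikiPageRedirects"
RESOURCE_PREFIX = "http://dbpedia.org/resource/"

# Matches a simple N-Triples line whose subject, predicate and object are all URIs.
TRIPLE_RE = re.compile(r"^<([^>]+)>\s+<([^>]+)>\s+<([^>]+)>\s*\.\s*$")


def title_from_uri(uri):
    """Turn a DBpedia resource URI into a readable page title."""
    name = uri[len(RESOURCE_PREFIX):] if uri.startswith(RESOURCE_PREFIX) else uri
    return unquote(name).replace("_", " ")


def parse_redirects(lines):
    """Build a dict mapping each redirect (alias) title to its target title."""
    aliases = {}
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        match = TRIPLE_RE.match(line)
        if match is None:
            continue  # skip anything that is not a plain URI triple
        subject, predicate, obj = match.groups()
        if predicate == REDIRECT_PREDICATE:
            aliases[title_from_uri(subject)] = title_from_uri(obj)
    return aliases


# One hand-written sample line in the assumed format.
sample = [
    "<http://dbpedia.org/resource/Obama> "
    "<http://dbpedia.org/ontology/wikiPageRedirects> "
    "<http://dbpedia.org/resource/Barack_Obama> .",
]
aliases = parse_redirects(sample)
print(aliases)
```

For the real dataset you could pass an open file instead of the sample list, for example `bz2.open(path, "rt", encoding="utf-8")` if the download is bzip2-compressed. Wikipedia capitalizes the first letter of page titles, so a lowercase query such as "obama" would likely appear as "Obama" in the data; normalizing case before the lookup is probably a good idea.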