-2

I need an annotated dataset for entity extraction. How can I get one in which job titles (JobTitles) are annotated?

Nathaniel Ford

1 Answer

1

Here is what I suggest if you haven't come across any existing datasets. Grab the Wikipedia occupation lists (https://en.wikipedia.org/wiki/Lists_of_occupations), build a gazetteer of job titles from them, and write regular expressions that capture those titles, and any variations of them, in your text. The matches are your annotated data :).
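
A minimal sketch of that gazetteer idea in Python, under a few assumptions: the `GAZETTEER` list is a hypothetical stand-in for titles you would collect from the Wikipedia lists, the `JobTitle` label is just an illustrative name, and the only "variations" handled are case, flexible whitespace or hyphens between words, and a trailing plural "s".

```python
import re

# Hypothetical gazetteer; in practice, fill it from the Wikipedia occupation lists.
GAZETTEER = ["software engineer", "data scientist", "engineer", "nurse"]


def build_pattern(titles):
    """Compile one regex that matches any title in the gazetteer."""
    # Longest titles first, so "software engineer" wins over plain "engineer".
    ordered = sorted(set(titles), key=len, reverse=True)
    parts = []
    for title in ordered:
        # Allow whitespace or hyphens between the words of a multi-word title.
        words = r"[\s-]+".join(re.escape(word) for word in title.split())
        # Allow an optional plural "s" (a crude assumption about variations).
        parts.append(words + r"s?")
    return re.compile(r"\b(?:" + "|".join(parts) + r")\b", re.IGNORECASE)


def annotate(text, pattern, label="JobTitle"):
    """Return (start, end, label) character spans for every match in text."""
    return [(m.start(), m.end(), label) for m in pattern.finditer(text)]


if __name__ == "__main__":
    pattern = build_pattern(GAZETTEER)
    sentence = "She worked as a Software Engineer before becoming a nurse."
    for start, end, label in annotate(sentence, pattern):
        print(label, repr(sentence[start:end]))
```

Character offsets like these can be converted to whatever format your entity extraction toolkit expects. Expect noise: a pure string match will also tag surnames such as "Baker" or "Cook", so spot-check a sample before training on the output.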

user3639557
    Or she can build a corpus from Wikipedia itself: whenever a link on a random wiki page points to one of the occupation wiki pages, she has found an occupation in raw text. – Istvan Nagy Feb 18 '16 at 08:16
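
The link-based idea in the comment could be sketched roughly like this, assuming you already have the raw wikitext of a page and a set of occupation page titles (both hypothetical inputs here; fetching them is left out). It only handles the plain `[[Target]]` and `[[Target|surface text]]` link forms and compares titles case-insensitively, which is a simplification.

```python
import re

# Matches [[Target]] and [[Target|surface text]]; other wikitext markup is ignored.
LINK = re.compile(r"\[\[([^\[\]|]+)(?:\|([^\[\]]+))?\]\]")


def annotate_wikitext(wikitext, occupation_pages, label="JobTitle"):
    """Strip link markup and return (plain_text, spans).

    occupation_pages is assumed to be a set of lowercased page titles taken
    from the occupation lists. Each span is (start, end, label) in plain_text.
    """
    pieces, spans = [], []
    pos = 0      # read position in the wikitext
    offset = 0   # write position in the plain text
    for m in LINK.finditer(wikitext):
        before = wikitext[pos:m.start()]
        pieces.append(before)
        offset += len(before)
        target = m.group(1).strip()
        surface = m.group(2) or m.group(1)
        # A link to an occupation page marks its visible text as a job title.
        if target.lower() in occupation_pages:
            spans.append((offset, offset + len(surface), label))
        pieces.append(surface)
        offset += len(surface)
        pos = m.end()
    pieces.append(wikitext[pos:])
    return "".join(pieces), spans


if __name__ == "__main__":
    sample = "Ada was a [[Mathematician|mathematician]] born in [[London]]."
    text, spans = annotate_wikitext(sample, {"mathematician"})
    print(text)
    for start, end, label in spans:
        print(label, repr(text[start:end]))
```

Compared with regex matching on a gazetteer, this only labels mentions that a Wikipedia editor explicitly linked, so the annotations should be more precise but will miss unlinked mentions.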