I am trying to use the Stanford Topic Modeling Toolbox (TMT) to try out Topic Modeling [0]. I am a Scala beginner. However, I can't seem to prepare my data set by reading a CSV file. Here's my code
import scalanlp.io._;
val source = CSVFile("pubmed-oa-subset.csv") ~> IDColumn(1);
println(source.data.size);
This throws the following error
Stanford TMT\example-0-test.scala:6: error: not found: value IDColumn
val source = CSVFile("pubmed-oa-subset.csv") ~> IDColumn(1);
Similarly, I also get an error on other data pre-processing functions like Tokenizer. Here's the code
// Stanford TMT Example 0 - Basic data loading
// http://nlp.stanford.edu/software/tmt/0.4/
import scalanlp.io._;
val source = CSVFile("pubmed-oa-subset.csv") ;
println(source.data.size);
val tokenizer = {
SimpleEnglishTokenizer()
}
Here's the error received for the above code.
error: not found: value SimpleEnglishTokenizer
SimpleEnglishTokenizer()
I am using the same CSV file as given on the TMT homepage [1]. Also, the script and the data are in the same folder.
What is the issue? I am unable to run the exact same test examples from the TMT homepage.
[0] http://nlp.stanford.edu/software/tmt/tmt-0.4/
[1] http://nlp.stanford.edu/software/tmt/tmt-0.4/examples/pubmed-oa-subset.csv