Questions tagged [korpus]

An R Package for Text Analysis

About

A set of tools to analyze texts. Includes, amongst others, functions for automatic language detection, hyphenation, several indices of lexical diversity (e.g., type token ratio, HD-D/vocd-D, MTLD) and readability (e.g., Flesch, SMOG, LIX, Dale-Chall).

Links

11 questions
1
vote
0 answers

koRpus--tokenize command on large folder of word files

I have made some headway in getting koRpus to analyze my data, but there are lingering problems. The 'tokenize' command seems to work--kind of. I run the following line of code: word <- tokenize("/Users/gdballingrud/Desktop/WPSCASES 1/",…
Gordon
  • 11
  • 1
1
vote
0 answers

koRpus package utilising TreeTagger within R (RStudio) on macOS

im exploring the koRpus package in R on macOS, in trying to use the treetag function on the following object: text [1] "Because I could not stop for Death -" "He kindly stopped for me -" [3] "The Carriage held but just Ourselves -" "and…
1
vote
1 answer

Treetag package koRpus R

First, I'm sorry for my English, I speak very bad English. So, I have a question about treetag function from koRpus package on R. I want to lemmatized a text and it's ok for me with Treetagger. I used the treetag function : treetag(file,…
1
vote
0 answers

string replacement on koRpus treetag

The function tokenize("C:\\test_treetag.txt", clean.raw=list("aren't"="are not")) allows me to tokenize and replace "aren't" by the tokens "are not" I'm trying to do the same in the function in the treetag function.
1
vote
1 answer

Treetagger koRpus package error

I am trying to use the Treetag function in the koRpus package. The code I have used is tagged.text <-treetag("C:/Rec_By_Others.txt",treetagger="manual",lang="en",TT.options=list(path="C:\\Program Files\\TreeTagger", preset="en")) But I keep…
0
votes
1 answer

getting R to recognize a file full of Word files for koRpus analysis

I need some help with loading text-file data into R for analysis with packages like koRpus. The problem I am facing is getting R to recognize a folder full of Word files (about 4,000) as data which I can then make koRpus perform analyses like…
0
votes
3 answers

Compiling and analysing a Corpus with R and koRpus

I'm a student of literature lost in data sciences. I'm trying to analyse a corpus of 70 .txt-files, which are all in one directory. My final goal is to get a table containing the filename (or something similar), the sentence and word counts, a…
SamVimes
  • 39
  • 7
0
votes
2 answers

Using koRpus function treetagger with R

As I am a starter trying out the korpus to work with R, any advice will be great. I installer TreeTagger on my windows machine, then I installed koRpus package. My question is how Can I use treetagger using koRpus with R? Is there any settings to…
Poisson
  • 1,543
  • 6
  • 23
  • 34
0
votes
1 answer

Moving from tm object to koRpus object and vice versa

I have a problem moving from a tm object to a koRpus object. I have to normalize a corpus with tm tools, lemmatize the results with koRpus and return to tm to categorize the results. In order to do this I have to transform the tm object into a R…
Giorjet
  • 93
  • 2
  • 11
0
votes
1 answer

Calculate readability scores for several files with R

I would like to calculate the readability scores in R-3.3.2(R-Studio 3.4 for Win) using koRpus package for several txt.files and save results to excel or sqllite3 or txt. Now I can only calculate the readability score for one file only and print…
In777
  • 171
  • 1
  • 4
  • 15
0
votes
1 answer

treetager - Linux - error : Specified file cannot be found:

Hi I ma working treetager to extract nouns from the data, however I am getting the below error message. Here is my R script A <- Data_raw$SourceText[10:15] A ## [1] "Hi ... they ... they rather do very helpful and I like your ... have to move with…
Ravi Kumar
  • 161
  • 1
  • 6