I am trying to read-in my documents into R. Everything loads fine but I receive 36 warning messages such as:
"18: In readLines(y, encoding = x$Encoding) : incomplete final line found on 'C:/text_data/2006DefenseWhitePaper.docx'"
Additionally, when I inspect my corpus it looks like this:
$\`1998DefenseWhitePaper.docx`
PK
l"%3÷Þ3VƃÑÚšl µw%ë=–“^i7+Ù×ä-d&á”0ÞAÉ6€l4¼½L60#µÃ’ÍS
Oœ£œƒXø
For some reason the documents are encoded
Is this a formatting issue or are the sources from where i get the documents (online) encrypted.