I need to extract text from Word documents that contain parts formatted as Rich Text Content Control (RTCC). I am using officer. The problem is that I am not able to extract text formatted as RTCC. Any ideas on how to do this?
library(officer)
trtDoc <- read_docx("theFile.docx") %>%
docx_summary()
The code above gives me a data.frame
with the text, but the RTCC formatted text does not show.