I have hundreds of Microsoft Word documents. Each document has the same headers. I need to read the data in those documents and combine it into a single table, output as a CSV file.

I tried to use Scrapy, but I am new to it and I don’t know how to combine all documents at once, or how to proceed after that.
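For illustration, here is a minimal sketch of one way this could be approached without Scrapy, using only the Python standard library. It rests on several assumptions that the question does not confirm: the files are `.docx` (not legacy `.doc`) and sit in one local folder, each header appears as its own paragraph, and the text that follows a header (up to the next header) is that header's value. The header names in `HEADERS` and the function names are hypothetical placeholders.

```python
import csv
import zipfile
import xml.etree.ElementTree as ET
from pathlib import Path

# WordprocessingML namespace used inside word/document.xml
W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"

# Hypothetical header names (assumption); replace with the real ones.
HEADERS = ["Name", "Date", "Summary"]


def read_paragraphs(docx_path):
    """Return the text of each non-empty paragraph in a .docx file."""
    # A .docx file is a ZIP archive; the body text lives in word/document.xml.
    with zipfile.ZipFile(docx_path) as zf:
        root = ET.fromstring(zf.read("word/document.xml"))
    paragraphs = []
    for p in root.iter(W + "p"):
        text = "".join(t.text or "" for t in p.iter(W + "t")).strip()
        if text:
            paragraphs.append(text)
    return paragraphs


def extract_row(paragraphs, headers=HEADERS):
    """Collect the text that follows each header paragraph into a dict."""
    row = {h: "" for h in headers}
    current = None
    for text in paragraphs:
        if text in row:
            # Assumption: a paragraph equal to a header name starts a new field.
            current = text
        elif current is not None:
            row[current] = (row[current] + " " + text).strip()
    return row


def docx_folder_to_csv(folder, csv_path, headers=HEADERS):
    """Write one CSV row per .docx file found in folder."""
    with open(csv_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=["file"] + list(headers))
        writer.writeheader()
        for path in sorted(Path(folder).glob("*.docx")):
            if path.name.startswith("~$"):
                continue  # skip the lock files Word creates for open documents
            row = extract_row(read_paragraphs(path), headers)
            row["file"] = path.name
            writer.writerow(row)
```

Calling `docx_folder_to_csv("path/to/docs", "output.csv")` would then produce one row per document, with a `file` column plus one column per header. If the headers are laid out differently (for example, `Header: value` on one line, or inside table cells), `extract_row` is the part that would need to change.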

Peter Mortensen
  • Isn't Scrapy only for web sites? Perhaps not: *[Extracting text from Microsoft Word files in Python with Scrapy](https://stackoverflow.com/questions/25686285)* – Peter Mortensen Oct 01 '19 at 23:41
  • What language are you planning to use? – B. Go Oct 01 '19 at 23:42
  • Where are those Word documents? In a file system (locally or network drive)? On a web site? – Peter Mortensen Oct 01 '19 at 23:47
  • Intersection of the tags for Scrapy and Microsoft Word (about 14,000 each): [https://stackoverflow.com/questions/tagged/scrapy+ms-word](https://stackoverflow.com/questions/tagged/scrapy+ms-word). Result: only one question - [the question I linked](https://stackoverflow.com/questions/25686285). Very strange! – Peter Mortensen Oct 02 '19 at 00:14
  • Can you add what you have tried (by [editing your question](https://stackoverflow.com/posts/58193509/edit))? E.g. so it is more clear what is meant by *"I don’t know how to combine all documents at once"*. – Peter Mortensen Oct 02 '19 at 00:18
  • The 15 minutes of attention are gone. – Peter Mortensen Oct 02 '19 at 00:18

0 Answers