I've tried several solutions to parse word documents to a string in PHP, however they sometimes have trouble with certain word documents. So I'm now trying PHPWord to attempt to parse the word document to a string.
I'm looking at this sample file in PHPWord which reads a Word document and outputs to another Word document:
include_once 'Sample_Header.php';
// Read contents
$name = basename(__FILE__, '.php');
$source = "resources/{$name}.doc";
echo date('H:i:s'), " Reading contents from `{$source}`", EOL;
$phpWord = \PhpOffice\PhpWord\IOFactory::load($source, 'MsDoc');
// (Re)write contents
$writers = array('Word2007' => 'docx', 'ODText' => 'odt', 'RTF' => 'rtf');
foreach ($writers as $writer => $extension) {
echo date('H:i:s'), " Write to {$writer} format", EOL;
$xmlWriter = \PhpOffice\PhpWord\IOFactory::createWriter($phpWord, $writer);
$xmlWriter->save("{$name}.{$extension}");
rename("{$name}.{$extension}", "results/{$name}.{$extension}");
}
include_once 'Sample_Footer.php';
However, I don't want to output another entire Word document, I just want to parse the contents to a string in PHP. How can this be modified to output the content to a string?