I am new to the world of Open XML and would greatly appreciate some guidance. I am assisting on a project which processes the document.xml file of a complex Word document. One part of this project requires extracting text from a Word table cell and copying it into an HTML table cell. The existing implementation loses all formatting and I have been asked to preserve it.
After a brief look at Open-Xml-PowerTools, I’m thinking the way to proceed would be to take the XML for the <w:tc> element and embed that into a dummy Word file, then convert this to HTML via the Convert-DocxToHtml tool and then extract just the HTML that I need. Does this sound reasonable, or is there a better way to accomplish the job? For example is there a function that directly converts a <w:tc> element to HTML? (Note: the project runs as a server-side PHP program in a browser.)