Manu_TR
Forum Replies Created
-
AuthorPosts
-
Could anyone please help me with this, it is very urgent?
Thanks in advance!
ManuHi Eric,
I don’t see the fieldcode for hyperlink in document.xml within the below TOC field paragraph. here is the snippet of the document.xml but i see that word interop or Aspose.words are recognizing this hyperlink field code : HYPERLINK \l “_Toc433904186″. could you please help me how interop/Aspose are identifying this fieldcode?
<w:p w:rsidRDefault=”00497C28″ w:rsidR=”00630630″>
<w:pPr>
<w:pStyle w:val=”TOC1″/>
-<w:rPr>
<w:rFonts w:cstheme=”minorBidi” w:hAnsiTheme=”minorHAnsi” w:eastAsiaTheme=”minorEastAsia” w:asciiTheme=”minorHAnsi”/>
<w:noProof/>
<w:kern w:val=”0″/>
<w:sz w:val=”22″/>
<w:szCs w:val=”22″/>
<w:lang w:eastAsia=”en-GB”/>
</w:rPr>
</w:pPr>
<w:r>
<w:fldChar w:fldCharType=”begin”/>
</w:r>
<w:r w:rsidR=”002223C4″>
<w:instrText xml:space=”preserve”> TOC \b “ContentA” \o “1-1″ \h \z \u </w:instrText>
</w:r>
<w:r>
<w:fldChar w:fldCharType=”separate”/>
</w:r>
<w:hyperlink w:history=”1″ w:anchor=”_Toc433904186″>
<w:r w:rsidRPr=”00CF2217″ w:rsidR=”00630630″>
<w:rPr>
<w:rStyle w:val=”Hyperlink”/>
<w:noProof/>
</w:rPr>
<w:t>1.</w:t>
</w:r>
<w:r w:rsidR=”00630630″>
<w:rPr>
<w:rFonts w:cstheme=”minorBidi” w:hAnsiTheme=”minorHAnsi” w:eastAsiaTheme=”minorEastAsia” w:asciiTheme=”minorHAnsi”/>
<w:noProof/>
<w:kern w:val=”0″/>
<w:sz w:val=”22″/>
<w:szCs w:val=”22″/>
<w:lang w:eastAsia=”en-GB”/>
</w:rPr>
<w:tab/>
</w:r>
<w:r w:rsidRPr=”00CF2217″ w:rsidR=”00630630″>
<w:rPr>
<w:rStyle w:val=”Hyperlink”/>
<w:noProof/>
</w:rPr>
<w:t>Interpretation</w:t>
</w:r>
<w:r w:rsidR=”00630630″>
<w:rPr>
<w:noProof/>
<w:webHidden/>
</w:rPr>
<w:tab/>
</w:r>
</w:hyperlink>
</w:p>THanks,
ManuI see that interop facilitates the Range feature which contains the beginning and end offset.
problem statement : if I have to highlight a word in the document, i need the beginning offset and the end offset of that word to bookmark that element. so I was wondering how can i get the beginning and end offset of that particular word.
THanks,
ManuSo are you telling \f denotes the end of section?
and also could you please tell how did interop get ‘\r’ character from the above section?
Where can I upload the sample document?
Below is one of the section part splitted from the maindocument as wmlDocument object:
<w:body>
<w:p w:rsidR=”00852464″ w:rsidRDefault=”00852464″>
<w:pPr>
<w:widowControl w:val=”0″ />
<w:autoSpaceDE w:val=”0″ />
<w:autoSpaceDN w:val=”0″ />
<w:adjustRightInd w:val=”0″ />
<w:spacing w:after=”0″ w:line=”240″ w:lineRule=”auto” />
<w:rPr>
<w:rFonts w:ascii=”sans-serif” w:hAnsi=”sans-serif” w:cs=”sans-serif” />
<w:color w:val=”000000″ />
<w:sz w:val=”24″ />
<w:szCs w:val=”24″ />
</w:rPr>
</w:pPr>
</w:p>
<w:sectPr w:rsidR=”00852464″>
<w:footerReference w:type=”even” r:id=”Rcf31b4517595440c” />
<w:footerReference w:type=”first” r:id=”R341e70aec4e340e3″ />
<w:headerReference w:type=”even” r:id=”R8cc439d0bee047a1″ />
<w:headerReference w:type=”first” r:id=”R174c347e0a56457a” />
<w:headerReference w:type=”default” r:id=”R0b9c2b0d93dc4d5a” />
<w:footerReference w:type=”default” r:id=”R164bf608aecd4f44″ />
<w:pgSz w:w=”12240″ w:h=”15840″ />
<w:pgMar w:top=”1080″ w:right=”1080″ w:bottom=”1080″ w:left=”1080″ w:header=”720″ w:footer=”720″ w:gutter=”0″ />
<w:cols w:space=”720″ />
<w:noEndnote />
</w:sectPr>
</w:body>When I extract the text from the above section I get an empty string.
But when I look at the text of the same section through the word interop. I get this text “\r\f”Please help!
Thank you very much Eric, this was very helpful.
Although the process of splitting sections is bit slower thanks again!.
I need the value of this property similar to interop in openxml.
Wdinformation.wdHorizontalPositionRelativeToPage
Thanks
Is there a way to extract the actual text content of all the sections separately using openxml? similar to word interop?
Thanks in Advance
Manuyes, I am using Open-Xml-PowerTools.
yes, the way how the ranges in the word interop have the start and end offset.
Regards,
manuThanks Eric.
But I have document which doesn’t have a line break element in document.xml. but document has many lines in it.
And is there a way to extract the text all the sections separately using openxml?
Is there a way to track all the character offsets in openxml?
Thanks! -
AuthorPosts