Spire.Doc is a professional Word .NET library specifically designed for developers to create, read, write, convert and print Word document files. Get free and professional technical support for Spire.Doc for .NET, Java, Android, C++, Python.

Wed Dec 20, 2017 10:17 am

Hi,

Doc having text with background color , while converting from doc to text.
That text not coming properly, some content loss or in different order.
Kindly check attached doc file, convert it in text and verify.
check below paragraph
3.2.2
A licensee’s board should be ultimately responsible for the sound and
prudent management of a licensee’s SVF business operations. As
such, the responsibilities, organization, functioning, and composition of
the licensee’s board of directors must be clearly defined and
documented.
Guideline
3.2.3
The board should have an adequate number and appropriate
composition of members to ensure sufficient checks and balances and
collective expertise for effective, objective decision-making. The size
and composition of the board will vary from institution to institution
depending on the size and complexity of the licensee and the nature
and scope of its activities. As a general benchmark for demonstrating
sufficiency of checks and balances, normally one-third of their board
members should be independent non-executive directors (INED).
Guideline
3.2.5
Whilst the board is ultimately responsible for the overall soundness of
a licensee, the appointment of competent management is key to
achieving the objective of a soundly and efficiently run licensee. The
board works with a senior management team (senior management) to
achieve this and senior management remains accountable to the
board.

Like wise there are paragraph.

ParasXOR
 
Posts: 28
Joined: Wed Nov 22, 2017 6:13 am

Thu Dec 21, 2017 3:21 am

Hello,

Thanks for your inquiry.
After a thorough investigation, we are sorry to tell that the issue couldn't be resolved. In your sample document, the content is stored in the order of the extracted text you see now. Since our Spire.Doc extracts the content according to the internal storage order, the result should be like that. If you use Word to convert the document directly to .txt file, you will see the content in the list 3.2.2 is ignored. We suppose Word might not manipulate the data in the "shape".
Hope I make it clear and below is the internal document structure for the related part 3.2.2.
12444.png


Sincerely,
Jane
E-iceblue support team
User avatar

Jane.Bai
 
Posts: 1156
Joined: Tue Nov 29, 2016 1:47 am

Thu Dec 21, 2017 9:34 am

Kindly check attached doc file special character not converting in text file.

Find blow sentence :
in Doc :
3.2.2 The audio signal shall be subject to pre-emphasis with a time-constant of 50 μs
± 2 μs

In Text file ;
3.2.2 The audio signal shall be subject to pre-emphasis with a time-constant of 50 s
± 2 s.

similar issue for another character check in doc file name SpecialCharIssue:
‘Arts and culture programmes’ means programmes which promote the development or appreciation of 

ParasXOR
 
Posts: 28
Joined: Wed Nov 22, 2017 6:13 am

Thu Dec 21, 2017 10:19 am

Hello,

Thanks for your feedback. We will look into the issue, once there is any update, we will inform you.

Sincerely,
Simon
E-iceblue support team
User avatar

Simon.yang
 
Posts: 620
Joined: Wed Jan 11, 2017 2:03 am

Fri Dec 22, 2017 3:50 am

Hello,

Thanks for your waiting. With further investigation, we find your requirement is not achievable. Actually the symbol characters could be converted correctly if setting the encoding as UTF8 or Unicode while converting, but the txt file only support one font at the same time and symbol characters only support to display in symbol font. If you open the result txt file and change the font into symbol, you will see the symbol characters display properly, but the others will not.
Code: Select all
            Document doc = new Document();
            doc.LoadFromFile(@"SpecialCharacter.docx");
            doc.SaveToTxt("result.txt",Encoding.UTF8)


Sincerely,
Simon
E-iceblue support team
User avatar

Simon.yang
 
Posts: 620
Joined: Wed Jan 11, 2017 2:03 am

Return to Spire.Doc