Spire.PDF is a professional PDF library applied to creating, writing, editing, handling and reading PDF files without any external dependencies. Get free and professional technical support for Spire.PDF for .NET, Java, Android, C++, Python.

Mon Oct 24, 2011 4:53 am

Hi,

I've been having trouble getting the Hebrew text from the PDF file while using Spire.PDF and ExtractText.
Is it supported? do I need to do anything special?

Thanks.

thefor
 
Posts: 1
Joined: Sat Jun 11, 2011 10:14 pm

Tue Oct 25, 2011 2:02 am

Hi thefor,

Thanks for your feedback!

So sorry that our Spire.PDF doesn't support Hebrew at this stage. We may add this in the future.

Please feel free to contact us for anything at any time. Have a great day!
e-iceblue support
User avatar

iceblue support
 
Posts: 240
Joined: Tue Dec 21, 2010 2:56 am

Wed Dec 12, 2012 1:24 pm

Is there any change? I also has that problem...

avner
 
Posts: 5
Joined: Tue Dec 11, 2012 7:40 am

Thu Dec 13, 2012 9:02 am

Hello,

Thanks for your inquiry.
Spire.pdf can support extracting the Hebrew text from the PDF file. We provide you some sample code. Please try to use Spire.PDF_2.6.32(the download link is http://www.e-iceblue.com/Download/download-pdf-for-net-now/spirepdf-packhot-fix2632.html?Itemid=0) to test it. If you encounter any problem, please tell us.

Code: Select all
        private void btnExtractText_Click(object sender, EventArgs e)
        {
            PdfDocument doc = new PdfDocument();
            doc.LoadFromFile(filepath);

            StringBuilder buffer = new StringBuilder();
            foreach (PdfPageBase page in doc.Pages)
            {
                buffer.Append(page.ExtractText());
            }
            richTextBox1.Text=buffer.ToString();
        }


Best regards,
Amy
E-iceblue support team
User avatar

amy.zhao
 
Posts: 2766
Joined: Wed Jun 27, 2012 8:50 am

Mon Dec 17, 2012 10:09 am

Hello,

Could you please tell us your test result? If you encounter any problem, please feel free to contact us.

Best regards,
Amy
E-iceblue support team
User avatar

amy.zhao
 
Posts: 2766
Joined: Wed Jun 27, 2012 8:50 am

Mon Dec 17, 2012 11:34 am

Hi, during the last week I checked the Hebrew solutions:
a. Doc to Html/txt works great.
b. Doc to pdf and pdf to doc work work bad.
c. Doc to bmp works bad.

Avner

avner
 
Posts: 5
Joined: Tue Dec 11, 2012 7:40 am

Tue Dec 18, 2012 2:15 am

Hello Avner,

Thanks for your feedback.
For #b and #c, we are sorry for the inconvenience. Could you please provide us your pdf file? So that we can reproduce the problems, and our dev team can do some researches. Thank you!

Best regards,
Amy
E-iceblue support team
User avatar

amy.zhao
 
Posts: 2766
Joined: Wed Jun 27, 2012 8:50 am

Tue Dec 18, 2012 8:06 am

I am not able to upload pdf/doc/docx: "The extension pdf is not allowed." But, You can see it by any Hebrew text also without understanding the language.

avner
 
Posts: 5
Joined: Tue Dec 11, 2012 7:40 am

Tue Dec 18, 2012 9:14 am

Hello Avner,

Please add your files to a .zip file or send your files to support@e-iceblue.com. Thank you!

Best regards,
Amy
E-iceblue support team
User avatar

amy.zhao
 
Posts: 2766
Joined: Wed Jun 27, 2012 8:50 am

Wed Dec 19, 2012 7:34 am

Hello Avner,

We had received your files from email.
We are sorry that doc to pdf conversion and doc to bmp conversion can't support the characters, the format of the characters is right-to-left at present. Our dev team have added them as new features to spire.doc. We will inform you immediately as soon as spire.doc can support the features. Could you please provide us the code of pdf to doc conversion you did? Thank you!

Best regards,
Amy
E-iceblue support team
User avatar

amy.zhao
 
Posts: 2766
Joined: Wed Jun 27, 2012 8:50 am

Return to Spire.PDF