Spire.PDF is a professional PDF library for creating, writing, editing, handling and reading PDF files without any external dependencies. Get free and professional technical support for Spire.PDF for .NET, Java, Android, C++, Python.
Mon Oct 24, 2011 4:53 am
Hi,
I've been having trouble getting the Hebrew text from the PDF file while using Spire.PDF and ExtractText.
Is it supported? Do I need to do anything special?
Thanks.
-
thefor
-
- Posts: 1
- Joined: Sat Jun 11, 2011 10:14 pm
Tue Oct 25, 2011 2:02 am
Hi thefor,
Thanks for your feedback!
We are sorry, but Spire.PDF does not support Hebrew at this stage. We may add this in the future.
Please feel free to contact us for anything at any time. Have a great day!
e-iceblue support
-
iceblue support
-
- Posts: 240
- Joined: Tue Dec 21, 2010 2:56 am
Wed Dec 12, 2012 1:24 pm
Is there any change? I also have that problem...
-
avner
-
- Posts: 5
- Joined: Tue Dec 11, 2012 7:40 am
Thu Dec 13, 2012 9:02 am
Hello,
Thanks for your inquiry.
Spire.PDF supports extracting Hebrew text from a PDF file. Some sample code is provided below. Please test it with Spire.PDF_2.6.32 (download link: http://www.e-iceblue.com/Download/download-pdf-for-net-now/spirepdf-packhot-fix2632.html?Itemid=0). If you encounter any problem, please let us know.
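Code:
// Requires: using System.Text; using Spire.Pdf;
private void btnExtractText_Click(object sender, EventArgs e)
{
    // Load the PDF document from disk.
    PdfDocument doc = new PdfDocument();
    doc.LoadFromFile(filepath);

    // Collect the extracted text of every page.
    StringBuilder buffer = new StringBuilder();
    foreach (PdfPageBase page in doc.Pages)
    {
        buffer.Append(page.ExtractText());
    }

    // Show the result in the form's text box.
    richTextBox1.Text = buffer.ToString();
}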
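To check the test result quickly, it may help to confirm that the extracted string really contains Hebrew characters rather than empty or substituted output. The following is only a minimal sketch: `HebrewTextCheck` and `ContainsHebrew` are hypothetical names and are not part of Spire.PDF. It assumes that looking for characters in the Unicode Hebrew block (U+0590 to U+05FF) and the Hebrew presentation forms (U+FB1D to U+FB4F) is a good enough sign that extraction worked. It does not verify character order or reading direction.

```csharp
using System;

// Hypothetical helper for illustration only; not part of Spire.PDF.
public static class HebrewTextCheck
{
    // Returns true if the text contains at least one character from the
    // Unicode Hebrew block (U+0590..U+05FF) or from the Hebrew part of the
    // Alphabetic Presentation Forms block (U+FB1D..U+FB4F).
    public static bool ContainsHebrew(string text)
    {
        if (string.IsNullOrEmpty(text))
        {
            return false;
        }

        foreach (char c in text)
        {
            bool inHebrewBlock = c >= '\u0590' && c <= '\u05FF';
            bool inPresentationForms = c >= '\uFB1D' && c <= '\uFB4F';
            if (inHebrewBlock || inPresentationForms)
            {
                return true;
            }
        }

        return false;
    }
}
```

In the handler above, this could be called as `HebrewTextCheck.ContainsHebrew(buffer.ToString())` before assigning the text to `richTextBox1`.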
Best regards,
Amy
E-iceblue support team
-
amy.zhao
-
- Posts: 2766
- Joined: Wed Jun 27, 2012 8:50 am
Mon Dec 17, 2012 10:09 am
Hello,
Could you please tell us your test result? If you encounter any problem, please feel free to contact us.
Best regards,
Amy
E-iceblue support team
-
amy.zhao
-
- Posts: 2766
- Joined: Wed Jun 27, 2012 8:50 am
Mon Dec 17, 2012 11:34 am
Hi, during the last week I tested the Hebrew solutions:
a. Doc to HTML/TXT works great.
b. Doc to PDF and PDF to Doc work badly.
c. Doc to BMP works badly.
Avner
-
avner
-
- Posts: 5
- Joined: Tue Dec 11, 2012 7:40 am
Tue Dec 18, 2012 2:15 am
Hello Avner,
Thanks for your feedback.
For #b and #c, we are sorry for the inconvenience. Could you please provide us with your PDF file so that we can reproduce the problems and our dev team can investigate? Thank you!
Best regards,
Amy
E-iceblue support team
-
amy.zhao
-
- Posts: 2766
- Joined: Wed Jun 27, 2012 8:50 am
Tue Dec 18, 2012 8:06 am
I am not able to upload pdf/doc/docx: "The extension pdf is not allowed." But you can see the problem with any Hebrew text, even without understanding the language.
-
avner
-
- Posts: 5
- Joined: Tue Dec 11, 2012 7:40 am
Tue Dec 18, 2012 9:14 am
Hello Avner,
Please add your files to a .zip file or send your files to support@e-iceblue.com. Thank you!
Best regards,
Amy
E-iceblue support team
-
amy.zhao
-
- Posts: 2766
- Joined: Wed Jun 27, 2012 8:50 am
Wed Dec 19, 2012 7:34 am
Hello Avner,
We have received your files by email.
We are sorry that Doc to PDF conversion and Doc to BMP conversion do not support right-to-left characters at present. Our dev team has added them as new features to be implemented in Spire.Doc. We will inform you as soon as Spire.Doc supports them. Could you please provide us with the code you used for the PDF to Doc conversion? Thank you!
Best regards,
Amy
E-iceblue support team
-
amy.zhao
-
- Posts: 2766
- Joined: Wed Jun 27, 2012 8:50 am