Extract Text with location

Thu Apr 14, 2016 5:34 pm

Hi,

I have a PDF with tabular data, and would like to retrieve the text blocks, their locations, and contents. Is this possible with Spire? I perused the API docs but found nothing related to text location.

Fri Apr 15, 2016 2:38 am

Hi,

Thanks for your post.
Sorry, Spire.PDF does not support this requirement at present.

Best Regards,
Amy
E-iceblue support team

Sat Jan 09, 2021 1:36 pm

Are there any plans in the near future to implement extracting text together with location information from a .pdf?

Mon Jan 11, 2021 8:40 am

Hello,

Thanks for your inquiry, and sorry for the late reply due to the weekend.

I am sorry that Spire.PDF does not currently offer text extraction with location information. However, I would suggest using the FindAllText() method to find all text along with its location. Here is my test code.

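Code:

using System.Collections.Generic;
using System.Drawing;
using Spire.Pdf;
using Spire.Pdf.General.Find;

// Load the document
PdfDocument doc = new PdfDocument();
doc.LoadFromFile("E:\\testdoc\\sample2.pdf");

List<PdfTextFindCollection> collections = new List<PdfTextFindCollection>();
Dictionary<PointF, string> textANDposition = new Dictionary<PointF, string>();

// Collect all text finds from every page
foreach (PdfPageBase page in doc.Pages)
{
    PdfTextFindCollection collection = page.FindAllText();
    collections.Add(collection);
}

// Map each find's position to its text
foreach (PdfTextFindCollection collection in collections)
{
    foreach (PdfTextFind find in collection.Finds)
    {
        PointF point = find.Position;
        string str = find.MatchText;
        // Note: Add throws an ArgumentException if two finds share the same
        // position, which can happen across different pages.
        textANDposition.Add(point, str);
    }
}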

I hope it meets your needs. I am looking forward to your feedback.

Sincerely,
Marcia
E-iceblue support team
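
For tabular data like that in the original question, the (position, text) pairs collected above could then be grouped into rows. The following is only a minimal sketch and is not part of Spire.PDF: the RowGrouping class, the GroupIntoRows method, and the 2-unit tolerance are hypothetical names and values chosen for illustration, and it assumes that fragments on the same table row have nearly equal Y coordinates.

```csharp
using System;
using System.Collections.Generic;
using System.Drawing;
using System.Linq;

// Hypothetical helper, not a Spire.PDF API.
public static class RowGrouping
{
    // Groups text fragments into rows. A fragment joins the current row when its Y
    // value is within `tolerance` of the first fragment in that row (an assumption
    // about the layout). Cells within a row are ordered left to right by X.
    public static List<List<string>> GroupIntoRows(
        IEnumerable<(PointF Position, string Text)> fragments, float tolerance = 2f)
    {
        var rows = new List<List<string>>();
        var current = new List<(PointF Position, string Text)>();

        foreach (var frag in fragments.OrderBy(x => x.Position.Y).ThenBy(x => x.Position.X))
        {
            // Start a new row when the Y gap to the row's first fragment is too large.
            if (current.Count > 0 &&
                Math.Abs(frag.Position.Y - current[0].Position.Y) > tolerance)
            {
                rows.Add(current.OrderBy(c => c.Position.X).Select(c => c.Text).ToList());
                current = new List<(PointF Position, string Text)>();
            }
            current.Add(frag);
        }

        // Flush the last row, if any.
        if (current.Count > 0)
        {
            rows.Add(current.OrderBy(c => c.Position.X).Select(c => c.Text).ToList());
        }
        return rows;
    }
}
```

To use it with the dictionary from the test code, one could pass textANDposition.Select(kv => (kv.Key, kv.Value)). The tolerance would likely need tuning per document, and a multi-page document would need to be grouped page by page, since positions are relative to each page.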

Mon Jan 11, 2021 10:51 am

This is what I was looking for. Thank you for the quick answer.

Regards,
Brane

Tue Jan 12, 2021 1:20 am

Hello,

Thanks for your feedback!

Glad to hear that the code meets your needs.

If you encounter any issues related to our product in the future, please feel free to contact us.

Have a nice day!

Sincerely,
Marcia
E-iceblue support team
