Spire.PDF is a professional PDF library applied to creating, writing, editing, handling and reading PDF files without any external dependencies. Get free and professional technical support for Spire.PDF for .NET, Java, Android, C++, Python.

Thu Mar 09, 2023 4:39 am

My Sample Data: https://drive.google.com/file/d/1lwgEYp ... share_link
1.png


I try extract text by Version 9.3.2, But some character can't extract:

Code: Select all
'Create a PdfDocument object
        Dim doc As PdfDocument = New PdfDocument()

        'Load a PDF file
        doc.LoadFromFile("D:\admin\SpirePDF\test.pdf")

        'Get the second page
        Dim page As PdfPageBase = doc.Pages(0)

        'Create a PdfTextExtractot object
        Dim textExtractor As PdfTextExtractor = New PdfTextExtractor(page)

        'Create a PdfTextExtractOptions object
        Dim extractOptions As PdfTextExtractOptions = New PdfTextExtractOptions()

        'Set isExtractAllText to true
        extractOptions.IsExtractAllText = True

        'Extract text from the page
        Dim text As String = textExtractor.ExtractText(extractOptions)


Result:

3.png


If i extract text by Version 7.3.3, It is ok:
2.png


Why Last Version of Spire PDF can't extract some text?

daitranthanhhoa
 
Posts: 51
Joined: Mon Sep 19, 2016 3:04 am

Thu Mar 09, 2023 7:53 am

Hi,

Thanks for your feedback.
After testing, I reproduced your issue and logged it into our issue tracking system with the ticket number SPIREPDF-5849, our developers will investigate and fix it. Sorry for the inconvenience caused. Once the issue is fixed, I will inform you asap.

Sincerely,
Triste
E-iceblue support team
User avatar

Triste.Dai
 
Posts: 1000
Joined: Tue Nov 15, 2022 3:59 am

Tue Apr 04, 2023 8:32 am

:?:

daitranthanhhoa
 
Posts: 51
Joined: Mon Sep 19, 2016 3:04 am

Tue Apr 04, 2023 8:59 am

Hi,

Thanks for your following-up.
I have checked on the status of your bug report and found that the code fix has been submitted. However, during testing, our team discovered that addressing the bug may have unintended consequences for other parts of the code, resulting in a failed test.
Our developers are currently working diligently to find alternative solutions to fix this issue . Please spare us more time.
We appreciate your patience and understanding during this process. If you have any further questions or concerns, please do not hesitate to contact us.

Best regards.
Triste
E-iceblue support team
User avatar

Triste.Dai
 
Posts: 1000
Joined: Tue Nov 15, 2022 3:59 am

Fri Apr 28, 2023 6:45 am

Hi,

Thanks for your patience.
Glad to inform you that we just released Spire.PDF 9.4.2 hotfix, which has fixed your issue SPIREPDF-5849, please download the new version from the following links and have a test.
Website link: https://www.e-iceblue.com/Download/download-pdf-for-net-now.html
Nuget link: https://www.nuget.org/packages/Spire.PDF/9.4.12

Best regards,
Triste
E-iceblue support team
User avatar

Triste.Dai
 
Posts: 1000
Joined: Tue Nov 15, 2022 3:59 am

Return to Spire.PDF