I have an issue extracting text from a PDF that has been scanned and OCR'd in Greece. When I call ExtractText, the string returned includes a bunch of undisplayable characters meaning that although I can cut and paste the contents from Adobe Reader, I can't get at from within my software.
Is there a way to specify the codepage or encoding of the string returned by ExtractText or do you have any alternate advice?
Many thanks in advance,
Darren.