Occasionally we need to parse a PDF document whose text comes out scrambled. For such a document, the Fonts tab of Document Properties in Adobe Reader may say:
Tahoma+1(Embedded)
Type: TrueType
Encoding: Built-in
If I try to extract the text using either (string)ACPDFCREACTIVEX.IacObject["Text"] or axPDFCreactiveX1.GetRawPageText(1), I get something like "=0B?,'1-7,;?:" rather than the text actually displayed (font issue?). PDF Suite Desktop Edition displays the correct text, but I suspect it is using the funky font to display it.
How do I extract the text as displayed in the PDF?
Scrambled font
Re: Scrambled font
What version of the PDF Creator are you using?
Re: Scrambled font
I got essentially the same results in both 3.1 and 4.0.
Re: Scrambled font
Hello,
We had another customer with a similar issue and setting the OptimizeDocument option resolved this situation.
Example:
axPDFCreactiveX1.OptimizeDocument 1
Thanks
Jose
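For C# callers, like the code in the original question, the suggestion above might look roughly like the sketch below. It is only an illustration under assumptions: ScrambledTextHelper and ExtractPageText are hypothetical names that are not part of the Amyuni API, the control is assumed to already have the document loaded, and the only library calls used are the two that appear in this thread (OptimizeDocument and GetRawPageText). The level 1 is the value from the reply; this thread does not say what other values do.

```csharp
using System;

static class ScrambledTextHelper
{
    // Hypothetical helper, not part of the Amyuni API.
    // "pdf" is assumed to be the PDF Creator control (for example axPDFCreactiveX1)
    // with the document already loaded. It is typed as dynamic so that this
    // sketch compiles without a reference to the control's interop assembly.
    public static string ExtractPageText(dynamic pdf, int pageNumber)
    {
        // Optimize the document first, using the level given in the reply above.
        pdf.OptimizeDocument(1);

        // Then read the page text with the same call used in the question.
        return (string)pdf.GetRawPageText(pageNumber);
    }
}
```

A call such as ScrambledTextHelper.ExtractPageText(axPDFCreactiveX1, 1) would then replace the direct GetRawPageText(1) call. Whether this fixes a particular document still has to be checked against the text shown in the viewer.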