Letters gets changed when copied from PDF Document

  • Thread starter Thread starter Ponnurangam
  • Start date Start date
P

Ponnurangam

Hi,

I have a PDF document that has letters in Greek

When try to copy it to Word Document(Microsoft Office Word 2003), some
letters are getting replaced by other ones.(Font: TimesNewRoman)

Here is Two of them:

(1) "U+03AC: Greek small Letter Alpha With Tonos" of PDF is getting replaced
as "U+00DC: Latin Capital Letter U With Diaeresis" in Word

(2) "U+0394: Greek Capital Letter Delta" of PDF is getting replaced as
"U+00C4: Latin Capital Letter A With Diaeresis" in Word


Thanks
Ponnurangam
 
How are you copying? Word has no facility to translate anything to or from
PDF.

--
<>>< ><<> ><<> <>>< ><<> <>>< <>><<>
Graham Mayor - Word MVP

My web site www.gmayor.com

<>>< ><<> ><<> <>>< ><<> <>>< <>><<>
 
I am copying the text from PDF to clipboard and then pasting to the word.

Thanks
Ponnurangam
 
Try edit paste special rather than a simple paste, but my guess is that you
may not get any further with this. PDF is essentially a graphics format and
the conversion of graphics formats back to text is somewhat hit & miss and
unlike OCR software where at least you can train the software to use
particular characters in some circumstances, here you don't have that
option.

--
<>>< ><<> ><<> <>>< ><<> <>>< <>><<>
Graham Mayor - Word MVP

My web site www.gmayor.com

<>>< ><<> ><<> <>>< ><<> <>>< <>><<>
 
I tried Paste Special. It didn't work.Can you suggest some OCR software that
I can use to scan text from an image.

Thanks
Ponnurangam
 
I have had good results taking PDF files and using PaperPort's print driver
to convert them to a PaperPort file and then OCR to Word. Retyping may be
easier.
--

Charles Kenyon

Word New User FAQ & Web Directory: http://addbalance.com/word

Intermediate User's Guide to Microsoft Word (supplemented version of
Microsoft's Legal Users' Guide) http://addbalance.com/usersguide

See also the MVP FAQ: http://www.mvps.org/word which is awesome!
--------- --------- --------- --------- --------- ---------
This message is posted to a newsgroup. Please post replies
and questions to the newsgroup so that others can learn
from my ignorance and your wisdom.
 
Hi,

First, Thanks very much for all your replies

Abbyy Finereader also does the same replacement of characters as I mentioned
earlier. Any other ideas.

Also, I couldn't find any OCR Software for Greek Language. Do you know any
one of them

Thanks
Ponnurangam
 
Hi,

Can you tell me how to do that.

I mean how to convert from PDF to PaperPort file using PaperPort's print
driver and then OCR to Word.

Also, I couldn't find any OCR Software for Greek Language.Do you know any
one of them.

Thanks
Ponnurangam
 
Just a thought - try setting the Windows language to Greek whilst
converting.

--
<>>< ><<> ><<> <>>< ><<> <>>< <>><<>
Graham Mayor - Word MVP

My web site www.gmayor.com

<>>< ><<> ><<> <>>< ><<> <>>< <>><<>
 
Hi,

Setting the language to Greek while converting using Abbyy Finereader didn't
work

Thanks
Ponnurangam
 
If you like you could send me a copy of the pdf, via the link on my web
site, with confirmation of where the problem occurs and I'll try some
alternatives.

--
<>>< ><<> ><<> <>>< ><<> <>>< <>><<>
Graham Mayor - Word MVP

My web site www.gmayor.com

<>>< ><<> ><<> <>>< ><<> <>>< <>><<>
 
Hi,

I got a solution for this. It seems my PC doesn't have "Times New Roman Dual
Greek" font. I need to install it

Thanks very Much
Ponnurangam
 
TNR Unicode does contain the Basic Greek character subset, which should
include most of the needed glyphs (including all the ones you mention).

--
Suzanne S. Barnhill
Microsoft MVP (Word)
Words into Type
Fairhope, Alabama USA

Email cannot be acknowledged; please post all follow-ups to the newsgroup so
all may benefit.
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Back
Top