Text boxes in Microsoft Word 2000
Persoa que publicou o fío: david angel (X)
david angel (X)
david angel (X)
Local time: 21:29
French to English
+ ...
Mar 2, 2006

Do other people find text boxes one of the most annoying features of a translator's life? I have a programme - ScanSoft PDF Converter 3 - which will transfer a pdf document into microsoft word doc, but it puts the text in boxes which creates every kind of problem. My main question is: how do you get rid of them? Is it possible? I expect the answer is obvious but I have looked everywhere...

 
Jerzy Czopik
Jerzy Czopik  Identity Verified
Germany
Local time: 22:29
Membro (2003)
Polish to German
+ ...
Save the document without formatting Mar 2, 2006

and format it by hand. You will be faster than using any PDF-converter.
And the tex boxes are annoying, but AFAIK you cannot simply remove them without first copying the text from a box and then pasting it into the main document again - but even then you will need to reformat.

Regards
Jerzy


 
Antoní­n Otáhal
Antoní­n Otáhal
Local time: 22:29
Membro (2005)
English to Czech
+ ...
Try different software Mar 2, 2006

The best choice in my opinion is FineReader (which is an OCR program, but can be used for converting pdf files; there are no text boxes or frames in the output); there are quite a few "specialised pdf conversion" programs, but a good program of this sort will let you choose whether you want text boxes/frames in your output (and sometimes it is a good option when there are many images, captions, etc. in the file).

Whichever option you use, manual adjustments and/or corrections
... See more
The best choice in my opinion is FineReader (which is an OCR program, but can be used for converting pdf files; there are no text boxes or frames in the output); there are quite a few "specialised pdf conversion" programs, but a good program of this sort will let you choose whether you want text boxes/frames in your output (and sometimes it is a good option when there are many images, captions, etc. in the file).

Whichever option you use, manual adjustments and/or corrections are necessary as a rule (before or after the conversion).

If you can persuade your customer to provide you with the real source (from which their pdf was created), the result is likely to be much more professional.

HTH

Antonin
Collapse


 
david angel (X)
david angel (X)
Local time: 21:29
French to English
+ ...
INICIO DE TEMA
Thanks Mar 2, 2006

Jerzy Czopik wrote:

and format it by hand. You will be faster than using any PDF-converter.
And the tex boxes are annoying, but AFAIK you cannot simply remove them without first copying the text from a box and then pasting it into the main document again - but even then you will need to reformat.

Regards
Jerzy


Thanks, Jerzy. Probably because i only have Adobe Acrobat Reader, I can't copy text. I can only save it as txt, but then what does one do with it?

What's more, when I try to copy the contents of a text box, Word just copies the whole box.

Very frustrating.


 
Jerzy Czopik
Jerzy Czopik  Identity Verified
Germany
Local time: 22:29
Membro (2003)
Polish to German
+ ...
Save as txt and open in Word Mar 2, 2006

format as usuall, use styles.
Save as Word - this is the way I would go.
Finereader or PDF converter insert to many section breaks and other formatting issues for my taste. This is my very personal opinion, maybe because I´m quite fast in formatting with Word.

To copy text from a text box you need to select the text, but not select the last paragraph mark in the text box. This should usually copy only the text.... See more
format as usuall, use styles.
Save as Word - this is the way I would go.
Finereader or PDF converter insert to many section breaks and other formatting issues for my taste. This is my very personal opinion, maybe because I´m quite fast in formatting with Word.

To copy text from a text box you need to select the text, but not select the last paragraph mark in the text box. This should usually copy only the text.

Regards
Jerzy
Collapse


 
Antoní­n Otáhal
Antoní­n Otáhal
Local time: 22:29
Membro (2005)
English to Czech
+ ...
Of course it is a matter of personal taste Mar 2, 2006

In my experience, if you just save as txt or doc from Acrobat, even a simple flow of text may be corrupted (the correct order of lines is mixed, so the ends come before the beginnings) - I find this rather annoying.

With FineReader, if you do not let it analyse the layout automatically but do it by hand, and then choose the "softest" saving option (only tables, paragraphs and fonts), you eliminate most of the problems you refer to; and the rest can be easily resolved with a few macr
... See more
In my experience, if you just save as txt or doc from Acrobat, even a simple flow of text may be corrupted (the correct order of lines is mixed, so the ends come before the beginnings) - I find this rather annoying.

With FineReader, if you do not let it analyse the layout automatically but do it by hand, and then choose the "softest" saving option (only tables, paragraphs and fonts), you eliminate most of the problems you refer to; and the rest can be easily resolved with a few macros.

Unfortunately, I am compelled by my customers quite often to process pdf files (both scanned images and output form various creation tools), so I have developed a procedure I feel is optimal and takes the least time.

But we all must find our own ways in the end, I suppose.

Antonin
Collapse


 
Jerzy Czopik
Jerzy Czopik  Identity Verified
Germany
Local time: 22:29
Membro (2003)
Polish to German
+ ...
Antonin, you are absolutelty right Mar 2, 2006

With FineReader the possible settings and the results are very satisfactory. I do not know the other tool mentioned here...

IMO Finereader is the best OCR tool on the market.

Regards
Jerzy


 
May_L
May_L
Local time: 03:29
Membro
English to Thai
Do you mean getting texts in table? Mar 3, 2006

If so, it always happens. I always face this when copy texts from web pages.

In MS Word, I normally select the table containing the text and then > Table > Convert > Table to text. If there is table in table, then the process has to be repeated.

Hope this help!

May L.


 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Text boxes in Microsoft Word 2000






Protemos translation business management system
Create your account in minutes, and start working! 3-month trial for agencies, and free for freelancers!

The system lets you keep client/vendor database, with contacts and rates, manage projects and assign jobs to vendors, issue invoices, track payments, store and manage project files, generate business reports on turnover profit per client/manager etc.

More info »
Anycount & Translation Office 3000
Translation Office 3000

Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.

More info »