https://glg.proz.com/forum/apple_mac_operating_systems/342539-ocr_macro.html

OCR macro
Thread poster: Hans Lenting

Hans Lenting  Identity Verified
Netherlands
Member (2006)
German to Dutch
Mar 29, 2020

I’ve just OCR’d a PDF with 7 pages of legal text with a Keyboard Maestro macro. The conversion is nearly perfect and was very fast. This feature makes KM a very good investment.


Dylan Jan Hartmann  Identity Verified
Australia
Member (2014)
Thai to English

Moderator of this forum
Post CAT formatting? Mar 29, 2020

The issue we’ve found with OCR tools (and the reason many LSPs forbid their use) is that formatting becomes a nightmare after translation in a CAT tool. This is especially the case if you try to match the original formatting with the OCR tool.

The only workaround we found was to export the OCR output as plain text, clear the formatting, insert the correct formatting, run the file through the CAT tool, and then do an FE before delivery.
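The "clear formatting" step on the plain-text export could be sketched roughly as follows. This is only an illustration, not the workflow's actual tooling: the function name `clean_ocr_text` is hypothetical, and the rule that every word hyphenated across a line break should be rejoined is an assumption that will also merge genuine compounds split at a line end.

```python
import re


def clean_ocr_text(raw: str) -> str:
    """Flatten OCR plain text into one paragraph per line (hypothetical helper)."""
    # Normalize Windows and old Mac line endings to "\n".
    text = raw.replace("\r\n", "\n").replace("\r", "\n")
    # Assumption: a hyphen directly before a line break is a soft hyphen,
    # so "con-\nversion" becomes "conversion".
    text = re.sub(r"(\w)-\n(\w)", r"\1\2", text)
    paragraphs = []
    # Blank lines separate paragraphs; single line breaks are just OCR wrapping.
    for block in re.split(r"\n\s*\n", text):
        flat = " ".join(block.split())  # collapse line breaks, tabs, double spaces
        if flat:
            paragraphs.append(flat)
    return "\n".join(paragraphs)


if __name__ == "__main__":
    sample = "The con-\nversion is  nearly\nperfect.\n\n\nSecond   paragraph.\n"
    print(clean_ocr_text(sample))
```

The cleaned text has no hard line breaks inside paragraphs, which should keep a CAT tool from splitting segments in the middle of a sentence. The correct formatting still has to be reapplied by hand afterwards.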

Let us know what the OCR macro results are like post-TR.



Camila Barbosa  Identity Verified
Brazil
Member (2019)
Portuguese to English
OCR Mar 30, 2020

Dylan Jan Hartmann wrote:

The issue we’ve found with OCR tools (and the reason many LSPs forbid their use) is that formatting becomes a nightmare after translation in a CAT tool. This is especially the case if you try to match the original formatting with the OCR tool.

The only workaround we found was to export the OCR output as plain text, clear the formatting, insert the correct formatting, run the file through the CAT tool, and then do an FE before delivery.

Let us know what the OCR macro results are like post-TR.


I agree. From experience, you need a very clear .pdf document to start with before running OCR.

I personally think that the SmartCat OCR program does a good job but, like everything else related to OCR, it is not perfect.


 

Hans Lenting  Identity Verified
Netherlands
Member (2006)
German to Dutch
TOPIC STARTER
Nice additional feature Mar 31, 2020

Rather than buying an expensive OCR suite for occasional use, this new Keyboard Maestro feature (based on the open-source Tesseract software) is a nice extra for quickly OCRing a dialogue box, a single page, or whatever you have to translate.
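For readers without Keyboard Maestro, the same Tesseract engine can be tried directly from its command-line tool. The sketch below is not how the Keyboard Maestro action works internally. It assumes the `tesseract` binary is installed and on the PATH, and that the input is an image such as a screenshot (the CLI does not read PDF input, so PDF pages would need converting to images first). The function names and the file name `page.png` are made up for illustration.

```python
import subprocess


def build_tesseract_command(image_path: str, lang: str = "eng") -> list[str]:
    """Build the argument list for a plain-text OCR run (hypothetical helper)."""
    # Passing "stdout" as the output base makes the tesseract CLI print the
    # recognized text instead of writing a .txt file; -l selects the language.
    return ["tesseract", image_path, "stdout", "-l", lang]


def ocr_to_text(image_path: str, lang: str = "eng") -> str:
    """Run Tesseract on one image and return the recognized text."""
    result = subprocess.run(
        build_tesseract_command(image_path, lang),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if tesseract exits with an error
    )
    return result.stdout


if __name__ == "__main__":
    # Prints the command only, so this runs even without Tesseract installed.
    print(build_tesseract_command("page.png", "nld"))
```

Calling `ocr_to_text("page.png", "nld")` would then return the recognized Dutch text, provided the matching language data is installed.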



 

