How to count words in a scanned text Thread poster: Albana Dhimitri
Hi everybody,
I am negotiating the translation of several PDF files, so I first have to count the number of words.
I can convert the PDF files into .doc files, but that doesn't seem to be of any use: the documents are scanned, and Word doesn't seem to count words in such documents.
Any ideas on what a solution could be?
Special thanks to any colleagues who can share their experience, and have a productive day!
Albana Dhimitri Greece Local time: 06:40 Greek to Albanian + ... TOPIC STARTER
Can scanned texts be opened (and translated) in Trados (the 2007 version)?
You can count them "manually". After all, this is one of the reasons you should charge extra for such texts.
As for translating them, you need some form of OCR, but then again, the results are terrible and you need to spend a lot of time fixing things in the resulting document, so you'd be better off translating the text by hand and typing it in Word. Again, this is why you should charge extra. And in case they need the layout of the original replicated, you can always charge extra for DTP services.
This is the way it goes: rubbish in, rubbish out. If they don't want the "rubbish out" part of the equation, they'll have to pay extra.
It will scan the PDF text and then you can convert it to Word.
Get acquainted with OCR-software: http://en.wikipedia.org/wiki/Optical_character_recognition (there is also a list of such programs toward the end of the article).
Here are the steps involved:
1. Use OCR-software to convert the image (scan) into a document;
2. Count the words as you are accustomed, e.g., with Word Count in Word.
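For anyone who prefers a script to Word Count for step 2, here is a minimal Python sketch. It assumes the OCR program from step 1 has already produced plain text; the function names `join_hyphenated_line_breaks` and `count_words` are hypothetical, and the definition of a "word" (a run of letters or digits, optionally joined by apostrophes or hyphens) is an assumption, so the total will not match Word's count exactly.

```python
import re
import unicodedata

# Assumption: a "word" is a run of letters/digits, optionally joined by
# internal apostrophes or hyphens (e.g. "don't", "rubbish-in").
WORD_RE = re.compile(r"\w+(?:['’\-]\w+)*")


def join_hyphenated_line_breaks(text):
    """Rejoin words that were split across lines, e.g. 'scan-\\nned'."""
    return re.sub(r"(\w)-\s*\n\s*(\w)", r"\1\2", text)


def count_words(ocr_text):
    """Count words in text that an OCR program has already extracted."""
    text = unicodedata.normalize("NFC", ocr_text)
    text = join_hyphenated_line_breaks(text)
    # Stray OCR marks such as '|' or '~' contain no letters or digits,
    # so they are not counted as words.
    return len(WORD_RE.findall(text))


if __name__ == "__main__":
    sample = "Hello world, this is a scan-\nned page. | ~"
    print(count_words(sample))
```

Because OCR output usually contains recognition errors, the result is best treated as an estimate for quoting purposes rather than an exact figure.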
Albana Dhimitri Greece Local time: 06:40 Greek to Albanian + ... TOPIC STARTER Thank you for helping! | Dec 2, 2009
Hi,
Epaminondas, Valery and Derek. I really appreciate it! Thank you for the advice. I will study OCR programs and how to use them, and I hope it helps.
VIBOL KEO Local time: 11:40 English to Khmer (Central) TextBridge... | Dec 2, 2009
Well, an expert who worked very closely with me used TextBridge for both numerical and narrative data... I have often used it...
I think it would be helpful to try it; then you can count the words in one of the MS Office programs...
Great and Green Day...!
AnyCount 7 is able to count PDFs, too!
Erik Freitag Germany Local time: 05:40 Member (2006) Dutch to German + ... No, it can't (AFAIK) | Dec 2, 2009
Jutta Deichselberger wrote:
AnyCount 7 is able to count PDFs, too!
But not if they're scanned, which is what the OP is asking for.
Edit: Apologies, I didn't know AnyCount can do OCR now.
[Edited at 2009-12-02 16:38 GMT]
Yes, it can:-) | Dec 2, 2009
That is the reason why I upgraded from version 6 to version 7.
I have already tested it several times; it seems to count a little bit less, maybe 5%...
However, I do not know whether the PDF files were scanned or not...:-((
AnyCount 7 can count text in images | Dec 2, 2009
efreitag wrote:
Jutta Deichselberger wrote:
AnyCount 7 is able to count PDFs, too!
But not if they're scanned, which is what the OP is asking for.
Yes, if you save the scanned PDF as a JPEG. AnyCount 7 has OCR, so it can count image files.