Pages in topic: < [1 2] | Translating a website: Tool for downloading hundreds of files and counting words Thread poster: Rajan Chopra
| Sebastian Witte Germany Local time: 17:17 Member (2004) English to German + ... | Translating a website: Tool for downloading hundreds of files and counting words | Dec 6, 2010 |
langclinic wrote:
Hi friends,
A translation agency wants me to tranlate a website that contains dozens of links and in every link there are many links and sub-links. If I download the files one by one, it will take a great deal of time. Is there any method to download all files available on a website (Word files, pdf files, html pages and scanned pages etc.) in a quick and convenient manner?
Secondly, there is no problem in counting the words in MS Word files as I can go to Tools and ascertain the word count but is there any tool to count words in pdf files, html pages etc as counting them manually will kill a lot of time.
Thanks in advance for your precious help.
Regards,
Chopra
Although I agree with what colleagues have already said, perhaps you should give a try to Aquino WebBudget to verify if it can be among your options...
[Edited at 2010-12-06 00:35 GMT] | | | Heiko Schmieder (X) Germany Local time: 17:17 English to German + ... HTML, CMS > what does the client need? | Dec 6, 2010 |
Hello,
I just want to add some thoughts to the topic website translation. It does not always make sense to 'download' the whole site as you don't know what system is behind. If someone uses a CMS the translated html, php whatever files might not be useful as he or she has to transfer the content anyway manually. So my first question would be - do you work with CMS? Next you might ask if they use CMS if there is the possibility to export content - e.g. via XML or any other format. ... See more Hello,
I just want to add some thoughts to the topic website translation. It does not always make sense to 'download' the whole site as you don't know what system is behind. If someone uses a CMS the translated html, php whatever files might not be useful as he or she has to transfer the content anyway manually. So my first question would be - do you work with CMS? Next you might ask if they use CMS if there is the possibility to export content - e.g. via XML or any other format.
Website translations are not quite trivial in terms of technology - so before downloading the whole site I would clarify this matter. So in case of CMS there won't be any Word files or kind of draft because its all written directly into the CMS...
Beside the technology there can be also the issue of translating php with Trados TagEditor which also can be a problem because TagEditor in standard config will add/delete some coding that makes php useless (at least in my case I experienced that)
Good luck and best,
Heiko ▲ Collapse | | | A word of warning | Dec 6, 2010 |
The agency does not seem very competent, just telling you to "translate the website".
You should keep in mind that the website could be updated by the owner while you're translating it, and apart from the fact that you would be trying to translate a moving target you could also have a complaint that you've not translated everything.
In addition, if the website is for example dynamic and you download the pages, the translation will be useless to the end customer if you t... See more The agency does not seem very competent, just telling you to "translate the website".
You should keep in mind that the website could be updated by the owner while you're translating it, and apart from the fact that you would be trying to translate a moving target you could also have a complaint that you've not translated everything.
In addition, if the website is for example dynamic and you download the pages, the translation will be useless to the end customer if you translate the individual pages, as such pages do not exist on the customer's site and they will not be able to easily match the text to the pages. Same if it is a CMS.
Now, how to proceed:
- If the website is static (plain html, pdfs, etc.), request that they send you the full package. ALL the site is hosted on a server, so it's not a problem to get the whole set, the webmaster should be able to do it easily. If the resulting set of files is too big for e-mail, let them zip or rar it, and use a service such as yousendit to forward you the full set. You can return it in the same way.
- If the site is dynamic (say, with a database), let them send you the pages with static text AND an export of the database (say, In Excel). Return exactly what they have sent you, so that they can import it again.
- If the site is based on a CMS (Content Management System), ask for either an export of access to the CMS, if it allows side-by-side languages.
I advise however very strongly against downloading the site, unless specifically instructed to do so. ▲ Collapse | |
|
|
TagEditor and PHP | Dec 6, 2010 |
Heiko Schmieder wrote:
Beside the technology there can be also the issue of translating php with Trados TagEditor which also can be a problem because TagEditor in standard config will add/delete some coding that makes php useless (at least in my case I experienced that)
Totally correct. And even if the coding is not deleted, Trados TagEditor takes PHP code as a tag, and prevents its edition. No problem if it is pure code, but if it has embedded text then you're in trouble.
Some time ago I wrote a post about whether TagEditor was adequate for website translation (See http://www.seo-translator.com/is-trados-tageditor-adequate-for-website-translation/ ), and the response was that it was "somewhat" adequate. | | | Laurent KRAULAND (X) France Local time: 17:17 French to German + ... "Somewhat adequate" | Dec 6, 2010 |
Indeed... I can think of at least one CAT which handles PHP (somewhat) better. | | | Rajan Chopra India Local time: 21:47 Member (2008) English to Hindi + ... TOPIC STARTER Thanks to all of you for your valuable advice | Dec 7, 2010 |
I am thankful to all of you for suggesting the links of many sites which help in downloading and counting the words.
Heiko Schmieder wrote:
So my first question would be - do you work with CMS? Next you might ask if they use CMS if there is the possibility to export content - e.g. via XML or any other format.
Heiko
I am required to provide them the Word files and they will be processing it further to upload to the website.
Regards,
Chopra | | |
langclinic wrote:
I am required to provide them the Word files and they will be processing it further to upload to the website.
Urgggh... another website owner & language agency that don't know shit about website translation or multilingual SEO. Word, for heaven's sake! | |
|
|
Laurent KRAULAND (X) France Local time: 17:17 French to German + ... Final responsibility | Dec 7, 2010 |
Ramon Somoza wrote:
langclinic wrote:
I am required to provide them the Word files and they will be processing it further to upload to the website.
Urgggh... another website owner & language agency that don't know shit about website translation or multilingual SEO. Word, for heaven's sake!
The final responsibility is with the client, so what?
I certainly hate this low-key approach as much as you do, Ramon, but sometimes we only have to shrug and look the other way. Like in "You can lead a horse to the river..."
[Modifié le 2010-12-07 20:20 GMT] | | | Pages in topic: < [1 2] | To report site rules violations or get help, contact a site moderator: You can also contact site staff by submitting a support request » Translating a website: Tool for downloading hundreds of files and counting words Anycount & Translation Office 3000 | Translation Office 3000
Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.
More info » |
| Pastey | Your smart companion app
Pastey is an innovative desktop application that bridges the gap between human expertise and artificial intelligence. With intuitive keyboard shortcuts, Pastey transforms your source text into AI-powered draft translations.
Find out more » |
|
| | | | X Sign in to your ProZ.com account... | | | | | |