This site uses cookies.
Some of these cookies are essential to the operation of the site,
while others help to improve your experience by providing insights into how the site is being used.
For more information, please see the ProZ.com privacy policy.
MultiTerm Fatal Error while importing a Unicode *.mtf: Invalid character (Unicode: 0x19)
Thread poster: Pavel Tsvetkov
Pavel Tsvetkov Bulgaria Local time: 01:32 Member (2008) English to Bulgarian + ...
MODERATOR
Mar 3, 2018
Dear All,
1. I have a unicode .txt file with tab delimited terms in Bulgarian and English. Naturally, the contents of the file look like this:
"максимум 15 дни" "or a period of time not longer than 15 days"
"трудов стаж" "length of service"
"езависимо от стажа" "regardless of their length of service"
"съкращение в щата" "personnel downsizing"
"съпруг" "spouse"
1. I have a unicode .txt file with tab delimited terms in Bulgarian and English. Naturally, the contents of the file look like this:
"максимум 15 дни" "or a period of time not longer than 15 days"
"трудов стаж" "length of service"
"езависимо от стажа" "regardless of their length of service"
"съкращение в щата" "personnel downsizing"
"съпруг" "spouse"
2. I used SDL MultiTerm 2017 Convert to convert the .txt to the required .mtf file for MultiTerm import.
3. I created a new termbase with MultiTerm 2017.
4. When trying to import the .mtf file into the newly created termbase with the following options:
Import definition name: Default import definition
Termbase name: DEVNYA CEMENT
Import file: C:\Users\User\Desktop\TRANSP\déjà vu x 3 TM Export\*.mtf.xml
Import log file: C:\Users\User\Desktop\TRANSP\déjà vu x 3 TM Export\*.mtf.log
Exclusion file:
Allow over-complete entries: false
Allow incomplete entries: true
Ignore sub-languages: true
Full reorganization: true
Import Options:
Import all entries: Add import entry as new
I can only import about 4% of the terms and the log file contains the following error:
Fatal Error at (file C:\Users\User\Desktop\TRANSP\déjà vu x 3 TM Export\*.mtf.xml, line 3, column 143246): Invalid character (Unicode: 0x19)
5. My question is: how can I locate the invalid character defined as Unicode: 0x19, so that I can delete it?
Translate faster & easier, using a sophisticated CAT tool built by a translator / developer.
Accept jobs from clients who use Trados, MemoQ, Wordfast & major CAT tools.
Download and start using CafeTran Espresso -- for free
The leading translation software used by over 270,000 translators.
Designed with your feedback in mind, Trados Studio 2022 delivers an unrivalled, powerful desktop
and cloud solution, empowering you to work in the most efficient and cost-effective way.