Chinese / Unicode inside SIL file
Chinese / Unicode inside SIL file
We try to translate a application to chinese.
I picked out one string for examination, which is in unicode:
EA 81 03 BC D5 8B 77 8D
That means the unicode characters: $81EA, ...
After entering the data in the SIL-Editor the data is stored as
A6 DB 3F 3F B0 5F
The characters have been translated:
EA 81 => A6 DB, 03 BC => 3F, D5 8B => 3F, 77 8D => B0 5F
The first and the last characters seems to be ok on a chinese windows, but
the chars in the middle are now Ansi "?"-characters and of course are displayed as "?" on all windows versions.
What kind of "translation" occurs ?
The SIL-Editor already displays them as "?", only the DictionaryManager
can show the right characters
Characters $BC03 and $8BD5 are only shown correct, if I fill the string
during runtime into widestrings properties of TNT components.
How can I handle this using the SIL file ?
I picked out one string for examination, which is in unicode:
EA 81 03 BC D5 8B 77 8D
That means the unicode characters: $81EA, ...
After entering the data in the SIL-Editor the data is stored as
A6 DB 3F 3F B0 5F
The characters have been translated:
EA 81 => A6 DB, 03 BC => 3F, D5 8B => 3F, 77 8D => B0 5F
The first and the last characters seems to be ok on a chinese windows, but
the chars in the middle are now Ansi "?"-characters and of course are displayed as "?" on all windows versions.
What kind of "translation" occurs ?
The SIL-Editor already displays them as "?", only the DictionaryManager
can show the right characters
Characters $BC03 and $8BD5 are only shown correct, if I fill the string
during runtime into widestrings properties of TNT components.
How can I handle this using the SIL file ?
Have you adjusted settings for Font and Charset for Chinese language under Default Fonts (see in SIL Editor menu Tools->Default Fonts) and under Fonts and Charsets sections?
Once you adjust these you will be able either to type directly (if your OS supports this) or copy-paste from another applications like MS Word and others.
Please let me know if this helps.
Once you adjust these you will be able either to type directly (if your OS supports this) or copy-paste from another applications like MS Word and others.
Please let me know if this helps.
Best regards,
Igor Siticov.
Igor Siticov.
Yes we have done all this. We followed exactly the instructions on your web side:
http://www.tsilang.com/press/en/adding_ ... uages.html
We set the Font “Tahoma" and the Charset “CHINESEBIG5_CHARSET”.
We tried the same also under an chinese Windows.
Still we are not able to input (by copy paste or by 'Microsoft Pinyin IME’) all characters.
An example is the character feng (风).
http://www.tsilang.com/press/en/adding_ ... uages.html
We set the Font “Tahoma" and the Charset “CHINESEBIG5_CHARSET”.
We tried the same also under an chinese Windows.
Still we are not able to input (by copy paste or by 'Microsoft Pinyin IME’) all characters.
An example is the character feng (风).
Are you able to type all Chinese chars in MS Word? If yes, does copy/paste from MS Word to SIL Editor works? If nothing above works then the way could be as following:
1. Export all translations to Dictionary Manager using Add All.
2. Translate them in Dictionary Manager since it is Unicode and will be able to handle this correctly.
3. Auto-translate in SIL Editor using Auto-Translate function.
Please let me know about the results.
P.S. What OS do you use?
1. Export all translations to Dictionary Manager using Add All.
2. Translate them in Dictionary Manager since it is Unicode and will be able to handle this correctly.
3. Auto-translate in SIL Editor using Auto-Translate function.
Please let me know about the results.
P.S. What OS do you use?
Best regards,
Igor Siticov.
Igor Siticov.
-
- Posts: 4
- Joined: Tue Jan 31, 2006 11:31 am
Hello,
I'm working with Mr.Hagmann on the same problem, and I tried the steps,
but with the same result. Using auto-translate the same occurs. First and last char ok, the both in the middle are "?".
I erased the given translation, it's overwritten with the wrong chars again.
I've tried the font Arial Unicode MS too, same result.
Editing in Word works fine, copy&paste only works with 1. & 4. character.
I'm working with Mr.Hagmann on the same problem, and I tried the steps,
but with the same result. Using auto-translate the same occurs. First and last char ok, the both in the middle are "?".
I erased the given translation, it's overwritten with the wrong chars again.
I've tried the font Arial Unicode MS too, same result.
Editing in Word works fine, copy&paste only works with 1. & 4. character.
Using GB1232_CHARSET and Tahoma for Simplified Chinese (I guess it is better since Chinese translator set this) I was able to enter almost all Chinese characters including feng (风). You can check http://www.sicomponents.com/soft/resourcebuilder.sib it includes Chinese language as well. The only problem is with 밃 character (03 BC).
Best regards,
Igor Siticov.
Igor Siticov.
-
- Posts: 4
- Joined: Tue Jan 31, 2006 11:31 am
-
- Posts: 4
- Joined: Tue Jan 31, 2006 11:31 am
-
- Posts: 4
- Joined: Tue Jan 31, 2006 11:31 am
Hello,
Actually, we still unable to find a solution , except using OS with Chinese default locale and set DefaultCharset to be used for Chinese language everywhere.
Under other cases the only (at least all others used in our Chinese translations converted fine) problematic character is 밃 (03 BC).
Windows is unable to convert this character to ANSI multi-byte character.
I afraid Delphi application may fail to display such character as well.
Actually, we still unable to find a solution , except using OS with Chinese default locale and set DefaultCharset to be used for Chinese language everywhere.
Under other cases the only (at least all others used in our Chinese translations converted fine) problematic character is 밃 (03 BC).
Windows is unable to convert this character to ANSI multi-byte character.
I afraid Delphi application may fail to display such character as well.
Best regards,
Igor Siticov.
Igor Siticov.