Red Hat Bugzilla – Bug 438662
ordering of candidates is wrong in Zhu Yin
Last modified: 2008-06-19 05:19:06 EDT
Description of problem:
When SCIM is switched to Zhu Yin IM, I found the ordering of candidates is
different as compared to SCIM in FC5, FC6, and F7. Infrequently used characters
appeared earlier instead of later. FC5 is reasonable, but FC6 and F7 are "wrong".
For example, if I type the character "今", which is the first candidate in the
first page I should see after typing it, it is not there anymore. It looks like
positions of all candidates are just shuffled. (see attachment)
Based on FC5 and Windows, users expect candidate ordering prioritized by:
1. characters commonly used
2. number of strokes
Below are the scim packages I have:
This is a serious problem for Zhu Yin users, who are major IM users of
Traditional Chinese. They are not able to type in Chinese well because they will
waste time looking around for suitable candidates. For this reason, it is
unreasonable to upgrade away from FC5.
Version-Release number of selected component (if applicable):
If I type the character "今", which is the first candidate in the first page I
should see after typing it, it is not there anymore. It looks like positions of
all candidates are just shuffled.
Steps to Reproduce:
1. Switch to Zhu Yin
2. Type "rup"
Character "今", the first candidate in the first page. It is on a later page.
It should be on the first page.
Created attachment 298881 [details]
bad candidates on first page
Hi Peng, Would you know how Zhu Yin load in order? Is it rely on the order in
the table files?
I remember a frequency could be specified for every phrase in the table file.
The table engine will short the candidates by frequency. Does the Zhu Yin table
provide the correct frequencies for phrases or chars?
Changing version to '9' as part of upcoming Fedora 9 GA.
More information and reason for this action is here:
Compared the Zhu Yin table file in rawhide and FC-4, "今" is lower than those
characters in rawhide but it is the first candidate for "rup" in FC-4. The order
in the table file should be the default order for the candidates to be shown on
the look-up window.
I need to confirm if latest Zhu Yin has character/phrase frequency mechanism
which could adjust the candidate order from learning. Besides, generally
frequently used chars like "今" should be at higher rank initially. We should
have a more practical table file.
Wondering how the order of characters in the latest table file referred to.
The character order is changed on then:
Revision 1.9 - (view) (download) (annotate) - [select for diffs]
Mon Jan 16 07:06:14 2006 UTC (2 years, 5 months ago) by suzhe
Changes since 1.8: +43763 -43501 lines
Diff to previous 1.8
Update ZhuYin tables according to the latest table from CMEX.
I haven't estimate the workload to revert to the version before this change,
with all clean later fixes merged.
I could rearrange the candidate orders based on the table in FC-5. However,
there are changes which made by upstream since then which might have minor
effects on specific characters.
For example, "兒" was moved from "-" to "-6". Then the candidate after 兒 will
be moved up towards the head of candidate list. To ensure you have exact
character always at exact location in your current Fedora release version as it
was in FC-5, I suggest you recompile scim-tables with the ZhuYin.txt.in and
ZhuYin-Big.txt.in copied over from FC-5 sources (tarball/src.rpm).
Thank you very much.
Decided to revert the Zhu Yin table to the one in FC-5. Due to the huge size of
data (~58k entries), fixing the latest table manually by me alone isn't feasible
to be finished in short period.
Created attachment 309816 [details]
Zhu Yin table in FC-5.
Created attachment 309817 [details]
Zhu Yin table (Big) in FC-5.
Built to rawhide:
I compiled ZhuYin.txt.in and ZhuYin-Big.txt.in from FC-5 and Zhu Yin works fine
now. Candidate orders are back to how they were. The position of "兒" is also
Thank you very much! You have been great in trying to get this problem fixed.