Red Hat Bugzilla – Bug 182227
Reporting Errors for Scim tables for Bengali
Last modified: 2013-03-03 21:20:45 EST
From Bugzilla Helper:
User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7.12) Gecko/20060130 Red Hat/1.0.7-1.4.3 Firefox/1.0.7
Description of problem:
Errors in the existing Scim tables for Bengali:
4. Unijoy [not tested due to pending legal advise]
Version-Release number of selected component (if applicable):
Steps to Reproduce:
1.Activate Scim IME
2.Choose Aprropriate Language i.e. Bengali and layout
Actual Results: Wrong representation of a few keypoints and missing keypoints
Expected Results: The modified keypoints have been mentioned in the attached files:
Since some of the keypoints could not be keyed in, the Unicode values of the characters to be represented have been mentioned.
Created attachment 124939 [details]
corrections for bn-inscript.mim
Created attachment 124940 [details]
corrections for bn-itrans.mim
Created attachment 124941 [details]
corrections for bn-probhat.mim
Runa, could you attach the final corrected versions of the .mim files
to make it easier for me to merge in your fixes.
Ignore comment 4 - your changes are quite clear and well documented. :)
A couple of comments:
1) in bn-probhat-revised.mim:
-("$" ?à§³) <-- this is actually U09F3
+("$" ?<Bangladeshi Currency symbol U09F3>)
perhaps it is a font issue?
2) in bn-itrans-revised.mim:
<!--These alphabets and dependent vowels are not used in modern bengali. Most of
the other keymaps do not provide them either--!>
<!--Not Unicode recognised codepoints--!>
So you think those mappings should be removed?
is the same, no?
Answer to comment 5:
#1.. it looks like a font issue
#2.. better to remove as they are not in unicode anyways
Answer to comment 6:
i put that in just for clarification.. coz the box might have created a confusion.
1) I take it it is ok to remove these too:
("LLi" (delete @-) "à§¢") ; not in ITRANS Bengali table
("L^i" (delete @-) "à§¢") ; not in ITRANS Bengali table
("#" "à§à¦°") ; not in ITRANS Bengali table
("$" "à¦°à§") ; not in ITRANS Bengali table
("^" "à¦¤à§à¦°") ; not in ITRANS Bengali table
("*" "à¦¶à§à¦°") ; not in ITRANS Bengali table
are bad because they are combinations of codepoints?
1) yup. should not be a problem
2) they are not unicode recognised codepoints these are like conjuncts or
semiconjuncts someone put in as they are used a lot. no official endorsement so
far. we did not include it in iiimf-inscript either
They have been removed in 1.3.3-2.