From Bugzilla Helper: User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7.12) Gecko/20060130 Red Hat/1.0.7-1.4.3 Firefox/1.0.7 Description of problem: Errors in the existing Scim tables for Bengali: 1. Inscript 2. Probhat 3. Itrans 4. Unijoy [not tested due to pending legal advise] Version-Release number of selected component (if applicable): How reproducible: Always Steps to Reproduce: 1.Activate Scim IME 2.Choose Aprropriate Language i.e. Bengali and layout 3. Actual Results: Wrong representation of a few keypoints and missing keypoints Expected Results: The modified keypoints have been mentioned in the attached files: 1. bn-inscript-revised.mim 2. bn-probhat-revised.mim 3. bn-itrans-revised.mim Additional info: Since some of the keypoints could not be keyed in, the Unicode values of the characters to be represented have been mentioned.
Created attachment 124939 [details] corrections for bn-inscript.mim
Created attachment 124940 [details] corrections for bn-itrans.mim
Created attachment 124941 [details] corrections for bn-probhat.mim
Runa, could you attach the final corrected versions of the .mim files to make it easier for me to merge in your fixes.
Ignore comment 4 - your changes are quite clear and well documented. :) A couple of comments: 1) in bn-probhat-revised.mim: -("$" ?à§³) <-- this is actually U09F3 +("$" ?<Bangladeshi Currency symbol U09F3>) perhaps it is a font issue? 2) in bn-itrans-revised.mim: <!--These alphabets and dependent vowels are not used in modern bengali. Most of the other keymaps do not provide them either--!> <!--Not Unicode recognised codepoints--!> So you think those mappings should be removed?
Also this: -("t.h" "à§") +("t.h" "<U09CE>") is the same, no?
Answer to comment 5: #1.. it looks like a font issue #2.. better to remove as they are not in unicode anyways Answer to comment 6: i put that in just for clarification.. coz the box might have created a confusion. hth runa
More questions: 1) I take it it is ok to remove these too: ("LLi" (delete @-) "à§¢") ; not in ITRANS Bengali table ("L^i" (delete @-) "à§¢") ; not in ITRANS Bengali table ? 2) These: ("#" "à§à¦°") ; not in ITRANS Bengali table ("$" "রà§") ; not in ITRANS Bengali table ("^" "তà§à¦°") ; not in ITRANS Bengali table ("*" "শà§à¦°") ; not in ITRANS Bengali table are bad because they are combinations of codepoints?
1) yup. should not be a problem 2) they are not unicode recognised codepoints these are like conjuncts or semiconjuncts someone put in as they are used a lot. no official endorsement so far. we did not include it in iiimf-inscript either
They have been removed in 1.3.3-2.
Resolved.