Bug 1055112
| Summary: | Many basic Norwegian Bokmaal words are no longer recognised | ||
|---|---|---|---|
| Product: | [Fedora] Fedora | Reporter: | Peter <p.g.c.bugsreport> |
| Component: | hunspell-no | Assignee: | Caolan McNamara <caolanm> |
| Status: | CLOSED ERRATA | QA Contact: | Fedora Extras Quality Assurance <extras-qa> |
| Severity: | high | Docs Contact: | |
| Priority: | unspecified | ||
| Version: | 22 | CC: | caolanm, dtardon, extras-orphan, wberrier |
| Target Milestone: | --- | ||
| Target Release: | --- | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | hunspell-no-2.0.10-1.fc22 | Doc Type: | Bug Fix |
| Doc Text: | | Story Points: | --- |
| Clone Of: | | Environment: | |
| Last Closed: | 2015-08-04 05:25:25 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
Description
Peter
2014-01-18 21:05:12 UTC
I've also encountered this on rhel7. The additional word I noticed was missing: "mitt"

It appears that this is an upstream issue with the Norwegian dictionary. I've sent a message to their mailing list at: https://lister.ping.uio.no/pipermail/i18n-no/ (My posting from today hasn't shown up yet)

It seems that until this gets addressed, the previous version should be used... ?

I looked at Ubuntu 14.04: they ship myspell-nb, which appears to be an older version: 2.0.10. Not sure if this is the same as fc17, since the dictionaries are quite different. Anyway, I post this information only to suggest that Ubuntu (maybe other distros?) may not be running into this because they are using an older version.

I'll also paste my mailing list message here for reference since it hasn't shown up on the mailing list archives yet: ( https://lister.ping.uio.no/pipermail/i18n-no/ )

----------------------------
Hello,

My end goal is to do Norwegian spell checking within emacs. Emacs on my linux distribution (centos7) is using hunspell. I got that working, but it turns out there are several basic words that are missing from the dictionary. For example, this command line example shows the issue:

-----------------------------
$ hunspell -d nb_NO
Hunspell 1.3.2
og
& og 10 0: org, tog, log, dog, bog, os, or, ok, om, Ogna
mitt
& mitt 15 0: nitt, mutt, mott, mit, smitt, mett, mint, mist, sitt, ritt, titt, matt, milt, litt, kitt
min
*
-----------------------------

This shows that 'og' and 'mitt' were not in the dictionary, but 'min' was. Sure enough, they are missing from 'nb_NO.dic'.

I noticed that the previous version (2.0.10) of the dictionary was 4.9 MB, whereas the 2.1 version is only 3.9 MB. Comparing the two files with diffstat:

diff -u 2.0/nb_NO.dic 2.1/nb_NO.dic | diffstat
nb_NO.dic |77164 +++-----------------------------------------------------------
1 file changed, 4500 insertions(+), 72664 deletions(-)

there are a LOT of removed words.

Redhat linux has had this bug report for some time about the new version missing a lot of words: https://bugzilla.redhat.com/show_bug.cgi?id=1055112

Has this issue been noticed before? Any plans to release an update?

Thanks,
Wade

This message is a reminder that Fedora 20 is nearing its end of life. Approximately 4 (four) weeks from now Fedora will stop maintaining and issuing updates for Fedora 20. It is Fedora's policy to close all bug reports from releases that are no longer maintained. At that time this bug will be closed as EOL if it remains open with a Fedora 'version' of '20'.

Package Maintainer: If you wish for this bug to remain open because you plan to fix it in a currently maintained version, simply change the 'version' to a later Fedora version.

Thank you for reporting this issue and we are sorry that we were not able to fix it before Fedora 20 is end of life. If you would still like to see this bug fixed and are able to reproduce it against a later version of Fedora, you are encouraged to change the 'version' to a later Fedora version before this bug is closed as described in the policy above.

Although we aim to fix as many bugs as possible during every release's lifetime, sometimes those efforts are overtaken by events. Often a more recent Fedora release includes newer upstream software that fixes bugs or makes them obsolete.

This is still happening in fedora 22, but it doesn't appear I have rights to change the bug to that version.

Is upstream just totally dead here, no new release since 2012 ? Is there any other group who have emerged to provide more comprehensive dictionaries ? Presumably https://addons.mozilla.org/en-us/firefox/addon/norsk-bokm%C3%A5l-ordliste/ is just the same dictionaries rebundled ?

I did some testing with firefox: the dictionary from hunspell-nb shows the same errors: the above words were counted as misspelled. Then I installed the dictionary that was linked in the previous comment: it worked as expected.
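
The manual check described above (looking for 'og' and 'mitt' in 'nb_NO.dic' and comparing the 2.0.10 and 2.1 word lists) could be scripted roughly as follows. This is only a sketch: the function names are hypothetical, the sample entries and flags are made up for illustration, and the parser assumes the common .dic layout of a count on the first line followed by `stem/FLAGS` entries. It ignores escaped slashes and space-separated morphological fields. Note also that Hunspell can accept words produced by affix rules or compounding, so a missing stem is a strong hint rather than proof that a word is rejected.

```python
def parse_stems(dic_text):
    """Return the set of stems found in the text of a Hunspell .dic file."""
    lines = dic_text.splitlines()
    # The first line of a .dic file is normally an (approximate) entry count.
    if lines and lines[0].strip().isdigit():
        lines = lines[1:]
    stems = set()
    for line in lines:
        entry = line.strip()
        if not entry:
            continue
        # Drop tab-separated extra fields, then the affix flags after "/".
        stem = entry.split("\t", 1)[0].split("/", 1)[0]
        if stem:
            stems.add(stem)
    return stems


def missing_words(stems, words):
    """Return the words (in input order) that are not present as stems."""
    return [word for word in words if word not in stems]


def removed_stems(old_stems, new_stems):
    """Return, sorted, the stems present in the old list but not the new one."""
    return sorted(old_stems - new_stems)


if __name__ == "__main__":
    # Made-up miniature dictionaries standing in for the 2.0.10 and 2.1 files.
    old_dic = "4\nog/A\nmitt\nmin/B\nhus\n"
    new_dic = "2\nmin/B\nhus\n"

    old = parse_stems(old_dic)
    new = parse_stems(new_dic)

    print("missing from new:", missing_words(new, ["og", "mitt", "min"]))
    print("removed since old:", removed_stems(old, new))
```

To run this against real files, read each 'nb_NO.dic' with the encoding named by the SET line of the matching .aff file and pass the text to `parse_stems`.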
There were some follow-on comments from my post to the Norwegian spelling mailing list. The relevant thread is: https://lister.ping.uio.no/pipermail/i18n-no/2015-March/008238.html

They suggest it may be a problem with the build system. Also, it only seems to be a problem when using hunspell; aspell and ispell seem to work fine. I'll have to do more testing in that regard.

aspell functioned as expected:

----------------------
dnf install aspell aspell-no
# aspell -a -d no
@(#) International Ispell Version 3.1.20 (but really Aspell 0.60.6.1)
jeg
*
og
*
oss
*
seg
*
samme
*
mitt
*
min
*
bread
& bread 6 0: beråd, bred, bredd, breid, bre ad, bre-ad
--------------

But, it looks like they are two different dictionaries...

Looks like debian (sid) is still shipping the old version (2.0.10): https://packages.debian.org/source/sid/norwegian

hunspell-no-2.0.10-1.fc22 has been submitted as an update for Fedora 22. https://admin.fedoraproject.org/updates/hunspell-no-2.0.10-1.fc22

hunspell-no-2.0.10-1.fc21 has been submitted as an update for Fedora 21. https://admin.fedoraproject.org/updates/hunspell-no-2.0.10-1.fc21

I've verified that the 2.0.10 build works on fedora 22. Would it be possible to send this update to el7 as well?

el7 is not fedora and thus requires a whole different pile of process. You can file the bug for that against RHEL-7 and I can approve that I could fix it, but after that it's out of my hands whether it will be fixed.

Package hunspell-no-2.0.10-1.fc21:
* should fix your issue,
* was pushed to the Fedora 21 testing repository,
* should be available at your local mirror within two days.
Update it with:
# su -c 'yum update --enablerepo=updates-testing hunspell-no-2.0.10-1.fc21'
as soon as you are able to.
Please go to the following url: https://admin.fedoraproject.org/updates/FEDORA-2015-11711/hunspell-no-2.0.10-1.fc21 then log in and leave karma (feedback).

hunspell-no-2.0.10-1.fc22 has been pushed to the Fedora 22 stable repository. If problems still persist, please make note of it in this bug report.