Red Hat Bugzilla – Bug 1475575
Create global spelling checker for use by all GLS authors
Last modified: 2018-01-02 18:22:51 EST
Description of problem:
Getting authors to consistently run spelling checkers using an accepted dictionary is difficult. Different tools, different skills, different focus, etc. Needs to be easy and relatively fool-proof. Needs to use a standard, readily available tool, with a standard dictionary as a back end.
aspell has limitations, as does hunspell. Customization is a possibility.
Need to investigate possibilities, prototype, and roll out something that works, and make it part of the standard workflow.
There should be very few spelling mistakes in material received by technical editors. Currently, a lot of material has very basic errors, things that a simple scan or check would have found.
(See also http://etherpad.corp.redhat.com/GLS-global-spelling-checker for background information.)
On 07/26/2017 11:40 PM, Morgan Weetman wrote:
> Apologies for the lateness of the update
> - Process to generate custom dictionaries in aspell working
> - hyphen can be added as a valid word character, however, *all* hyphenated
> words would need to be added to the dictionary as aspell will no longer treat
> the individual parts on either side of the hyphen as whole words.
This is probably not an issue? Hyphenation is difficult, and people get
it wrong all the time. The rules are vague and hard to find. I'd be
happier adding known accepted hyphenation than letting incorrect
hyphenations live in the dictionary.
> - Couldn't get aspell to accept words with numbers like SHA-1
No suggestions atm... Ask google?
> So progress but not perfection as yet
cool. I'm starting right now to look at words that don't belong in the
dictionary, etc. Will get back to you.
A few example words to not include. These are surprisingly hard to find.
Currently testing process to d/l default dictionary files, import, and configure.
d/l, configure, etc., works ok.
More testing "live" to weed out words that don't belong and ensure only valid suggestions are made.
Currently in testing with curric-devel.
Dave, I've assigned you QA because you're pretty new to this sort of thing. Please yell about any issues.
Production repo now established:
Configuration updated to allow for words with numbers (e.g., 3scale).
Personal dictionaries disabled (set to /dev/null).
Process for adding words = raise PR or email David or Morgan.
This has been tested, is classified as "works", and is in production. Being brand new, it's bound to have a few wrinkles but for the most part, it's fine.