Bug 861318

Summary: RFE: Book checking tool to include taxonomy, blacklists & grammar
Product: [Community] PressGang CCMS Reporter: David Ryan <dryan>
Component: Web-UIAssignee: pressgang-ccms-dev
Status: CLOSED CURRENTRELEASE QA Contact:
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 1.0CC: lnewson, mcaspers, rlandman
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2014-06-22 23:30:38 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
Taxonomy guide none

Description David Ryan 2012-09-28 06:59:50 UTC
Created attachment 618439 [details]
Taxonomy guide

Q: Is it possible to check grammar, based on a package or our own custom lists?

Use case:

I just saw a minor typo in the EAP 6 user guide, where a word following a period at the end of a sentence was in lower case. I fixed this, but was thinking that Microsoft Word users take this kind of grammar check for granted. We have spell checking in Pressgang, but is there:

A) some open source grammar checking tool we can use
B) a way to create these rules ourselves

Q: Can we create taxonomy blacklists?

Use case:

I was reading Ludek Janda's tech talk on "The Accidental Taxonomist", and noticed on page 15 how he had:

"Controlled Vocabulary / Authority File" with "Restricted list of words". This is an implementation of what I asked about not long after I started, in terms of using basic analytics to wash our content against exclusions, or to direct "like content" into preferable phrasing. 

He also talks about their efforts towards a thesaurus, in terms of a "Controlled dictionary with detailed information about relationships among terms." Page 20 shows the language and shorthand that they use, with examples on page 21. Do we have a similar approach in the roadmap? Any tech that Ludek uses that we can implement?

Comment 1 Matthew Casperson 2013-03-06 06:22:17 UTC
Technically you can use any browser extension when checking spelling/grammar. There are quite a few (Google "Chrome Grammar Extension" to get an idea). I personally quite like After The Deadline (http://afterthedeadline.com/) because it ignores XML tags.

As it turns out After The Deadline also has an open source server component, which I believe could provide a common dictionary/thesaurus. But implementing this is not on any current road map.

Comment 2 Matthew Casperson 2013-06-17 21:26:56 UTC
This is now largely implemented through the use of custom dictionaries.

Blacklists are handled through the https://github.com/pressgang-ccms/PressGangCCMSUI/blob/Development/src/main/webapp/javascript/typojs/en_US-neg.aff and https://github.com/pressgang-ccms/PressGangCCMSUI/blob/Development/src/main/webapp/javascript/typojs/en_US-neg.dic files. New words can be added to https://github.com/pressgang-ccms/PressGangCCMSUI/blob/Development/src/main/webapp/javascript/typojs/en_US.dic.

More complete grammar checking is still only available through external browser extensions.

Comment 3 David Ryan 2014-03-12 06:58:15 UTC
I'm going to throw this to VERIFIED.

Partly because you nailed the implementation of what's in PressGang now, and partly because I'm happy with the cultural shift we've had here. Nice work.