Bug 2218460

Summary: en_US.dic virtually empty in hunspell-en-US-0.20201207-5.fc39
Product: [Fedora] Fedora Reporter: Paul Howarth <paul>
Component: hunspell-enAssignee: libreoffice-sig
Status: CLOSED RAWHIDE QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: medium Docs Contact:
Priority: unspecified    
Version: rawhideCC: dani, gnome-sig, gwync, libreoffice-sig, mclasen, mfabian, rhughes, rstrode
Target Milestone: ---Keywords: Regression
Target Release: ---   
Hardware: Unspecified   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2023-07-05 18:25:29 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Paul Howarth 2023-06-29 08:31:06 UTC
The US English dictionary is much smaller in -5 (built with hunspell) than -4 (built with aspell):

$ rpm -qlpv hunspell-en-US-0.20201207-4.fc39.noarch.rpm
drwxr-xr-x    2 root     root                        0 Feb 22 00:00 /usr/share/doc/hunspell-en-US
-rw-r--r--    1 root     root                    15719 Feb 22 00:00 /usr/share/doc/hunspell-en-US/README_en_US.txt
-rw-r--r--    1 root     root                     3091 Feb 22 00:00 /usr/share/hunspell/en_US.aff
-rw-r--r--    1 root     root                   551962 Feb 22 00:00 /usr/share/hunspell/en_US.dic

$ rpm -qlpv hunspell-en-US-0.20201207-5.fc39.noarch.rpm
drwxr-xr-x    2 root     root                        0 Jun 28 01:00 /usr/share/doc/hunspell-en-US
-rw-r--r--    1 root     root                    15719 Jun 28 01:00 /usr/share/doc/hunspell-en-US/README_en_US.txt
-rw-r--r--    1 root     root                     3091 Jun 28 01:00 /usr/share/hunspell/en_US.aff
-rw-r--r--    1 root     root                      151 Jun 28 01:00 /usr/share/hunspell/en_US.dic

This is manifesting as build failures on packages that use this dictionary, e.g.:
https://kojipkgs.fedoraproject.org/work/tasks/4944/102724944/build.log for perl-MouseX-ConfigFromFile. I'm seeing a similar issue with my own local builds of perl-Text-SpellChecker.


Reproducible: Always

Steps to Reproduce:
1. Try a local mock build for Rawhide of perl-Text-SpellChecker

Actual Results:  
$ make test TEST_VERBOSE=1
PERL_DL_NONLAZY=1 "/usr/bin/perl" "-MExtUtils::Command::MM" "-MTest::Harness" "-e" "undef *Test::Harness::Switches; test_harness(1, 'blib/lib', 'blib/arch')" t/*.t
t/aspell.t .... 
1..10
ok 1 - use Text::SpellChecker;
ok 2 - object creation
ok 3 # skip Text::Aspell not installed
ok 4 # skip Text::Aspell not installed
ok 5 # skip Text::Aspell not installed
ok 6 # skip Text::Aspell not installed
ok 7 # skip Text::Aspell not installed
ok 8 # skip Text::Aspell not installed
ok 9 - freezing, thawing
ok 10 - freezing, thawing
ok

#   Failed test 'Iterator'
#   at t/hunspell.t line 16.

#   Failed test 'replacement'
#   at t/hunspell.t line 25.
# Looks like you failed 2 tests of 6.
t/hunspell.t .. 
1..6
ok 1 - use Text::SpellChecker;
ok 2 - object creation
ok 3 - Catching English word
not ok 4 - Iterator
ok 5 - suggestions
not ok 6 - replacement
Dubious, test returned 2 (wstat 512, 0x200)
Failed 2/6 subtests 
t/pod.t ....... 
1..1
ok 1 - POD test for blib/lib/Text/SpellChecker.pm
ok

Test Summary Report
-------------------
t/hunspell.t (Wstat: 512 (exited 2) Tests: 6 Failed: 2)
  Failed tests:  4, 6
  Non-zero exit status: 2
Files=3, Tests=17,  0 wallclock secs ( 0.01 usr  0.00 sys +  0.12 cusr  0.01 csys =  0.14 CPU)
Result: FAIL
Failed 1/3 test programs. 2/17 subtests failed.
make: *** [Makefile:772: test_dynamic] Error 255


Expected Results:  
Test suite passes (which it did with -4).

It looks like wordlist-rel-2020.12.07/scowl/speller/munch-list explicitly uses aspell.

Comment 1 Mike FABIAN 2023-07-03 14:17:47 UTC
Yes, I just ran into this as well. Caused the build of ibus-typing-booster on rawhide to fail.

```
mfabian@fedora:/usr/share/hunspell
$ cat en_US.dic 
25
0/nm
0th/pt
1/n1
1st/p
1th/tc
2/nm
2nd/p
2th/tc
3/nm
3rd/p
3th/tc
4/nm
4th/pt
5/nm
5th/pt
6/nm
6th/pt
7/nm
7th/pt
8/nm
8th/pt
9/nm
9th/pt
e.g.
i.e.
mfabian@fedora:/usr/share/hunspell
$ cat /etc/fedora-release 
Fedora release 39 (Rawhide)
mfabian@fedora:/usr/share/hunspell
$ 
```

This commit in hunspell-en package breaks it:

https://src.fedoraproject.org/rpms/hunspell-en/c/a5d6fd95d1a110f9fc9fdcc7c8f2d53269f3f137?branch=rawhide

Comment 2 danigm 2023-07-05 06:44:59 UTC
This is breaking also the tests in rpmlint: https://github.com/rpm-software-management/rpmlint/issues/1082

Comment 3 Jens Petersen 2023-07-11 05:50:05 UTC
Looks good now, thank you!