Bug 2388144
| Summary: | Review Request: python-datasets - HuggingFace community-driven open-source library of datasets | ||||||
|---|---|---|---|---|---|---|---|
| Product: | [Fedora] Fedora | Reporter: | Alexander Lent <lx> | ||||
| Component: | Package Review | Assignee: | Tom.Rix | ||||
| Status: | CLOSED ERRATA | QA Contact: | Fedora Extras Quality Assurance <extras-qa> | ||||
| Severity: | medium | Docs Contact: | |||||
| Priority: | medium | ||||||
| Version: | rawhide | CC: | package-review, Tom.Rix | ||||
| Target Milestone: | --- | Keywords: | AutomationTriaged | ||||
| Target Release: | --- | Flags: | Tom.Rix:
fedora-review+
|
||||
| Hardware: | All | ||||||
| OS: | Linux | ||||||
| URL: | https://github.com/huggingface/datasets | ||||||
| Whiteboard: | |||||||
| Fixed In Version: | Doc Type: | --- | |||||
| Doc Text: | Story Points: | --- | |||||
| Clone Of: | Environment: | ||||||
| Last Closed: | 2025-08-23 05:35:25 UTC | Type: | --- | ||||
| Regression: | --- | Mount Type: | --- | ||||
| Documentation: | --- | CRM: | |||||
| Verified Versions: | Category: | --- | |||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||
| Embargoed: | |||||||
| Attachments: |
|
||||||
|
Description
Alexander Lent
2025-08-13 02:09:25 UTC
Copr build: https://copr.fedorainfracloud.org/coprs/build/9406399 (succeeded) Review template: https://download.copr.fedorainfracloud.org/results/@fedora-review/fedora-review-2388144-python-datasets/fedora-rawhide-x86_64/09406399-python-datasets/fedora-review/review.txt Please take a look if any issues were found. --- This comment was created by the fedora-review-service https://github.com/FrostyX/fedora-review-service If you want to trigger a new Copr build, add a comment containing new Spec and SRPM URLs or [fedora-review-service-build] string. Looks good, a few things.
python3-datasets.noarch: E: non-executable-script /usr/lib/python3.14/site-packages/datasets/commands/datasets_cli.py 644 /usr/bin/env python
python3-datasets.noarch: E: non-executable-script /usr/lib/python3.14/site-packages/datasets/utils/_filelock.py 644 /usr/bin/env python
Likely you can/should remove the first line on both of these scripts.
Package Review
==============
Legend:
[x] = Pass, [!] = Fail, [-] = Not applicable, [?] = Not evaluated
[ ] = Manual review needed
===== MUST items =====
Generic:
[x]: Package is licensed with an open-source compatible license and meets
other legal requirements as defined in the legal section of Packaging
Guidelines.
[x]: License field in the package spec file matches the actual license.
Note: Checking patched sources after %prep for licenses. Licenses
found: "Unknown or generated", "*No copyright* Apache License 2.0",
"*No copyright* Apache License", "Apache License 2.0". 102 files have
unknown license. Detailed output of licensecheck in /sfs/fedora-
review/review-python-datasets/licensecheck.txt
[x]: Package must own all directories that it creates.
Note: Directories without known owners: /usr/lib/python3.14/site-
packages, /usr/lib/python3.14
[x]: Package contains no bundled libraries without FPC exception.
[x]: Changelog in prescribed format.
[x]: Sources contain only permissible code or content.
[x]: Macros in Summary, %description expandable at SRPM build time.
Note: Macros in: python3-datasets (description)
[-]: Package contains desktop file if it is a GUI application.
[-]: Development files must be in a -devel package
[-]: Package uses nothing in %doc for runtime.
[x]: Package consistently uses macros (instead of hard-coded directory
names).
[x]: Package is named according to the Package Naming Guidelines.
[x]: Package does not generate any conflict.
[x]: Package obeys FHS, except libexecdir and /usr/target.
[-]: If the package is a rename of another package, proper Obsoletes and
Provides are present.
[x]: Requires correct, justified where necessary.
[x]: Spec file is legible and written in American English.
[-]: Package contains systemd file(s) if in need.
[x]: Package is not known to require an ExcludeArch tag.
[-]: Large documentation must go in a -doc subpackage. Large could be size
(~1MB) or number of files.
Note: Documentation size is 11249 bytes in 1 files.
[x]: Package complies to the Packaging Guidelines
[x]: Package successfully compiles and builds into binary rpms on at least
one supported primary architecture.
[x]: Package installs properly.
[x]: Rpmlint is run on all rpms the build produces.
Note: There are rpmlint messages (see attachment).
[x]: If (and only if) the source package includes the text of the
license(s) in its own file, then that file, containing the text of the
license(s) for the package is included in %license.
[x]: The License field must be a valid SPDX expression.
[x]: Package requires other packages for directories it uses.
[x]: Package does not own files or directories owned by other packages.
[x]: Package uses either %{buildroot} or $RPM_BUILD_ROOT
[x]: Package does not run rm -rf %{buildroot} (or $RPM_BUILD_ROOT) at the
beginning of %install.
[x]: Dist tag is present.
[x]: Package does not contain duplicates in %files.
[x]: Permissions on files are set properly.
[x]: Package must not depend on deprecated() packages.
[x]: Package use %makeinstall only when make install DESTDIR=... doesn't
work.
[x]: Package is named using only allowed ASCII characters.
[x]: Package does not use a name that already exists.
[x]: Package is not relocatable.
[x]: Sources used to build the package match the upstream source, as
provided in the spec URL.
[x]: Spec file name must match the spec package %{name}, in the format
%{name}.spec.
[x]: File names are valid UTF-8.
[x]: Packages must not store files under /srv, /opt or /usr/local
Python:
[x]: Python eggs must not download any dependencies during the build
process.
[x]: A package which is used by another package via an egg interface should
provide egg info.
[x]: Package meets the Packaging Guidelines::Python
[x]: Package contains BR: python2-devel or python3-devel
[x]: Packages MUST NOT have dependencies (either build-time or runtime) on
packages named with the unversioned python- prefix unless no properly
versioned package exists. Dependencies on Python packages instead MUST
use names beginning with python2- or python3- as appropriate.
[x]: Python packages must not contain %{pythonX_site(lib|arch)}/* in %files
[x]: Binary eggs must be removed in %prep
===== SHOULD items =====
Generic:
[x]: If the source package does not include license text(s) as a separate
file from upstream, the packager SHOULD query upstream to include it.
[x]: Final provides and requires are sane (see attachments).
[x]: Package functions as described.
[x]: Latest version is packaged.
[x]: Package does not include license text files separate from upstream.
[ ]: Sources are verified with gpgverify first in %prep if upstream
publishes signatures.
Note: gpgverify is not used.
[ ]: Package should compile and build into binary rpms on all supported
architectures.
[x]: %check is present and all tests pass.
[x]: Packages should try to preserve timestamps of original installed
files.
[x]: Reviewer should test that the package builds in mock.
[x]: Buildroot is not present
[x]: Package has no %clean section with rm -rf %{buildroot} (or
$RPM_BUILD_ROOT)
[x]: No file requires outside of /etc, /bin, /sbin, /usr/bin, /usr/sbin.
[x]: Packager, Vendor, PreReq, Copyright tags should not be in spec file
[x]: Sources can be downloaded from URI in Source: tag
[x]: SourceX is a working URL.
[x]: Spec use %global instead of %define unless justified.
===== EXTRA items =====
Generic:
[x]: Rpmlint is run on all installed packages.
Note: There are rpmlint messages (see attachment).
[x]: Spec file according to URL is the same as in SRPM.
Rpmlint
-------
Checking: python3-datasets-4.0.0-1.fc43.noarch.rpm
python-datasets-4.0.0-1.fc43.src.rpm
============================ rpmlint session starts ============================
rpmlint: 2.7.0
configuration:
/usr/lib/python3.14/site-packages/rpmlint/configdefaults.toml
/etc/xdg/rpmlint/fedora-spdx-licenses.toml
/etc/xdg/rpmlint/fedora.toml
/etc/xdg/rpmlint/scoring.toml
/etc/xdg/rpmlint/users-groups.toml
/etc/xdg/rpmlint/warn-on-functions.toml
rpmlintrc: [PosixPath('/tmp/tmpkbmyz7mg')]
checks: 32, packages: 2
python-datasets.src: E: spelling-error ('dataloaders', '%description -l en_US dataloaders -> data loaders, data-loaders, freeloaders')
python-datasets.src: E: spelling-error ('pre', '%description -l en_US pre -> per, ore, pee')
python3-datasets.noarch: E: spelling-error ('dataloaders', '%description -l en_US dataloaders -> data loaders, data-loaders, freeloaders')
python3-datasets.noarch: E: spelling-error ('pre', '%description -l en_US pre -> per, ore, pee')
python3-datasets.noarch: E: non-executable-script /usr/lib/python3.14/site-packages/datasets/commands/datasets_cli.py 644 /usr/bin/env python
python3-datasets.noarch: E: non-executable-script /usr/lib/python3.14/site-packages/datasets/utils/_filelock.py 644 /usr/bin/env python
python3-datasets.noarch: W: no-manual-page-for-binary datasets-cli
2 packages and 0 specfiles checked; 6 errors, 1 warnings, 7 filtered, 6 badness; has taken 5.8 s
Rpmlint (installed packages)
----------------------------
============================ rpmlint session starts ============================
rpmlint: 2.7.0
configuration:
/usr/lib/python3.14/site-packages/rpmlint/configdefaults.toml
/etc/xdg/rpmlint/fedora-spdx-licenses.toml
/etc/xdg/rpmlint/fedora.toml
/etc/xdg/rpmlint/scoring.toml
/etc/xdg/rpmlint/users-groups.toml
/etc/xdg/rpmlint/warn-on-functions.toml
checks: 32, packages: 1
python3-datasets.noarch: E: spelling-error ('dataloaders', '%description -l en_US dataloaders -> data loaders, data-loaders, freeloaders')
python3-datasets.noarch: E: spelling-error ('pre', '%description -l en_US pre -> per, ore, pee')
python3-datasets.noarch: E: non-executable-script /usr/lib/python3.14/site-packages/datasets/commands/datasets_cli.py 644 /usr/bin/env python
python3-datasets.noarch: E: non-executable-script /usr/lib/python3.14/site-packages/datasets/utils/_filelock.py 644 /usr/bin/env python
python3-datasets.noarch: W: no-manual-page-for-binary datasets-cli
1 packages and 0 specfiles checked; 4 errors, 1 warnings, 3 filtered, 4 badness; has taken 0.3 s
Source checksums
----------------
https://files.pythonhosted.org/packages/source/d/datasets/datasets-4.0.0.tar.gz :
CHECKSUM(SHA256) this package : 9657e7140a9050db13443ba21cb5de185af8af944479b00e7ff1e00a61c8dbf1
CHECKSUM(SHA256) upstream package : 9657e7140a9050db13443ba21cb5de185af8af944479b00e7ff1e00a61c8dbf1
Requires
--------
python3-datasets (rpmlib, GLIBC filtered):
/usr/bin/python3
python(abi)
python3.14dist(dill)
python3.14dist(filelock)
python3.14dist(fsspec)
python3.14dist(fsspec[http])
python3.14dist(huggingface-hub)
python3.14dist(multiprocess)
python3.14dist(numpy)
python3.14dist(packaging)
python3.14dist(pandas)
python3.14dist(pyarrow)
python3.14dist(pyyaml)
python3.14dist(requests)
python3.14dist(tqdm)
python3.14dist(xxhash)
Provides
--------
python3-datasets:
python-datasets
python3-datasets
python3.14-datasets
python3.14dist(datasets)
python3dist(datasets)
Generated by fedora-review 0.10.0 (e79b66b) last change: 2023-07-24
Command line :/usr/bin/fedora-review -n python-datasets
Buildroot used: fedora-rawhide-x86_64
Active plugins: Generic, Shell-api, Python
Disabled plugins: PHP, Haskell, Perl, Ocaml, Java, C/C++, fonts, SugarActivity, R
Disabled flags: EXARCH, EPEL6, EPEL7, DISTTAG, BATCH
SRPM URL: https://gist.github.com/xanderlent/593f010f19cc041495e16570755c858d/raw/de1a48f11b52729e0748fda262c7ee020232f875/python-datasets-4.0.0-1.fc44.src.rpm Spec URL: https://gist.github.com/xanderlent/593f010f19cc041495e16570755c858d/raw/de1a48f11b52729e0748fda262c7ee020232f875/python-datasets.spec I believe this addresses all of the review comments. Please take a look. Created attachment 2104220 [details]
The .spec file difference from Copr build 9406399 to 9452095
Copr build: https://copr.fedorainfracloud.org/coprs/build/9452095 (succeeded) Review template: https://download.copr.fedorainfracloud.org/results/@fedora-review/fedora-review-2388144-python-datasets/fedora-rawhide-x86_64/09452095-python-datasets/fedora-review/review.txt Please take a look if any issues were found. --- This comment was created by the fedora-review-service https://github.com/FrostyX/fedora-review-service If you want to trigger a new Copr build, add a comment containing new Spec and SRPM URLs or [fedora-review-service-build] string. Thanks for the changes. Approved. The Pagure repository was created at https://src.fedoraproject.org/rpms/python-datasets FEDORA-2025-aca7ed02f4 (python-datasets-4.0.0-1.fc44) has been submitted as an update to Fedora 44. https://bodhi.fedoraproject.org/updates/FEDORA-2025-aca7ed02f4 FEDORA-2025-aca7ed02f4 (python-datasets-4.0.0-1.fc44) has been pushed to the Fedora 44 stable repository. If problem still persists, please make note of it in this bug report. |