Bug 2273614

Summary: (Geo)Parquet vector driver not compiled in
Product: [Fedora] Fedora Reporter: Cedric Rossi <cedric>
Component: gdalAssignee: Sandro Mani <manisandro>
Status: CLOSED ERRATA QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: medium Docs Contact:
Priority: unspecified    
Version: 39CC: alex, devrim, manisandro, neteler, pavel.lisy, praiskup
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Linux   
URL: https://packages.fedoraproject.org/pkgs/gdal/gdal/fedora-39-updates.html
Whiteboard:
Fixed In Version: gdal-3.8.5-2.fc40 gdal-3.7.3-5.fc39 Doc Type: ---
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2024-04-23 01:14:51 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Cedric Rossi 2024-04-05 10:26:38 UTC
On a freshly installed Fedora 39 box, gdal is unable to open a parquet file (an exemple parquet file is here: https://github.com/opengeospatial/geoparquet/blob/main/examples/example.parquet)

> $ gdalinfo --version
> GDAL 3.7.3, released 2023/10/30

> $ gdalinfo ~/Downloads/example.parquet
> ERROR 4: `/home/cedricr/Downloads/example.parquet' not recognized as a supported file format.

And `gdal-config --formats` does not list the parquet format.

The required dependencies appear missing in the build.log (at least for x86_64) https://kojipkgs.fedoraproject.org//packages/gdal/3.7.3/4.fc39/data/logs/x86_64/build.log

> -- Found the Arrow shared library: /usr/lib64/libarrow.so.1300.0.0
> -- Found the Arrow import library: ARROW_IMPORT_LIB-NOTFOUND
> -- Found the Arrow static library: 
> -- Could NOT find Parquet (missing: Parquet_DIR)
> -- Could NOT find ArrowDataset (missing: ArrowDataset_DIR)

And in the "The following features have been disabled:" part
>  * ogr_PARQUET, Parquet
>  * Parquet
>    Apache Parquet C++ library
>  * ArrowDataset
>    Apache ArrowDataset C++ library

Note that the ArrowDataset library is required for at least some of Geoparquet support.
https://gdal.org/drivers/vector/parquet.html#dataset-partitioning-read-support

I've also looked at the Fedora 40 build log which are similar on those points.

The Parquet (and GeoParquet) file format is increasingly important in data science and in geographic information system. In the current state, QGIS (as installed via dnf) can’t open them either, as it uses gdal to do it.


Reproducible: Always

Comment 1 Fedora Update System 2024-04-14 21:21:43 UTC
FEDORA-2024-f01c22b4e8 (gdal-3.8.5-2.fc40) has been submitted as an update to Fedora 40.
https://bodhi.fedoraproject.org/updates/FEDORA-2024-f01c22b4e8

Comment 2 Fedora Update System 2024-04-14 21:21:44 UTC
FEDORA-2024-6d87487515 (gdal-3.7.3-5.fc39) has been submitted as an update to Fedora 39.
https://bodhi.fedoraproject.org/updates/FEDORA-2024-6d87487515

Comment 3 Fedora Update System 2024-04-15 01:11:59 UTC
FEDORA-2024-6d87487515 has been pushed to the Fedora 39 testing repository.
Soon you'll be able to install the update with the following command:
`sudo dnf upgrade --enablerepo=updates-testing --refresh --advisory=FEDORA-2024-6d87487515`
You can provide feedback for this update here: https://bodhi.fedoraproject.org/updates/FEDORA-2024-6d87487515

See also https://fedoraproject.org/wiki/QA:Updates_Testing for more information on how to test updates.

Comment 4 Fedora Update System 2024-04-15 02:19:21 UTC
FEDORA-2024-f01c22b4e8 has been pushed to the Fedora 40 testing repository.
Soon you'll be able to install the update with the following command:
`sudo dnf upgrade --enablerepo=updates-testing --refresh --advisory=FEDORA-2024-f01c22b4e8`
You can provide feedback for this update here: https://bodhi.fedoraproject.org/updates/FEDORA-2024-f01c22b4e8

See also https://fedoraproject.org/wiki/QA:Updates_Testing for more information on how to test updates.

Comment 5 Cedric Rossi 2024-04-16 10:30:16 UTC
Tested the Fedora 39 version -- works perfectly with the example geoparquet file at https://github.com/opengeospatial/geoparquet/blob/main/examples/example.parquet

```
$ ogrinfo --formats | grep -i parquet
  Parquet -vector- (rw+v): (Geo)Parquet

$ ogrinfo example.parquet
INFO: Open of `example.parquet'
      using driver `Parquet' successful.
1: example (Multi Polygon)
```

I can also confirm that QGIS can now open the file.

Many thanks!

Comment 6 Fedora Update System 2024-04-23 01:14:51 UTC
FEDORA-2024-f01c22b4e8 (gdal-3.8.5-2.fc40) has been pushed to the Fedora 40 stable repository.
If problem still persists, please make note of it in this bug report.

Comment 7 Fedora Update System 2024-04-23 01:19:54 UTC
FEDORA-2024-6d87487515 (gdal-3.7.3-5.fc39) has been pushed to the Fedora 39 stable repository.
If problem still persists, please make note of it in this bug report.