Derived data products from GBIF snapshots.
Processed GBIF occurrence data hexed at H3 resolutions 0-10, plus derived products.
GBIF releases new snapshots periodically. Releases are versioned by year-month (e.g., 2025-06) so multiple versions can coexist in the bucket during transitions. Always use the most recent release for new work.
Path: s3://public-gbif/2025-06/hex/
Global GBIF occurrences partitioned by H3 resolution 0 cell. 94 non-empty h0 partitions covering the globe.
gbifid, datasetkey, occurrenceid, kingdom, phylum, class, order, family, genus, species, infraspecificepithet, taxonrank, scientificname, verbatimscientificname, verbatimscientificnameauthorshipcountrycode, locality, stateprovince, occurrencestatus, individualcount, decimallatitude, decimallongitude, coordinateuncertaintyinmeters, coordinateprecision, elevation, depth, eventdate, day, month, year, taxonkey, specieskey, basisofrecord, institutioncode, collectioncode, catalognumber, recordnumber, identifiedby, dateidentified, license, rightsholder, recordedby, typestatus, establishmentmeans, lastinterpreted, mediatype, issue, geomh0 (VARCHAR, partition key), h1-h10 (UBIGINT)Path: s3://public-gbif/hex/ — DEPRECATED, do not use for new work.
This is the original hex partition with all h0-h11 columns stored as VARCHAR strings. It is retained only for backward compatibility with existing applications. It will be removed once no active apps depend on it. See GitHub issue for tracking.
File: s3://public-gbif/redlined_cities_gbif.parquet
Spatial join of GBIF occurrences with "Mapping Inequality" (Redlining) polygons for US cities.
Schema: gbifid, scientificname, kingdom, phylum, class, order, family, genus, species, recordedby, date, coordinateuncertaintyinmeters, city, state, grade (A-D), residential, commercial, industrial
Prefix: s3://public-gbif/taxonomy/ (partitioned by h0)
Aggregated counts of taxa within H3 resolution 0 hexagons.
Schema: scientificname, kingdom, phylum, ... species, n (count), h0
File: s3://public-gbif/taxa.parquet
Reference list of all taxa found in the dataset.