earth2studio.data: Data Sources#

Data sources used for downloading, caching and reading different weather / climate data APIs into Xarray data arrays. Used for fetching initial conditions for inference and validation data for scoring.

Warning

Each data source provided in Earth2Studio may have its own respective license. We encourage users to familiarize themselves with each and the limitations it may impose on their use case.

Region
Data Class
Product

data.ARCO

Analysis-Ready, Cloud Optimized (ARCO) is a data store of ERA5 re-analysis data currated by Google.

data.CDS

The climate data source (CDS) serving ERA5 re-analysis data.

data.CMIP6

CMIP6 data source for Earth2Studio.

data.CMIP6MultiRealm

CMIP6 data source for Earth2Studio with multiple realms.

data.GFS

The global forecast service (GFS) initial state data source provided on an equirectangular grid.

data.GOES

GOES (Geostationary Operational Environmental Satellite) data source.

data.HRRR

High-Resolution Rapid Refresh (HRRR) data source provides hourly North-American weather analysis data developed by NOAA (used to initialize the HRRR forecast model).

data.IFS

Integrated forecast system (IFS) HRES initial state (analysis) data source on an equirectangular grid at 0.25 degree resolution.

data.IFS_ENS

Integrated forecast system (IFS) ensemble (ENS) initial state data source on an equirectangular grid at 0.25 degree resolution.

data.JPSS

JPSS VIIRS data source for NOAA-20, NOAA-21, and Suomi-NPP supporting both SDR (L1) and EDR (L2) products.

data.MRMS

NOAA Multi-Radar/Multi-Sensor (MRMS) products via AWS S3.

data.NCAR_ERA5

ERA5 data provided by NSF NCAR via the AWS Open Data Sponsorship Program.

data.PlanetaryComputerECMWFOpenDataIFS

IFS analysis data from the ECMWF Open Data repository.

data.PlanetaryComputerGOES

GOES-R ABI L2 Cloud and Moisture Imagery on Planetary Computer.

data.PlanetaryComputerMODISFire

MODIS Thermal Anomalies/Fire Daily (FireMask, MaxFRP, QA).

data.PlanetaryComputerOISST

Daily 0.25° NOAA Optimum Interpolation SST from Microsoft Planetary Computer.

data.PlanetaryComputerSentinel3AOD

Sentinel-3 SYNERGY Level-2 aerosol optical depth and surface reflectance.

data.Random

A randomly generated normally distributed data.

data.WB2ERA5

ERA5 reanalysis data with several derived variables on a 0.25 degree lat-lon grid from 1959 to 2023 (incl) to 6 hour intervals on 13 pressure levels.

data.WB2ERA5_121x240

ERA5 reanalysis data with several derived variables down sampled to a 1.5 degree lat-lon grid from 1959 to 2023 (incl) to 6 hour intervals on 13 pressure levels.

data.WB2ERA5_32x64

ERA5 reanalysis data with several derived variables down sampled to a 5.625 degree lat-lon grid from 1959 to 2023 (incl) to 6 hour intervals on 13 pressure levels.

data.WB2Climatology

Climatology provided by WeatherBench2,

data.DataArrayFile

A local xarray dataarray file data source.

data.DataSetFile

A local xarray dataset file data source.

data.DataArrayPathList

A local xarray dataarray directory data source that handles multiple files.

earth2studio.data: AI Sources#

Data sources that leverage an AI model to generate weather / climate data that can be used for downstream tasks in real time. Unlike prognostic or diagnostic models, these sources do not require any input state for subsequent predictions.

Region
Data Class
Product

data.CBottle3D(core_model, sst_ds[, ...])

Climate in a bottle data source Climate in a Bottle (cBottle) is an AI model for emulating global km-scale climate simulations and reanalysis on the equal-area HEALPix grid.