earth2studio.data: Data Sources

Data sources download, cache, and read data from different weather and climate data APIs into Xarray data arrays. They are used to fetch initial conditions for inference and validation data for scoring.

Warning

Each data source provided in Earth2Studio may have its own license. We encourage users to familiarize themselves with each license and the limitations it may impose on their use case.

data.ARCO([cache, verbose])

Analysis-Ready, Cloud Optimized (ARCO) is a data store of ERA5 re-analysis data curated by Google.

data.CDS([cache, verbose])

The Climate Data Store (CDS) data source serving ERA5 re-analysis data.

data.GFS([cache, verbose])

The Global Forecast System (GFS) initial state data source provided on an equirectangular grid.

data.HRRR([cache, verbose])

High-Resolution Rapid Refresh (HRRR) is a North American weather forecast model with hourly data assimilation, developed by NOAA.

data.IFS([cache, verbose])

The Integrated Forecasting System (IFS) initial state data source provided on an equirectangular grid.

data.Random(domain_coords)

A data source that generates random, normally distributed data.

data.DataArrayFile(file_path)

A local Xarray DataArray file data source.

data.DataSetFile(file_path, array_name)

A local Xarray Dataset file data source.
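
To make the idea of a data source more concrete, the sketch below mimics what a source like `data.Random` might do, assuming a source is invoked with a list of times and a list of variable names (as the `fetch_data(source, time, variable, ...)` signature suggests) and returns an Xarray data array. The class name `RandomSketch`, the call signature, the dimension order, and the variable names `t2m` and `u10m` are assumptions made for illustration. This is not the library's implementation.

```python
from collections import OrderedDict
from datetime import datetime

import numpy as np
import xarray as xr


class RandomSketch:
    """Hypothetical stand-in for a random data source (not the library class)."""

    def __init__(self, domain_coords: OrderedDict):
        # e.g. OrderedDict(lat=..., lon=...) describing the spatial grid
        self.domain_coords = domain_coords

    def __call__(self, time, variable) -> xr.DataArray:
        # Assumed layout: leading "time" and "variable" dims, then the domain dims
        coords = OrderedDict(time=time, variable=variable)
        coords.update(self.domain_coords)
        shape = [len(c) for c in coords.values()]
        values = np.random.randn(*shape)  # standard normal samples
        return xr.DataArray(values, dims=list(coords), coords=coords)


source = RandomSketch(
    OrderedDict(
        lat=np.linspace(90, -90, 5),
        lon=np.linspace(0, 360, 8, endpoint=False),
    )
)
da = source([datetime(2024, 1, 1)], ["t2m", "u10m"])
print(da.dims, da.shape)
```

Because every source hands back the same kind of labelled array, downstream code can select by coordinate name (for example `da.sel(variable="t2m")`) regardless of where the data came from.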

Functions

data.fetch_data(source, time, variable[, ...])

Utility function to fetch data for models and load data on the target device.

data.prep_data_array(da[, device])

Prepares a data array from a data source for inference workflows by converting the data array to a torch tensor and the coordinate system to an OrderedDict.
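
The conversion described for `data.prep_data_array` can be pictured as splitting a labelled array into raw values plus an ordered coordinate mapping. The following is a minimal sketch under stated assumptions: `prep_sketch` is a hypothetical helper, and a NumPy array stands in for the torch tensor so that the example needs only NumPy and Xarray. It is not the library's implementation.

```python
from collections import OrderedDict

import numpy as np
import xarray as xr


def prep_sketch(da: xr.DataArray):
    """Hypothetical helper: split a DataArray into raw values and ordered coords."""
    # Keep coordinates in the same order as the array's dimensions
    coords = OrderedDict((dim, da.coords[dim].values) for dim in da.dims)
    # NumPy array used here as a stand-in for the torch tensor
    return da.values, coords


da = xr.DataArray(
    np.arange(6.0).reshape(1, 2, 3),
    dims=["variable", "lat", "lon"],
    coords={"variable": ["t2m"], "lat": [10.0, -10.0], "lon": [0.0, 120.0, 240.0]},
)
values, coords = prep_sketch(da)
print(values.shape, list(coords))
```

In a PyTorch workflow, the raw array would typically be wrapped with `torch.from_numpy(values)` and moved to the target device with `.to(device)`. The ordered mapping keeps track of which axis corresponds to which coordinate once the labels are stripped from the array.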