# skagit-met

Exploring meteorology data for use in hydrologic modeling of the Skagit River basin

## Datasets

| Dataset | Spatial Resolution | Range/Availability | Data Granularity | Temperature | Precipitation | Wind Speed | Relative Humidity | Long Wave Radiation | Short Wave Radiation |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| HRRR | 3 km | 2014 - present | Hourly | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ |
| PRISM | 4 km | 1981 - present | Daily | ✔️ | ✔️ | X | Via Vapor Pressure Deficit | X | X |
| UCLA CMIP-6 (WRF) | 9 km | 1980 - 2100 | Hourly | ✔️ | ✔️ | Via U and V components | Via Specific Humidity | ✔️ | ✔️ |
| ORNL | 4 km | 1980 - 2040 | Daily | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ |
| SNOTEL | Point Data (9 stations) | 2010 - present | Hourly | ✔️ | ✔️ | X | X | X | X |
| PNNL | 6 km | 1981 - 2020 | Hourly | ✔️ | ✔️ | Via U and V components | Via Specific Humidity | ✔️ | ✔️ |
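Several of the "Via ..." entries above are derived rather than stored directly: PRISM's relative humidity comes via vapor pressure deficit, the WRF-based products provide humidity via specific humidity, and wind via U/V components. A minimal sketch of these standard conversions (using the FAO-56 Magnus approximation; the function and variable names are ours, not from this repo):

```python
import numpy as np

def saturation_vapor_pressure_kpa(temp_c):
    """Saturation vapor pressure (kPa), FAO-56 Magnus approximation."""
    return 0.6108 * np.exp(17.27 * temp_c / (temp_c + 237.3))

def rh_from_vpd(temp_c, vpd_kpa):
    """Relative humidity (%) from temperature and vapor pressure deficit (PRISM)."""
    return 100.0 * (1.0 - vpd_kpa / saturation_vapor_pressure_kpa(temp_c))

def rh_from_specific_humidity(temp_c, q_kg_per_kg, pressure_kpa):
    """Relative humidity (%) from specific humidity (WRF/PNNL-style outputs)."""
    # Actual vapor pressure from specific humidity and total pressure
    e_kpa = q_kg_per_kg * pressure_kpa / (0.622 + 0.378 * q_kg_per_kg)
    return 100.0 * e_kpa / saturation_vapor_pressure_kpa(temp_c)

def wind_speed(u, v):
    """Scalar wind speed from U and V components."""
    return np.hypot(u, v)
```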

## Setup

Thanks to Scott, this project works well with pixi, an alternative to conda environments that uses a project-based rather than environment-based paradigm. Pixi isolates and runs entire projects through configuration and lock files that manage all dependencies in one place, backed by the conda-forge and PyPI repositories.

To use pixi, install it on your machine, clone this repo, then run `pixi install` in the root of the repo. If you hit a long compiler error mentioning `clang`, run `export CFLAGS="-Wno-incompatible-function-pointer-types -Wno-implicit-function-declaration"` and try again. (This export is already run as part of the setup script for the conda environment.)

To play around with the Jupyter notebooks, just run `pixi run nb`, and a local instance of Jupyter will launch with all the necessary packages and dependencies. It's quite magic.

To get running with pixi on the DSHydro environment:

1. Run `curl -fsSL https://pixi.sh/install.sh | bash` to install pixi for your user
2. Install the data-download and/or analysis environments using `pixi install -e download-data` and `pixi install -e analysis`
3. To look at the analysis notebook, install the analysis kernel with the following command: `./skagit-met/.pixi/envs/analysis/bin/python3 -m ipykernel install --user --name=skagit_analysis`
4. Once installed, open the Viz.ipynb file and select the `skagit_analysis` kernel

Alternatively, to set up with conda:

Assumptions:

1. You have conda and all its dependencies installed, and a base conda environment going
2. You are using bash; if you're using zsh or another shell, modify line 7 of setup.sh to use your shell's environment before running it

Steps:

1. Clone this repo
2. In the root of the repo, run `./setup.sh`; this creates the conda environment, installs what's needed, etc.
3. Enter the environment using `conda activate skagit-met`
4. When you're done, don't forget to `conda deactivate`

## hrrr_downloader script

To download bulk data, we have a Python module/script that can be run.

It:

1. Downloads select parameters from the HRRR archives over a specified date range, using FastHerbie for parallel downloads
2. Geographically subsets the downloaded data using a provided GeoJSON polygon boundary (see skagit_boundaries.json)
3. Cleans up all downloaded data that falls outside the geographic subset
4. Saves the data as a zarr store to be read and manipulated; see hrr_model_downloader_notebook.ipynb for example usage

When done this way, each day of data takes only a few MB of disk space. A rough sketch of the pipeline is shown below.
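As a hedged sketch of what that pipeline looks like in code (assuming Herbie's FastHerbie interface; the output path, search string, and the simple bounding-box subset are illustrative, not the exact hrrr_downloader internals):

```python
import geopandas as gpd
import pandas as pd
from herbie import FastHerbie

# Basin polygon (see skagit_boundaries.json in this repo)
boundary = gpd.read_file("skagit_boundaries.json")
minx, miny, maxx, maxy = boundary.total_bounds

# One week of hourly HRRR analysis files
dates = pd.date_range("2023-02-01", "2023-02-08", freq="1h")
FH = FastHerbie(dates, model="hrrr", product="sfc")

# Fetch only the GRIB messages matching the search pattern, as xarray
ds = FH.xarray(":TMP:surface")

# Crop to the basin's bounding box; HRRR GRIB longitudes run 0-360,
# so shift the polygon bounds to match before comparing
mask = (
    (ds.latitude >= miny) & (ds.latitude <= maxy)
    & (ds.longitude >= minx % 360) & (ds.longitude <= maxx % 360)
)
ds.where(mask, drop=True).to_zarr("data/hrrr_subset.zarr", mode="w")
```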

To run:

1. Activate the conda environment in the root of this repo (see setup above)
2. Run from the command line, adjusting the dates and parameters as needed: `pixi run hrrr --startDate 2023-02-01 --endDate 2023-02-08 --parameters 'TMP:surface,RH:2 m above ground,WIND:10 m above ground,APCP:surface:0-1 hour acc fcst,DSWRF:surface,DLWRF:surface'` (the pattern syntax is sketched below)
3. For help with parameters, run `pixi run hrrr -h`
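The `--parameters` values are GRIB search patterns. If you're unsure what a pattern will match, Herbie can preview the GRIB index before you commit to a bulk download; a small illustrative example (the date and pattern are assumptions):

```python
from herbie import Herbie

# Preview which GRIB fields 'TMP:surface' matches for one HRRR file
H = Herbie("2023-02-01", model="hrrr", product="sfc")
print(H.inventory("TMP:surface"))  # rows of the GRIB index matching the pattern
```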

## wrf_downloader script

This script downloads and formats bulk, downscaled WRF output data from the UCLA downscaled CMIP6 archive. You can read more about the data tiers and various domains here.

Similar to the hrrr_downloader script, it:

1. Downloads select parameters from the UCLA archives over a specified date range using multi-threaded connections to AWS
2. Geographically subsets the downloaded data using a provided GeoJSON polygon boundary (see skagit_bound_poly.json)
3. Uses masking to establish boundaries (a sketch of this step follows below)
4. Saves the data as a zarr store to be read and manipulated; see WRF_Downloader.ipynb for example usage

All 22 WRF variables come to about 56 MB for a week's worth of data, so roughly 2.9 GB per year as a zarr store for the hourly data.
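As a hedged illustration of the masking step (and of deriving wind speed from U/V per the table above), the logic might look like the following; the zarr path, coordinate names (`lon`/`lat`), and variable names (`u10`/`v10`) are assumptions, not the script's actual internals:

```python
import geopandas as gpd
import numpy as np
import regionmask
import xarray as xr

# Hypothetical output of an earlier download step
ds = xr.open_zarr("data/weather_data/wrf_subset.zarr")

# Derive scalar wind speed from the U and V components (names assumed)
ds["wind_speed"] = np.hypot(ds["u10"], ds["v10"])

# Build a mask from the basin polygon and blank out cells outside it
boundary = gpd.read_file("skagit_bound_poly.json")
mask = regionmask.mask_geopandas(boundary, ds["lon"], ds["lat"])
ds_masked = ds.where(mask.notnull())
```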

To run:

1. Activate the conda environment in the root of this repo (see setup above) or have pixi installed
2. Run from the command line, adjusting the dates and parameters as needed: `python wrf_downloader.py --model cesm2_r11i1p1f1_ssp245 --startDate 2023-01-01 --endDate 2023-01-08`
3. If using pixi, run `pixi run wrf --model cesm2_r11i1p1f1_ssp245 --startDate 2023-01-01 --endDate 2023-01-08 --outputDir data/weather_data/`
4. For help with parameters, run `pixi run wrf -h`

## prism_downloader script

This script downloads and formats bulk PRISM output data from the PRISM archives. You can read more about the data here.

Similar to the other scripts, it:

1. Downloads select parameters from the PRISM archives over a specified date range using multi-threaded connections to the PRISM FTP servers (a sketch of a single-file fetch follows this list)
2. Geographically subsets the downloaded data using a provided GeoJSON
3. Uses masking to establish boundaries
4. Saves the data as a zarr store to be read and manipulated; see PRISM_Downloader.ipynb for example usage
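For a sense of step 1, fetching and opening a single day's file might look like the sketch below; the URL pattern and filename follow PRISM's public archive layout as we understand it, so treat them as assumptions and check the archive for exact names:

```python
import io
import zipfile

import requests
import rioxarray

# One day of daily precipitation; pattern assumed from the PRISM archive layout
url = (
    "https://ftp.prism.oregonstate.edu/daily/ppt/2023/"
    "PRISM_ppt_stable_4kmD2_20230101_bil.zip"
)
resp = requests.get(url, timeout=60)
resp.raise_for_status()

# Each zip holds a .bil raster plus its header/metadata sidecar files
with zipfile.ZipFile(io.BytesIO(resp.content)) as zf:
    zf.extractall("data/prism_tmp")

da = rioxarray.open_rasterio("data/prism_tmp/PRISM_ppt_stable_4kmD2_20230101_bil.bil")
```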

To run:

1. Activate the conda environment in the root of this repo (see setup above) or have pixi installed
2. Run from the command line, adjusting the dates and parameters as needed: `python prism_downloader.py --startDate 2023-01-01 --endDate 2023-01-08 --outputDir data/weather_data/`
3. If using pixi, run `pixi run prism --startDate 2023-01-01 --endDate 2023-01-08 --outputDir data/weather_data/`
4. For help with parameters, run `pixi run prism -h`

## snotel_downloader script

This script downloads and formats bulk SNOTEL data using metloom. You can read more about metloom here.

Similar to the other scripts, it:

1. Downloads select parameters from SNOTEL archives over a specified date range using metloom, given either SNOTEL site IDs or a GeoJSON boundary specifying the area within which to find SNOTEL sites (a sketch of this workflow follows this list)
2. Saves the data as a zarr store to be read and manipulated.
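A hedged sketch of the metloom workflow the script wraps (the GeoJSON filename and variable choices are illustrative assumptions):

```python
from datetime import datetime

import geopandas as gpd
from metloom.pointdata import SnotelPointData

variables = [
    SnotelPointData.ALLOWED_VARIABLES.TEMP,
    SnotelPointData.ALLOWED_VARIABLES.PRECIPITATION,
]

# Find SNOTEL stations whose locations fall inside the boundary polygon
boundary = gpd.read_file("skagit_boundaries.json")
points = SnotelPointData.points_from_geometry(boundary, variables)

for point in points:
    # One pandas DataFrame of daily values per station
    df = point.get_daily_data(datetime(2023, 1, 1), datetime(2023, 1, 8), variables)
```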

To run:

1. Using pixi: `pixi run snotel --frequency daily --startDate 2023-01-01 --endDate 2023-01-08 --outputDir data/weather_data/`
2. For help with parameters, run `pixi run snotel -h`

## Another environment

- CryoCloud: built-in environment to access and manipulate data.
