The dataregistry package

The tutorials for the dataregistry package are Jupyter notebooks, hosted in the docs/source/tutorial_notebooks directory of the repository.

The notebooks can be run interactively at NERSC (see here for more info), or viewed on GitHub as standalone reference material.
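
For readers working from a local clone of the repository, the available notebooks can be listed with standard-library Python. This is only a minimal sketch: the function name list_tutorial_notebooks and its repo_root argument are illustrative assumptions and are not part of the dataregistry package. Only the directory path is taken from the text above.

```python
from pathlib import Path

# Relative location of the tutorial notebooks, as stated in the text above.
NOTEBOOK_DIR = Path("docs/source/tutorial_notebooks")


def list_tutorial_notebooks(repo_root="."):
    """Return the sorted file names of the .ipynb files in the tutorial directory.

    `repo_root` is assumed to be the path of a local clone of the repository
    (hypothetical argument, for illustration only). An empty list is returned
    if the tutorial directory does not exist under `repo_root`.
    """
    notebook_dir = Path(repo_root) / NOTEBOOK_DIR
    if not notebook_dir.is_dir():
        return []
    return sorted(p.name for p in notebook_dir.glob("*.ipynb"))


if __name__ == "__main__":
    # Print one notebook file name per line, assuming the current
    # working directory is the root of the clone.
    for name in list_tutorial_notebooks():
        print(name)
```

Run from the root of the clone, the script prints one notebook file name per line. Passing a different path as repo_root points it at a clone located elsewhere.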

1) Getting started with the data registry

Getting started: Part 1 - Registering datasets: Link to notebook

Getting started: Part 2 - Simple queries: Link to notebook

2) Advanced tutorials

A deeper dive into datasets: Link to notebook

Pipeline datasets: Link to notebook

The production schema: Link to notebook

Using GCRCatalogs with dataregistry: Link to notebook