The dataregistry package

The tutorials for the dataregistry package are provided as Jupyter notebooks, hosted in the docs/source/tutorial_notebooks directory of the repository.

The notebooks can be run interactively at NERSC (see here for more info), or viewed on GitHub as standalone reference material.

See also the simple examples of queries and dataset registration on the usage page, which you may be able to adapt to your needs.

1) Getting started with the data registry

Getting started: Part 1 - Registering datasets: Link to notebook

Getting started: Part 2 - Simple queries: Link to notebook

2) Advanced tutorials

A deeper dive into datasets: Link to notebook

Pipeline datasets: Link to notebook

The production schema: Link to notebook

Using GCRCatalogs with dataregistry: Link to notebook