Before signing up for the Data Catalog Collaboration Project (DCCP), please review all of the resources below. They are designed to help new institutions navigate their way through the data catalog code, documentation, and workflows. Please see our requirements for joining the DCCP before reaching out.

If you are interested in joining the DCCP as a collaborating member, please contact:

Kevin Read, DCCP Project Lead:

Open source code

The DCCP has made all of its code available on GitHub. We encourage other institutions looking to make their data discoverable to implement the code and join the DCCP.

The DCCP has created an open sandbox to interact with the Data Catalog as if it were installed locally at your institution. Within it you can create new metadata and add records for datasets. The system resets every evening at midnight so your entries will not be stored.

test data catalog



The DCCP ensures that the most recent version of its metadata is always available. We adapt the metadata as each institution encounters new use cases.  The metadata is described in two key documents:

Master Metadata Schema

  • Contains our most up to date metadata schema used by the DCCP

Metadata Documentation

  • Provides details on how to curate biomedical datasets using the metadata schema. This document provides instructions on how to enter metadata for each element.

quality control

To ensure that the Data Catalog is kept up to date, the metadata is accurate, and the technology is functioning correctly, we have developed comprehensive quality control documentation that includes guidance and scheduled maintenance. 

data catalog faq

For new and interested members of the DCCP, we have detailed FAQ's to help implement the Data Catalog, contact researchers, and curate data.