Neurobagel graph data files
Overview
Using the Neurobagel CLI (see also the section on the CLI),
a Neurobagel data dictionary (.json
) for a dataset can be processed together with the corresponding tabular data (.tsv
) and BIDS dataset (if available) to generate subject-level linked data that can be encoded in a knowledge graph.
The Neurobagel graph-ready data are stored in JSON-LD format (.jsonld
),
and include a representation of each subject's harmonized phenotypic properties and imaging metadata.
Another way to think about the difference between a Neurobagel data dictionary and a graph-ready .jsonld
data file is this:
more than one dataset can theoretically have the same data dictionary (if the tabular data include the same columns and unique column values),
but the .jsonld
file for each dataset is unique as long as the actual data of subjects differs across datasets.
Example .jsonld
files
Depending on whether a dataset annotated using Neurobagel includes BIDS imaging data,
the .jsonld
data for the dataset may or may not include imaging metadata of subjects (extracted automatically with the CLI).