DataLad is used as an infrastructure component or utility in a growing number of services and software packages:
- The OpenNeuro data portal uses DataLad for logistics, and provides a service to access all datasets on the platform as DataLad datasets via GitHub.
- DataLad is a supported data provider on the CBRAIN platform at the McGill Centre for Integrative Neuroscience.
- The DICOM conversion tool HeuDiConv uses DataLad to organize and version control its output datasets.
- The CBBS imaging platform builds on DataLad to manage data capture, management, and analysis workflows for neuroimaging research.
- There is an ongoing collaboration with the brainlife.io platform to create a unique marketplace that aids discoverability and re-use of publicly available datasets. Hundreds of DataLad datasets are discoverable and immediately usable via the BrainLife dataset search.
- In its SGA3 funding period, the Human Brain Project is supporting developments to establish interoperability between DataLad and its EBRAINS infrastructure.
- The Virtual Brain Cloud consortium supports the development of FAIR DataLad-based processing workflows and software adaptors for interoperability with its cloud infrastructure for biomedical research and clinical decision-making.
- The proposed NFDI-Neuro consortium to establish a national research data infrastructure for neurosciences in Germany builds on a DataLad-based technical backbone, the NFDI-Neuro Common Infrastructure (COIN).
- The DFG collaborative research center (CRC) 1451 on “Key Mechanisms of Motor Control in Health and Disease” at the University of Cologne (and its partners) uses DataLad as key technology for its information management needs.
- DataLad is the data management back-end of the remote monitoring platform JTrack for collecting data from a large number of wearable devices for the purpose of developing digital biomarkers for neurological and psychiatric diseases.
DataLad and its datasets are compatible with a number of existing portals and storage solutions:
- DataLad datasets can be hosted on GIN, and any data hosted on GIN can be accessed via DataLad. Moreover, the GIN service is also available for local deployment, offering a convenient, in-house storage back-end and web UI for DataLad datasets.
- With the datalad-osf extension package, DataLad datasets, with all data file content and version history, can be hosted on the Open Science Framework (example study).
- The datalad-ukbiobank extension package represents UK Biobank data as extensible DataLad datasets that can monitor future updates.
- DataLad datasets can be hosted (without git-annex'ed data) on any Git hosting portal such as GitHub and GitLab.
- DataLad datasets can be "linked" to a wide range of data hosting portals (e.g., AWS S3 and Glacier, Box.com and many others) to offload data online.