Page Last Updated: May 22, 2026

Software & Analytic Standards🔗

The following procedures are performed to support transparency, reproducibility, and standard environments for our databases and systems controls:

Processing & Analytic Standards🔗

See the full details on the HBCD Data Release Docs site here. This includes the use of standardized processing pipelines for MRI, EEG, and other modalities, as well as the use of standardized analytic approaches across sites. Independent code review is performed for all processing pipelines via the NMIND Coding Standards Checklist, with a focus on ensuring that the code is well-documented, reproducible, and follows best practices in software development. This includes version control of all internal and public-facing code bases, including the documentation websites.

New Processing Pipeline Integration🔗

Integrating new pipelines into HBCD’s file-based data processing workflow involves three main phases (further broken down into 6 stages below):

Approval: Review and sign-off to begin integration testing
Integration & Testing: Implementation and evaluation against HBCD Pipeline & Derivative Data Standards
Final Review: Workgroup subject matter experts (SMEs) and leads provide consultation and sign-off

HBCD Pipeline Integration Proposal (H-PIP) Stages

Stage	Description/Actions	Responsible Group(s)	Requirements
1. Approval	Senior stakeholder approves H-PIP testing via MRI WG consensus. Define scope and request initial timeline.	Senior stakeholder & MRI WG	—
2. Scoping	Refine proposed timeline and scope for integration.	Informatics Core & Developers	—
3. Integration	Pipeline integrated within Informatics Core.	Erik Lee, Tim Hendrickson, Sriharshitha Anuganti, pipeline developers	• Containerized • BIDSified • Independent per-session processing • NMIND bronze certified
4. Testing	Pipeline tested	FAB (Eric Feczko's Lab)Eric Feczko (PI), Begim Fayzullobekova, rae McCollum, Jacob Lundquist, Michael Anderson	• Validity • Reliability • Analytic reproducibility
5. Review	Findings presented to MRI WG for sign-off.	MRI WG	—
6. SME Consult	Findings presented to SME WG for additional consultation and final approval.	SME WG	—

Data Storage and Backup🔗

Large data files are stored in MSI's Tier 2 Ceph storage system, while the LORIS systems operate off of the storage in MSI's OpenStack cloud environment.
Research data is encrypted and backed up to AWS Deep Glacier on a nightly basis.
The LORIS systems (prod, sandbox, and staging) have their database backed up on a nightly basis to MSI Tier 2 storage.
Currently all data is held, although we are in the final phases of determining a system for retiring some nightly backups after six months, reducing to retaining weekly backups for another three months and then monthly after nine total months.

Code Versioning🔗

All code, both LORIS customizations and Puppet orchestration code, are maintained in the University of Minnesota's local Enterprise GitHub. These code bases are kept in separate internal GitHub organizations with different memberships and access controls. The orchestration code is also subject to our Internal Change Control process that requires a change to get a separate approval and verification from someone other than the implementer to ensure safety and reliability.

Access Controls🔗

Data access is modified through the same Internal Change Control process governing orchestration changes. All change requests must be signed off on by a senior staff manner and are vetted by data stewards or managers of the projects.