BioDataHub IT Infrastructure

Networked Systems for Data Protection-Compliant Research

The IT infrastructure of the DKTK BioDataHub forms the technical backbone for secure, standardized, and efficient access to biospecimen information and associated clinical oncology data.

Its objective is to enable researchers to use clinical and research data in a consistent and efficient manner while ensuring the highest standards of data protection at all times.
Federated Data Architecture

Each site operates a so-called Bridgehead, which serves as a local data hub.
This federated approach allows decentralized information (e.g., from tumor documentation systems, biobanks, or research projects) to be made available across the consortium without requiring physical data transfer.
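The federated principle can be illustrated with a minimal sketch. It assumes a simplified model in which each site answers a query with an aggregate count only, and the record-level data never leaves the site. The names `Site`, `count`, and `federated_count` are hypothetical and chosen for illustration; they are not part of the Bridgehead software.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Record = Dict[str, str]


@dataclass
class Site:
    """Hypothetical stand-in for one site's local data hub."""
    name: str
    records: List[Record]  # record-level data, kept locally

    def count(self, predicate: Callable[[Record], bool]) -> int:
        # Evaluate the query locally and expose only the aggregate.
        return sum(1 for record in self.records if predicate(record))


def federated_count(sites: List[Site],
                    predicate: Callable[[Record], bool]) -> Dict[str, int]:
    """Send the same query to every site and collect per-site counts."""
    return {site.name: site.count(predicate) for site in sites}
```

In this sketch the coordinator only ever sees a mapping from site name to count, which is the kind of result a feasibility query needs.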

Technologies and Tools Supporting Research

To advance translational research, the BioDataHub develops practice-oriented software solutions:

Tools for Researchers

  • Federated Search: Web application for exploratory, cross-site queries of patient data and biospecimens – ideal for feasibility analyses (BioDataHub Explorer)
  • Cohort Dashboards: Overview and monitoring of ongoing biospecimen collections, for example in the projects EXLIQUID and DKTK Organoid-Plattform
  • Federated Analysis: Secure, data protection–compliant analyses using DataSHIELD, where analytical algorithms are brought to the data – not vice versa.
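The idea of bringing the algorithm to the data can be sketched as follows. This is an illustration of the general principle only and not the DataSHIELD API (DataSHIELD itself is an R-based framework). The function names `local_summary` and `pooled_mean` are hypothetical. Each site computes a non-identifying summary locally, and only these summaries are combined centrally.

```python
from typing import Iterable, List, Tuple

Summary = Tuple[float, int]  # (sum of values, number of values)


def local_summary(values: Iterable[float]) -> Summary:
    """Runs at a site: reduce local values to a sum and a count."""
    values = list(values)
    return (sum(values), len(values))


def pooled_mean(summaries: List[Summary]) -> float:
    """Runs centrally: combine per-site summaries into an overall mean."""
    total = sum(s for s, _ in summaries)
    n = sum(n for _, n in summaries)
    if n == 0:
        raise ValueError("no observations across sites")
    return total / n
```

The pooled result equals the mean over all individual values, although the central step never receives them.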

Data Protection and Security

  • Mainzelliste: Pseudonymization and record linkage for the secure handling of personal data.
  • Authentication Service: Unified and secure access (single sign-on) to all applications.
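As a rough illustration of pseudonymization with record linkage, the sketch below derives a stable pseudonym from normalized identifying attributes using a keyed hash. This is an assumption made for illustration and not Mainzelliste's actual algorithm or API; the function names, the chosen fields, and the 16-character truncation are hypothetical.

```python
import hashlib
import hmac


def normalize(value: str) -> str:
    """Lower-case and collapse whitespace so trivial variants match."""
    return " ".join(value.strip().lower().split())


def pseudonym(secret_key: bytes, first_name: str, last_name: str,
              birth_date: str) -> str:
    """Derive a deterministic pseudonym from identifying attributes.

    The same person (after normalization) always maps to the same
    pseudonym, which links records without exposing the identity.
    """
    message = "|".join(
        [normalize(first_name), normalize(last_name), birth_date.strip()]
    ).encode("utf-8")
    digest = hmac.new(secret_key, message, hashlib.sha256).hexdigest()
    return digest[:16]  # hypothetical truncation for readability
```

Exact matching on normalized fields is the simplest possible linkage rule; a production service would typically also have to tolerate typos and changed names.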

Data Integration and Interoperability 

  • Samply.Bridgehead: Local data hub operated at each site.
  • Samply.Blaze: FHIR-based data repository enabling structured and interoperable data storage.
  • Samply.Beam: Secure data communication between sites.
  • Samply.TransFair and oBDS2FHIR-Pipeline: Tools for data integration and quality assurance through centralized, standardized mappings.
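Because the repository is FHIR-based, it can in principle be queried with standard FHIR search. The sketch below builds a search URL that asks only for a result count (`_summary=count`) and reads the `total` field of the returned Bundle. The base URL used in the usage note and the helper names are assumptions for illustration, and no request is actually sent.

```python
from typing import Any, Dict
from urllib.parse import urlencode


def count_query_url(base_url: str, resource_type: str,
                    params: Dict[str, str]) -> str:
    """Build a FHIR search URL that requests only the match count."""
    query = dict(params)
    query["_summary"] = "count"  # standard FHIR search result parameter
    return f"{base_url.rstrip('/')}/{resource_type}?{urlencode(query)}"


def total_from_bundle(bundle: Dict[str, Any]) -> int:
    """Extract the match count from a FHIR search-result Bundle."""
    if bundle.get("resourceType") != "Bundle":
        raise ValueError("expected a FHIR Bundle")
    return int(bundle["total"])
```

For example, `count_query_url("http://localhost:8080/fhir", "Patient", {"gender": "female"})` produces a URL for counting female patients on a hypothetical local server.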

All software components are released under open-source licenses to ensure transparency, reusability, and long-term sustainability (https://github.com/samply/). 


Contact: For questions regarding the BioDataHub’s IT activities, please contact Prof. Dr. Martin Lablans (Heidelberg).
 

Documents & Data Protection Concepts (in German):