Cleveland Institute for Computational Biology

Informatics Team

Jonathan Haines, PhD, Director
David Kaelber, MD, PhD, Co-Director

Cleveland Institute for Computational Biology logo

The Cleveland Institute for Computational Biology (CICB) will advance our fundamental knowledge of human biology through the application of computational methods to large and diverse datasets. Further, the CICB will promote the translation of this knowledge into better diagnosis, prognosis, treatment, prevention and delivery of healthcare.

National COVID Cohort Collaborative (N3C)

National COVID Cohort Collaborative (N3C) logo

The National COVID Cohort Collaborative (N3C) is a collaboration between the Center for Data to Health (CD2H) and National Center for Advancing Translational Sciences (NCATS) for the purpose of creating the NIH COVID-19 Data Warehouse.

The NIH COVID-19 Data Warehouse will be an NIH data sharing resource containing clinical and imaging data from individuals who have received a Coronavirus Disease 2019 (“COVID-19”) test or whose symptoms are consistent with COVID-19. Data will also be collected from individuals infected with pathogens such as SARS 1, MERS, and H1N1 to support comparative studies. The data warehouse will accelerate discovery using near real-time data and observational research, clinical studies and clinical trials, and will bring these data to bear on COVID-19. This solution requires a single computational environment to support public health actions, clinical care, policy, and science underpinning vaccines, prophylaxis, and treatment. The data warehouse will be highly interoperable, secure, clinical data research environment that will harmonize clinical and patient data. The NIH COVID-19 Data Warehouse will serve as a national resource to address the COVID-19 pandemic, as well as demonstrate coordination of clinical and patient data, setting an example for how to approach urgent research needs during future healthcare challenges.

NIH intends to provide access to and use by researchers for public health purposes and decision making, including conducting and supporting research to define the clinical natural history of COVID-19 infection and assess therapeutic responses and outcomes, and conducting and supporting a broad range of studies, including the identification of COVID-19 risk factors and development of effective countermeasures and diagnostics. Importantly, the underlying data model to which data from OMOP, PCORNet, i2b2/ACT, and TriNetX will be ETL’d (extract-transformed-loaded), harmonized and aggregated is the OMOP common data model.

The CICB has engaged with the N3C via participation in the Data Ingestion & Harmonization work stream and the Phenotype & Data Acquisition work stream.

OnCore Enterprise Research

The OnCore Enterprise Research system is a clinical trial management system that integrates information systems and common operational components with an appropriate balance of centralization vs. decentralization.


LabmatrixTM is available to all members of the CWRU and affiliated hospitals research community and is housed inside of the CICB SHED (Securely Held Electronic Data) within the CWRU Secure Research Environment (SRE). Access is tightly controlled via role-based security and 2-factor authentication.

Computational Biology Core

The Computational Biology Core is a collaborative and fee for service facility within the CICB aimed at providing computational and analytical support for a variety of research questions. The Core supports standardized and custom pipelines for data processing and analysis of next-generation sequencing data.