Jean-Eudes Dazard, PhD

Assistant Professor
Center for Proteomics and Bioinformatics
School of Medicine
Assistant Professor
Department of Nutrition
School of Medicine
Biostatistics Consultant
Biostatistics and Bioinformatics Core Facility
Case Comprehensive Cancer Center
Member
Case Comprehensive Cancer Center

Research Information

Research Interests

I was initially trained in Computer Science, then in Molecular Biology and Bioinformatics, and more recently in Statistics. This mixed background has allowed me to engage in research and teaching activities at the interface of these three areas. Over the last 15 years, and especially since I joined the faculty at CWRU, I have witnessed tremendous changes in these disciplines, not only within each of them but also in how they intersect and complement each other. Now more than ever is an exciting time to work in the evolving field of Data Science and to create new and powerful ways to study and model complex systems and phenomena such as those in genomics, proteomics, metabolomics and biomedical science.

My research focuses on real-world problems in computational and statistical biology, with emphasis on developing data mining methods for high-dimensional data, mostly from high-throughput technologies. Because of my early background in computer science, I have always devoted time to developing computational resources such as web tools and software. Recently, I secured NIH funding to support a long-term research project on so-called “Survival Bump Hunting for Identifying and Characterizing Informative Diagnostics and Prognostics Subgroups of Patients in High Dimensional Data”, with direct clinical implications in precision medicine as well as early detection and intervention. So far, this has led to the publication of a few articles, a book chapter, websites and software (see references in sections below and following links).

Google Scholar Link

Research Projects

Conventional statistical models are inappropriate for large datasets in which the number of variables exceeds the number of observations (the so-called p >> n paradigm). This is a challenging problem that carries severe risks of model overfitting and statistical errors. Particular issues posed by high-dimensional data are the control of error rates due to the inherent noise of the technologies employed, the multicollinearity of predictors due to the parallel nature of variable interrogation, and the sparsity of informative predictors, since the number of variables interrogated is massive compared to the few variables actually at play.
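
To make the error-rate issue concrete, here is a minimal sketch in Python, assuming purely simulated data. The sizes (20 observations, 2000 variables), the 0.5 threshold and all names are illustrative assumptions, not taken from any of the studies described here. The outcome and every predictor are independent noise, yet some predictors still appear strongly correlated with the outcome simply because so many are screened.

```python
import random

random.seed(0)  # fixed seed so the run is reproducible

n_obs, n_vars = 20, 2000  # hypothetical sizes chosen so that p >> n

# Outcome and predictors are independent noise: no predictor is truly informative.
outcome = [random.gauss(0, 1) for _ in range(n_obs)]
predictors = [[random.gauss(0, 1) for _ in range(n_obs)] for _ in range(n_vars)]

def pearson(x, y):
    """Sample Pearson correlation between two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

# Screen every noise predictor against the outcome.
correlations = [abs(pearson(col, outcome)) for col in predictors]
max_abs_corr = max(correlations)
n_strong = sum(c > 0.5 for c in correlations)

print(f"largest |correlation| among {n_vars} noise predictors: {max_abs_corr:.2f}")
print(f"noise predictors with |correlation| > 0.5: {n_strong}")
```

Any predictor flagged here is a false positive by construction, which is why error-rate control matters when variables vastly outnumber observations.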

My research focus is in Computational/Statistical Biology, with emphasis on developing data mining methods for high-dimensional data (p >> n paradigm) as generated by high-throughput metabolomics, proteomics and sequencing technologies.

Recent work has focused on:

  1. Bump Hunting in Classification, Regression and Survival settings for High-Dimensional Data: General applications are in identifying, predicting and characterizing informative subgroups. One method currently under development is “Survival Bump Hunting”, where the outcome of interest is a time-to-event, possibly censored. An application is in identifying and characterizing informative prognostic subgroups of patients for risk and reliability analysis. Direct clinical implications are in improved diagnostic and prognostic tools for personalized medicine as well as early diagnosis, prevention and intervention.
  2. Model Selection and Predictive Modeling (Bayesian and Frequentist) applied to Differential Expression, Association and Interaction Problems: Recent discoveries were made in genetic association, biomarker discovery, and proteomics interaction studies.
  3. Data Integration of High-Dimensional Data using either Statistical or Network-Based Models.
  4. Regularization and Variance Stabilization of High-Dimensional Data:  Statistical Computing and Software Development; Resampling and Monte-Carlo methods; Parallel Computing and Computational Complexity; Source Code Management and Collaborative Software Development (GitHub).
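
As a rough illustration of the bump hunting idea in item 1, the sketch below runs a generic greedy peeling loop in the spirit of the Patient Rule Induction Method (PRIM) on simulated data with an uncensored, continuous response. It is not the Survival Bump Hunting method itself. The toy data, the function names (make_row, best_peel, peel) and the parameters alpha and min_support are all assumptions made for this example.

```python
import random

random.seed(1)  # fixed seed so the run is reproducible

# Hypothetical toy data: 5 predictors in [0, 1]. The response is elevated only
# in the corner x0 > 0.7 and x1 > 0.7; the other three predictors are noise.
def make_row():
    x = [random.random() for _ in range(5)]
    signal = 1.0 if (x[0] > 0.7 and x[1] > 0.7) else 0.0
    return x, signal + random.gauss(0, 0.1)

rows = [make_row() for _ in range(500)]

def mean_response(box):
    """Average response of the points currently inside the box."""
    return sum(y for _, y in box) / len(box)

def best_peel(box, alpha):
    """Try removing a fraction alpha of points from each end of each predictor
    and return the candidate that leaves the highest mean response."""
    k = max(1, int(alpha * len(box)))
    best = None
    for j in range(len(box[0][0])):
        ordered = sorted(box, key=lambda row: row[0][j])
        for side, kept in (("lower", ordered[k:]), ("upper", ordered[:-k])):
            candidate = (mean_response(kept), j, side, kept)
            if best is None or candidate[0] > best[0]:
                best = candidate
    return best

def peel(rows, alpha=0.1, min_support=0.1):
    """Peel greedily until one more peel would drop below the minimum support."""
    box, steps = rows, []
    while len(box) - max(1, int(alpha * len(box))) >= min_support * len(rows):
        mean_y, j, side, box = best_peel(box, alpha)
        steps.append((j, side, len(box), mean_y))
    return box, steps

final_box, steps = peel(rows)
overall_mean = mean_response(rows)
final_mean = mean_response(final_box)

for j, side, size, mean_y in steps:
    print(f"peeled {side} end of x{j}: {size} points left, mean response {mean_y:.3f}")
print(f"overall mean {overall_mean:.3f}, final box mean {final_mean:.3f}")
```

The box is represented simply as the list of points it still contains. A fuller implementation would also record the box boundaries and choose the number of peeling steps by resampling rather than by a fixed support threshold.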

GitHub Link

Awards and Honors

Sir Charles Clore Fellow
2001
Weizmann Institute of Science, Rehovot, Israel
French Society of Dermatology (SFD) and L’Oreal Award (€45,000)
2000
French Society of Dermatology (SFD) and L’Oreal, Paris, France

External Appointments

  • 2013-2018: NIH NCI Research Grant Principal Investigator (R01 CA 160593)
  • 2003-2006: NIH NCI CoGEC Training Grant Fellow (R25 CA 094186)

Publications

View All Publications

Publications In Preparation or Submitted

  • DAZARD J-E., CHOE M., PAWITAN Y., RAO J.S. Identification and Characterization of Informative Prognostic Subgroups by Survival Bump Hunting. (in prep 2018).
  • DAZARD J-E., RAO J.S. Variable Selection Strategies for High-Dimensional Survival Bump Hunting using Recursive Peeling Methods. (in prep 2018).
  • DIAZ D.A., RAO J.S., DAZARD J-E. On the Explanatory Power of Principal Components. (in prep 2018). Archives of Cornell University Library
  • ZHANG Z., DAZARD J-E., BEBEK G. N-Node Subnetwork Enumerating Algorithm (N-SEA) Identifies Lower Grade Glioma Subtypes with Altered Subnetworks and Distinct Prognostics (in prep 2018).

Recent Publications

Book Chapters

  • DIAZ D.A., DAZARD J-E., RAO J.S. "Unsupervised Bump Hunting Using Principal Components". In: Ahmed SE, editor. Big and Complex Data Analysis: Methodologies and Applications. Contributions to Statistics, vol. Edited Refereed Volume. Cham Heidelberg New York: Springer. ISBN 978-3-319-41573-4. PMCID: pending. Archives of Cornell University Library.

Software