Digitizing omics profiles by divergence from a baseline.

TitleDigitizing omics profiles by divergence from a baseline.
Publication TypeJournal Article
Year of Publication2018
AuthorsDinalankara W, Ke Q, Xu Y, Ji L, Pagane N, Lien A, Matam T, Fertig EJ, Price ND, Younes L, Marchionni L, Geman D
JournalProc Natl Acad Sci U S A
Volume115
Issue18
Pagination4545-4552
Date Published2018 05 01
ISSN1091-6490
KeywordsComputational Biology, Data Interpretation, Statistical, Databases, Genetic, Gene Expression Profiling, Genomics, High-Throughput Nucleotide Sequencing, Humans, MicroRNAs, Neoplasms, Precision Medicine, Proteomics
Abstract

Data collected from omics technologies have revealed pervasive heterogeneity and stochasticity of molecular states within and between phenotypes. A prominent example of such heterogeneity occurs between genome-wide mRNA, microRNA, and methylation profiles from one individual tumor to another, even within a cancer subtype. However, current methods in bioinformatics, such as detecting differentially expressed genes or CpG sites, are population-based and therefore do not effectively model intersample diversity. Here we introduce a unified theory to quantify sample-level heterogeneity that is applicable to a single omics profile. Specifically, we simplify an omics profile to a digital representation based on the omics profiles from a set of samples from a reference or baseline population (e.g., normal tissues). The state of any subprofile (e.g., expression vector for a subset of genes) is said to be "divergent" if it lies outside the estimated support of the baseline distribution and is consequently interpreted as "dysregulated" relative to that baseline. We focus on two cases: single features (e.g., individual genes) and distinguished subsets (e.g., regulatory pathways). Notably, since the divergence analysis is at the individual sample level, dysregulation can be analyzed probabilistically; for example, one can estimate the probability that a gene or pathway is divergent in some population. Finally, the reduction in complexity facilitates a more "personalized" and biologically interpretable analysis of variation, as illustrated by experiments involving tissue characterization, disease detection and progression, and disease-pathway associations.

DOI10.1073/pnas.1721628115
Alternate JournalProc Natl Acad Sci U S A
PubMed ID29666255
PubMed Central IDPMC5939095
Grant ListR01 CA177669 / CA / NCI NIH HHS / United States
R01 CA200859 / CA / NCI NIH HHS / United States
U01 CA196390 / CA / NCI NIH HHS / United States
Related Faculty: 
Luigi Marchionni, M.D., Ph.D.

Pathology & Laboratory Medicine 1300 York Avenue New York, NY 10065 Phone: (212) 746-6464
Surgical Pathology: (212) 746-2700