Literature Lab: a method of automated literature interrogation to infer biology from microarray analysis.

Publication Type: Journal Article
Year of Publication: 2007
Authors: Febbo PG, Mulligan MG, Slonina DA, Stegmaier K, Di Vizio D, Martinez PR, Loda M, Taylor SC
Journal: BMC Genomics
Volume: 8
Pagination: 461
Date Published: 2007 Dec 18
ISSN: 1471-2164
Keywords: Animals; Biology; Breast Neoplasms; Databases, Genetic; Female; Humans; Immunohistochemistry; Male; Microarray Analysis; Neoplasms; Pattern Recognition, Automated; Prostatic Neoplasms; Software
Abstract

BACKGROUND: The biomedical literature is a rich source of associative information but too vast for complete manual review. We have developed an automated method of literature interrogation called "Literature Lab" that identifies and ranks associations existing in the literature between gene sets, such as those derived from microarray experiments, and curated sets of key terms (e.g., pathway names and Medical Subject Headings (MeSH) terms).

RESULTS: Literature Lab was developed using differentially expressed gene sets from three previously published cancer experiments and tested on a fourth, novel gene set. When applied to the gene sets from the published data, which included an in vitro experiment, an in vivo mouse experiment, and an experiment with human tumor samples, Literature Lab correctly identified known biological processes occurring within each experiment. When applied to a novel set of genes differentially expressed between locally invasive and metastatic prostate cancer, Literature Lab identified a strong association between the pathway term "FOSB" and genes with increased expression in metastatic prostate cancer. Immunohistochemistry subsequently confirmed increased nuclear FOSB staining in metastatic compared to locally invasive prostate cancers.

CONCLUSION: This work demonstrates that Literature Lab can discover key biological processes by identifying meritorious associations between experimentally derived gene sets and key terms within the biomedical literature.
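
The abstract does not describe the scoring procedure, so the following is only a toy illustration of the general idea of ranking key terms by their co-occurrence with a gene set in text. It is not the Literature Lab algorithm. The function name rank_terms, the whitespace tokenization, and the sample sentences are all assumptions invented for this sketch; the sentences are not real abstracts or findings.

```python
from collections import Counter


def rank_terms(abstracts, gene_set, terms):
    """Rank single-word terms by the number of abstracts in which the
    term appears together with at least one gene from gene_set.

    Hypothetical helper for illustration only; matching is a naive,
    case-insensitive comparison of whitespace-separated tokens.
    """
    genes = {g.lower() for g in gene_set}
    counts = Counter()
    for text in abstracts:
        words = set(text.lower().split())
        if not genes & words:
            continue  # no gene from the set is mentioned in this abstract
        for term in terms:
            if term.lower() in words:
                counts[term] += 1
    # most_common() returns (term, count) pairs, highest count first
    return counts.most_common()


# Made-up sentences standing in for literature abstracts.
toy_abstracts = [
    "FOSB expression rises with JUN in metastasis",
    "FOSB and metastasis in prostate tumors",
    "JUN regulates apoptosis",
    "MYC drives proliferation and metastasis",
    "unrelated text about apoptosis",
]

ranking = rank_terms(
    toy_abstracts,
    gene_set=["FOSB", "JUN"],
    terms=["metastasis", "apoptosis", "proliferation"],
)
print(ranking)
```

A real system would also need gene synonym handling, multi-word term matching, and some normalization for how often each term appears in the literature overall; none of that is attempted here.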

DOI: 10.1186/1471-2164-8-461
Alternate Journal: BMC Genomics
PubMed ID: 18088408
Grant List: CA89031 / CA / NCI NIH HHS / United States; CA123175 / CA / NCI NIH HHS / United States
Related Faculty: Massimo Loda, M.D.