investigator_user investigator user funding collaborators pending menu bell message arrow_up arrow_down filter layers globe marker add arrow close download edit facebook info linkedin minus plus save share search sort twitter remove user-plus user-minus
  • Project leads
  • Collaborators

Statistical methods for genomic analysis of heterogeneous tumors

Wenyi Wang

0 Collaborator(s)

Funding source

National Institutes of Health (NIH)
Solid tissue samples frequently consist of two distinct compartments, an epithelium-derived tumor and its surrounding stroma. Current analysis of tissue samples composed of both tumor cells and stromal cells may under-detect gene expression signatures associated with cancer prognosis or response to treatment. Modeling the separate tissue compartments is necessary for a better understanding of the biological mechanisms underlying cancer. However, compartmental modeling is difficult from a methodological perspective, and adequate statistical methods have not yet been developed for this purpose. Current methods for in silico separation of expression levels from different compartments of a tissue sample have limited utility as they require previous knowledge of either the various mixing proportions of the patient samples, or the actual expression levels in a few genes (i.e., reference genes) across all tissue compartments. This challenge significantly limits our ability to identify molecular subtypes in both tumor and stroma that are predictive of personalized therapeutic targets. This proposal is to develop novel methods and analytic tools to address these important challenges for the in silico dissection of tumor samples and to demonstrate the utility of these tools by investigating the effect of individual tumor sample components and their interactions with drug treatments for lung cancer. Our Aim 1 will provide a Bayesian hierarchical model and related software tools that will have the ability to computationally "dissect" signals within patient samples. This model will take advantage of all existing data and multiple data types, which consequently reduces the need for the prior knowledge that would otherwise be difficult to obtain. This will enable researchers to investigate the expression profiles of individual tumor tissue and surrounding stromal tissues for a much larger set of samples than was previously feasible. It will also provide new ways to increase the accuracy of the genomic analysis of any mixed samples. Our Aim 2 will re-analyze, by deconvolution, what is to our knowledge the largest set of genomic data for the molecular profiling of lung tumors, all of which were collected at MD Anderson Cancer Center. Lung cancer leads amongst all cancers in causing death anywhere in the world. A thorough understanding of tumor biology is critical to the design of effective treatment modalities. Our analyses will include genomic data from more than 500 patients, generated from two innovative biomarker-based clinical trials: the Biomarker-integrated Approaches of Targeted Therapy for Lung Cancer Elimination (BATTLE) trials, and the Profiling of Resistance Patterns & Oncogenic Signaling Pathways in Evaluation of Cancers of the Thorax and Therapeutic Target Identification (PROSPECT) trials. We focus on the study of one prototype example, lung cancer, because of the public impact of the disease and also the likely role of the tumor-stroma interaction in determining clinical outcomes. Our proof-of-principle investigation of the lung cancer data would be the first of its kind, and has the potential to identify new biomarkers predictive of the effects of drug treatments on the survival time of individuals with lung cancer.

Related projects