TCGA

A Pan-Cancer Analysis Reveals High-Frequency Genetic Alterations in Mediators of Signaling by the TGF-β Superfamily

We present an integromic analysis of gene alterations that modulate transforming growth factor β (TGF-β)-Smad-mediated signaling in 9,125 tumor samples across 33 cancer types in The Cancer Genome Atlas (TCGA). Focusing on genes that encode mediators and regulators of TGF-β signaling, we found at least one genomic alteration (mutation, homozygous deletion, or amplification) in 39% of samples, with highest frequencies in gastrointestinal cancers. We identified mutation hotspots in genes that encode TGF-β ligands (BMP5), receptors (TGFBR2, AVCR2A, and BMPR2), and Smads (SMAD2 and SMAD4).

Comprehensive Molecular Characterization of the Hippo Signaling Pathway in Cancer

Hippo signaling has been recognized as a key tumor suppressor pathway. Here, we perform a comprehensive molecular characterization of 19 Hippo core genes in 9,125 tumor samples across 33 cancer types using multidimensional “omic” data from The Cancer Genome Atlas. We identify somatic drivers among Hippo genes and the related microRNA (miRNA) regulators, and using functional genomic approaches, we experimentally characterize YAP and TAZ mutation effects and miR-590 and miR-200a regulation for TAZ.

Comprehensive Analysis of Alternative Splicing Across Tumors from 8,705 Patients

Our comprehensive analysis of alternative splicing across 32 The Cancer Genome Atlas cancer types from 8,705 patients detects alternative splicing events and tumor variants by reanalyzing RNA and whole-exome sequencing data. Tumors have up to 30% more alternative splicing events than normal samples. Association analysis of somatic variants with alternative splicing events confirmed known trans associations with variants in SF3B1 and U2AF1 and identified additional trans-acting variants (e.g., TADA1, PPP2R1A).

An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics

For a decade, The Cancer Genome Atlas (TCGA) program collected clinicopathologic annotation data along with multi-platform molecular profiles of more than 11,000 human tumors across 33 different cancer types. TCGA clinical data contain key features representing the democratized nature of the data collection process. To ensure proper use of this large clinical dataset associated with genomic features, we developed a standardized dataset named the TCGA Pan-Cancer Clinical Data Resource (TCGA-CDR), which includes four major clinical outcome endpoints.

Scalable Open Science Approach for Mutation Calling of Tumor Exomes Using Multiple Genomic Pipelines

The Cancer Genome Atlas (TCGA) cancer genomicsdataset includes over 10,000 tumor-normal exomepairs across 33 different cancer types, in total >400TB of raw data files requiring analysis. Here wedescribe the Multi-Center Mutation Calling in Multi-ple Cancers project, our effort to generate a compre-hensive encyclopedia of somatic mutation calls forthe TCGA data to enable robust cross-tumor-typeanalyses. Our approach accounts for varianceand batch effects introduced by the rapid advance-ment of DNA extraction, hybridization-capture,sequencing, and analysis methods over time.

Pan-cancer Alterations of the MYC Oncogene and Its Proximal Network across the Cancer Genome Atlas

Although theMYConcogene has been implicated incancer, a systematic assessment of alterations ofMYC, related transcription factors, and co-regulatoryproteins, forming the proximal MYC network (PMN),across human cancers is lacking. Using computa-tional approaches, we define genomic and proteo-mic features associated with MYC and the PMNacross the 33 cancers of The Cancer Genome Atlas.Pan-cancer, 28% of all samples had at least one ofthe MYC paralogs amplified.

Machine Learning Detects Pan-cancer Ras Pathway Activation in The Cancer Genome Atlas

Precision oncology uses genomic evidence to match patients with treatment but often fails to
identify all patients who may respond. The transcriptome of these “hidden responders” may reveal
responsive molecular states. We describe and evaluate a machine-learning approach to classify
aberrant pathway activity in tumors, which may aid in hidden responder identification. The
algorithm integrates RNA-seq, copy number, and mutations from 33 different cancer types across
The Cancer Genome Atlas (TCGA) PanCanAtlas project to predict aberrant molecular states in

Identification of Disease-miRNA networks across different cancer types using SWIM

MicroRNAs (miRNAs) are small noncoding RNAs (ncRNAs) involved in several biological processes and diseases. MiRNAs regulate gene expression at the posttranscriptional level, mostly downregulating their targets by binding specific regions of transcripts through imperfect sequence complementarity. Prediction of miRNA-binding sites is challenging, and target prediction algorithms are usually based on sequence complementarity.

© Università degli Studi di Roma "La Sapienza" - Piazzale Aldo Moro 5, 00185 Roma