Detecting copy number alterations in RNA-Seq using SuperFreq
- Author(s)
- Flensburg, C; Oshlack, A; Majewski, IJ
- Journal Title
- Bioinformatics
- Publication Type
- epub ahead of print
- Abstract
- MOTIVATION: Calling copy number alterations (CNAs) from RNA sequencing (RNA-Seq) is challenging because of the marked variability in coverage across genes and the paucity of single nucleotide polymorphisms (SNPs). We have adapted SuperFreq to call absolute and allele-sensitive CNAs from RNA-Seq. SuperFreq uses an error-propagation framework to combine and maximise information from read counts and B-allele frequencies (BAFs). RESULTS: We used datasets from The Cancer Genome Atlas (TCGA) to assess the validity of CNA calls from RNA-Seq. When ploidy estimates were consistent, we found agreement with DNA SNP-arrays for over 98% of the genome for acute myeloid leukaemia (TCGA-AML, n = 116) and 87% for colorectal cancer (TCGA-CRC, n = 377). The sensitivity of CNA calling from RNA-Seq depended on gene density. Using RNA-Seq, SuperFreq detected 78% of CNA calls covering 100 or more genes with a precision of 94%. Recall dropped for focal events, but this also depended on signal intensity. For example, in the CRC cohort, SuperFreq identified all cases (7/7) with high-level amplification of ERBB2, where the copy number was typically >20, but identified only 6% of cases (1/17) with moderate amplification of IGF2, which occurs over a smaller interval. SuperFreq offers an integrated platform for identification of CNAs and point mutations. As evidence of how SuperFreq can be applied, we used it to reproduce the established relationship between somatic mutation load and CNA profile in CRC using RNA-Seq alone. AVAILABILITY: SuperFreq is implemented in R and the code is available through GitHub: https://github.com/ChristofferFlensburg/SuperFreq. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
- Publisher
- Oxford Academic
- Research Division(s)
- Blood Cells And Blood Cancer
- PubMed ID
- 34132781
- Publisher's Version
- https://doi.org/10.1093/bioinformatics/btab440
- Terms of Use/Rights Notice
- Refer to copyright notice on published article.
Creation Date: 2021-06-21 10:25:57
Last Modified: 2021-06-21 10:37:04