Repository logo
 

Long genes and genes with multiple splice variants are enriched in pathways linked to cancer and other multigenic diseases

Published version
Peer-reviewed

Repository DOI


Type

Article

Change log

Authors

Sahakyan, Aleksandr  ORCID logo  https://orcid.org/0000-0002-8343-3594
Balasubramanian, Shankar  ORCID logo  https://orcid.org/0000-0002-0281-5815

Abstract

Background

The role of random mutations and genetic errors in defining the etiology of cancer and other multigenic diseases has recently received much attention. With the view that complex genes should be particularly vulnerable to such events, here we explore the link between the simple properties of the human genes, such as transcript length, number of splice variants, exon/intron composition, and their involvement in the pathways linked to cancer and other multigenic diseases.

Results

We reveal a substantial enrichment of cancer pathways with long genes and genes that have multiple splice variants. Although the latter two factors are interdependent, we show that the overall gene length and splicing complexity increase in cancer pathways in a partially decoupled manner. Our systematic survey for the pathways enriched with top lengthy genes and with genes that have multiple splice variants reveal, along with cancer pathways, the pathways involved in various neuronal processes, cardiomyopathies and type II diabetes. We outline a correlation between the gene length and the number of somatic mutations.

Conclusions

Our work is a step forward in the assessment of the role of simple gene characteristics in cancer and a wider range of multigenic diseases. We demonstrate a significant accumulation of long genes and genes with multiple splice variants in pathways of multigenic diseases that have already been associated with de novo mutations. Unlike the cancer pathways, we note that the pathways of neuronal processes, cardiomyopathies and type II diabetes contain genes long enough for topoisomerase-dependent gene expression to also be a potential contributing factor in the emergence of pathologies, should topoisomerases become impaired.

Description

Keywords

long genes, splice variants, cancer, multigenic diseases, KEGG pathways, mutations, topoisomerases

Journal Title

BMC Genomics

Conference Name

Journal ISSN

1471-2164
1471-2164

Volume Title

17

Publisher

BioMed Central
Sponsorship
This research was supported by the Cancer Research UK and the Herchel Smith Fund. SB is a Wellcome Trust Senior Investigator.