Repository logo
 

Overcoming confounding plate effects in differential expression analyses of single-cell RNA-seq data

Published version
Peer-reviewed

Change log

Authors

Lun, ATL 
Marioni, JC 

Abstract

An increasing number of studies are using single-cell RNA-sequencing (scRNA-seq) to characterize the gene expression profiles of individual cells. One common analysis applied to scRNA-seq data involves detecting differentially expressed (DE) genes between cells in different biological groups. However, many experiments are designed such that the cells to be compared are processed in separate plates or chips, meaning that the groupings are confounded with systematic plate effects. This confounding aspect is frequently ignored in DE analyses of scRNA-seq data. In this article, we demonstrate that failing to consider plate effects in the statistical model results in loss of type I error control. A solution is proposed whereby counts are summed from all cells in each plate and the count sums for all plates are used in the DE analysis. This restores type I error control in the presence of plate effects without compromising detection power in simulated data. Summation is also robust to varying numbers and library sizes of cells on each plate. Similar results are observed in DE analyses of real data where the use of count sums instead of single-cell counts improves specificity and the ranking of relevant genes. This suggests that summation can assist in maintaining statistical rigour in DE analyses of scRNA-seq data with plate effects.

Description

Keywords

differential expression, plate effects, single-cell RNA sequencing, summation

Journal Title

Biostatistics

Conference Name

Journal ISSN

1465-4644
1468-4357

Volume Title

Publisher

Oxford University Press
Sponsorship
Cancer Research UK (C14303/A17197)
Cancer Research UK (22231)
This work was supported by the University of Cambridge, Cancer Research UK (award no. A17197) and Hutchison Whampoa Limited. J.C.M. was also supported by core funding from the European Molecular Biology Laboratory.
Relationships
Is derived from: