Multiple hypothesis testing in genomics

Stat Med. 2014 May 20;33(11):1946-78. doi: 10.1002/sim.6082. Epub 2014 Jan 8.

Abstract

This paper presents an overview of the current state of the art in multiple testing in genomics data from a user's perspective. We describe methods for familywise error control, false discovery rate control and false discovery proportion estimation and confidence, both conceptually and practically, and explain when to use which type of error rate. We elaborate on the assumptions underlying the methods and discuss pitfalls in the interpretation of results. In our discussion, we take into account the exploratory nature of genomics experiments, looking at selection of genes before or after testing, and at the role of validation experiments.

Keywords: Bonferroni; FDR; false discovery proportion; false discovery rate; familywise error rate.

MeSH terms

  • Data Interpretation, Statistical*
  • Gene Expression Profiling / methods
  • Genome-Wide Association Study / methods
  • Genomics / methods*
  • Humans