Publication:
Correlation among baseline variables yields non-uniformity of p-values

Thumbnail Image

Date

2017

Journal Title

Journal ISSN

Volume Title

Publisher

Public Library of Science
The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Betensky, Rebecca A., and Sy Han Chiou. 2017. “Correlation among baseline variables yields non-uniformity of p-values.” PLoS ONE 12 (9): e0184531. doi:10.1371/journal.pone.0184531. http://dx.doi.org/10.1371/journal.pone.0184531.

Research Data

Abstract

A recent paper in Neurology used statistical techniques to investigate the integrity of the randomization in 33 clinical trials conducted by a group of investigators. Without justification, the approach assumed that there would be no impact of correlation among baseline variables. We investigated the impact of correlation on the conclusions of the approach in several large-scale simulation studies that replicated the sample sizes and baseline variables of the clinical trials in question and utilized proper randomization. Additionally, we considered scenarios with larger numbers of baseline variables. We found that, with even moderate correlation, there can be substantial inflation of the type I error of statistical tests of randomization integrity. This is also the case under no correlation, in the presence of some discrete baseline variables, with a large number of variables. Thus, statistical techniques for assessing randomization integrity should be applied with extreme caution given that very low p-values, which are taken as evidence against valid randomization, can arise even in the case of valid randomization, in the presence of correlation. More generally, the use of tests of goodness of fit to uniformity for the purpose of testing a global null hypothesis is not advisable in the presence of correlation.

Description

Keywords

Mathematical and Statistical Techniques, Statistical Methods, Statistical Hypothesis Testing, Chi Square Tests, Physical Sciences, Mathematics, Statistics (Mathematics), Simulation and Modeling, Probability Theory, Probability Distribution, Normal Distribution, Random Variables, Medicine and Health Sciences, Clinical Medicine, Clinical Trials, Pharmacology, Drug Research and Development, Discrete Random Variables, Science Policy, Publication Ethics, Vascular Medicine, Blood Pressure

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Referenced By

Related Stories