Check your outliers! An introduction to identifying statistical outliers in R with easystats

Abstract

Beyond the challenge of keeping up to date with current best practices regarding the diagnosis and treatment of outliers, an additional difficulty arises concerning the mathematical implementation of the recommended methods. Here, we provide an overview of current recommendations and best practices and demonstrate how they can easily and conveniently be implemented in the R statistical computing software, using the {performance} package of the easystats ecosystem. We cover univariate, multivariate, and model-based statistical outlier detection methods, their recommended threshold, standard output, and plotting methods. We conclude by reviewing the different theoretical types of outliers, whether to exclude or winsorize them, and the importance of transparency. A preprint of this paper is available at: https://doi.org/10.31234/osf.io/bu6nt.

Publication
Behavior Research Methods, 56(4), 4162-4172. doi.org/10.3758/s13428-024-02356-w
Rémi Thériault
Rémi Thériault
PhD Student (Social Psychology)

My research interests include social/implicit cognition, altruism, and dreams.

Related