Check your outliers! An introduction to identifying statistical outliers in R with easystats

Résumé

Beyond the challenge of keeping up to date with current best practices regarding the diagnosis and treatment of outliers, an additional difficulty arises concerning the mathematical implementation of the recommended methods. Here, we provide an overview of current recommendations and best practices and demonstrate how they can easily and conveniently be implemented in the R statistical computing software, using the {performance} package of the easystats ecosystem. We cover univariate, multivariate, and model-based statistical outlier detection methods, their recommended threshold, standard output, and plotting methods. We conclude by reviewing the different theoretical types of outliers, whether to exclude or winsorize them, and the importance of transparency. A preprint of this paper is available at: https://doi.org/10.31234/osf.io/bu6nt.

Publication
Behavior Research Methods, 1-11. doi.org/10.3758/s13428-024-02356-w
Rémi Thériault
Rémi Thériault
Étudiant au doctorat (Psychologie sociale)

Mes intérêts de recherche incluent la cognition sociale/implicite, l’altruisme, et les rêves.

Sur le même sujet