Veronica M. White | Home Blog Contact |
Posted: 30 Jan 2022
Author: Veronica White
updated: 2/25/22
Collaborative work with organizations often involves analyzing whatever data is available. These analyses can often lead to statistical testing to help understand and communicate what the data tells you about the organization’s processes or programs. With so many statistical tests and methods out there, it’s essential to choose the right one(s) for your study.
Before you start, consider how these frequentist statistics (i.e., statistical methods that estimate p-values) play a role in your study. The American Statistician Association has written two statements that highlight the concerns, considerations, and alternative/supplementary approaches when conducting scientific studies that utilize frequentist statistics (see [1], [2]).
Suppose you have decided to go ahead and use frequentist statistics and, more specifically, hypothesis testing for your study. In that case, the next natural question is, what test do you use? To choose a test, you must describe both the purpose of the test and the data you are analyzing, which are outlined as the following:
Now that both the test’s purpose and data are well defined, you are ready to choose a test—table 1 summarizes when to use various hypothesis tests. Additional information on choosing a statistical test and on the various statistical tests can be found in ([6],[7]).
Each test can be implemented using various software such as SAS, R, SPSS, and STATA. See [10] for examples of implementing the various tests in your desired software. Are we all done? Not quite, re-read [2] and read [11] for interpretation and best practices of reporting p-values.
Purpose of Test | Continuous and normal data | Continuous, non-normal data OR non-continuous, discrete or ordinal data | Non-continuous, categorical data |
---|---|---|---|
Compare 1 mean with a population value | One sample z-test/t-testa | one-sample median | exact binomial test |
Compare 2 independent groups | Independent samples z-test/ t-testa | Mann-Whitney U/ Wilcoxon Sum of ranksb | Chi-squared test or Fisher’s exact testc |
Compare 2 paired groups | Paired t-test | Wilcoxon signed ranks test/ sign test | McNemar’s test |
Compare 3 or more independent groups | One-way Analysis of Variance | Kruskal Wallis test | Chi-squared test |
Compare 3 or more paired groups | Repeated measures Analysis of variance | Friedman test | Cochrane Q |
footnotes:
a: If the sample size is small (e.g., n < 30), use a t-test.
b: The Mann-Whitney U test is the same as conducting the Wilcoxon Sum of ranks test, see [8]
c: See [9] for a disscussion on using a chi squred vs fisher test https://www.datascienceblog.net/post/statistical_test/contingency_table_tests/
[1] R. L. Wasserstein and N. A. Lazar, “The ASA Statement on p-Values: Context, Process, and Purpose,” The American Statistician, vol. 70, no. 2, pp. 129–133, Apr. 2016, doi: 10.1080/00031305.2016.1154108.
[2] R. L. Wasserstein, A. L. Schirm, and N. A. Lazar, “Moving to a World Beyond ‘p < 0.05,’” The American Statistician, vol. 73, no. sup1, pp. 1–19, Mar. 2019, doi: 10.1080/00031305.2019.1583913.
[3] “Types of Statistical Data: Numerical, Categorical, and Ordinal,” dummies. https://www.dummies.com/article/academics-the-arts/math/statistics/types-of-statistical-data-numerical-categorical-and-ordinal-169735 (accessed Feb. 07, 2022).
[4] “Which statistical test should you use? | XLSTAT Support Center.” https://help.xlstat.com/s/article/which-statistical-test-should-you-use?language=en_US (accessed Feb. 07, 2022).
[5]“Parametric and Non-parametric tests for comparing two or more groups | Health Knowledge.” https://www.healthknowledge.org.uk/public-health-textbook/research-methods/1b-statistical-methods/parametric-nonparametric-tests (accessed Feb. 07, 2022).
[6] Wills, A. Research Methods and Statistics. Online Course Acessed 2/2/2022.http://www.bristol.ac.uk/medical-school/media/rms/red/which_test.html
[7]A. Ghasemi, and S. Zahediasl. “Normality tests for statistical analysis: a guide for non-statisticians.” International journal of endocrinology and metabolism 10, no. 2 (2012): 486. doi: 10.5812/ijem.3505
[8] “Mann–Whitney U test,” Wikipedia. Jan. 31, 2022. Accessed: Feb. 07, 2022. [Online]. Available: https://en.wikipedia.org/w/index.php?title=Mann%E2%80%93Whitney_U_test&oldid=1069150075
[9] “Testing Independence: Chi-Squared vs Fisher’s Exact Test,” Oct. 17, 2018. https://www.datascienceblog.net/post/statistical_test/contingency_table_tests/ (accessed Feb. 07, 2022).
[10] “Choosing the Correct Statistical Test in SAS, Stata, SPSS and R.” https://stats.oarc.ucla.edu/other/mult-pkg/whatstat/ (accessed Feb. 25, 2022).
[11] J. Storopoli. “Bayesian Statistics with Julia and Turing”. p-values. 2021. https://storopoli.io/Bayesian-Julia/pages/2_bayes_stats/#p-values