Statistical Reporting Guidelines for Academic Research
General Principles for Reporting Statistical Results in Academic Manuscripts
Principles for Reporting Statistical Results
- Managing Multiple Testing and Corrections
- Statistical Software and Reproducibility
- Justifying Statistical Tests and Checking Assumptions
- Handling Missing Data
- Sample Size Determination and Power Analysis
- Protocol Preregistration
- Ethical Reporting and Inclusion
- Summary Checklist for Manuscript Reporting

Recent Post
Introduction
Precision in Descriptive Statistics
1. Appropriate Precision
2. Sample Sizes and Proportions
1. Normal vs. Non-Normal Distributions
2. Avoiding Misuse of SEM in Descriptions
1. Tables
Tables are best used to report precise numerical data, including the mean, SD, median, IQR, sample sizes, and/or statistical test results. Tables should be displayed in a simple and readable form (Boutron et al. (2008); Huser et al. (2019)).
2. Figures
1. Exact p-Values and Confidence Intervals (CIs)
Inferential findings should report the exact p-values, reporting them as p = 0.012 rather than vague categories such as p < 0.05 (Johnston & Hauser (2014); Guillemin et al. (2019)). Report and report 95% CIs along with p-values so that they may be interpreted for both statistical significance and clinical importance (Huser et al. (2019)).
2. Effect Sizes and Test Statistics
3. Degrees of Freedom
Statistical Software and Reproducibility
Justifying Statistical Tests and Checking Assumptions
Handling Missing Data
Sample Size Determination and Power Analysis
Protocol Preregistration
Ethical Reporting and Inclusion
Summary Checklist for Manuscript Reporting
To promote rigorous reporting and publication, researchers should consider the following actions:
- Always report sample size as the total, and subgroups.
- Report raw data, and values presented as percentages.
- Report as mean/SD or median/IQR that reflects the distribution of the data, to summarize data.
- Avoid using SEM in any descriptive statistics.
- Present exact p-values, and confidence intervals.
Clearly report the test statistics including degrees of freedom, and effect size. - Make clear if the tests adjusted for multiple testing, and how/what they adjusted.
- Report both software and any analytical code made publicly available.
Justify any tests, transformations, or how you checked underlying assumptions. - Describe how missing data were dealt with i.e. excluded or imputed.
- Report on any pre-study power analyses, the outcomes of these, and document.
- Report and cite, preregister, and report actual deviations.
- Report, and if possible, analysis, demographic subgroups as they arise, if appropriate.
Conclusion
Following accurate and precise reporting standards increases stabilizes scientific credibility and reproducibility. We have compiled these guidelines—developed the recent BMJ, The Journal of Bone & Joint Surgery, BMC Medical Ethics, JAMIA, Annals of the New York Academy of Sciences, and Journal of Neuroscience Research publications—as a practical framework for good statistical reporting. When researchers delineate their manuscripts with clarity, accuracy and transparency, they help to strengthen the impact and reliability of their findings.
References
1. Boutron, I., Moher, D., Altman, D. G., Schulz, K. F., & Ravaud, P. (2008). Extending the CONSORT statement to randomized trials of nonpharmacologic treatment: Explanation and elaboration. BMJ, 329(7471), 883. https://www.bmj.com/content/329/7471/883.full
2. Concato, J., Shah, N., & Horwitz, R. I. (2000). Randomized, controlled trials, observational studies, and the hierarchy of research designs. The Journal of Bone & Joint Surgery, 87(Supplement_2), 2-7. https://journals.lww.com/jbjsjournal/fulltext/2009/05003/Analysis_of_Observational_Studies__A_Guide_to.9.aspx/1000
3. Guillemin, M., Gillam, L., Rosenthal, D., & Bolitho, A. (2019). Researcher discomfort in qualitative research: acknowledging emotional challenges in research with vulnerable populations. BMC Medical Ethics, 20, Article 39. https://link.springer.com/article/10.1186/s12910-019-0359-9
4. Huser, V., Cimino, J. J., & Lai, A. M. (2019). Desiderata for computable representations of electronic health records-driven phenotype algorithms. Journal of the American Medical Informatics Association, 26(3), 185–195. https://academic.oup.com/jamia/article-abstract/26/3/185/5301680
5. Ioannidis, J. P. A. (2018). The proposal to lower p value thresholds to .005. Annals of the New York Academy of Sciences, 1429(1), 1–10. https://nyaspubs.onlinelibrary.wiley.com/doi/abs/10.1111/nyas.13325
6. Johnston, M., & Hauser, S. L. (2014). Reporting standards for preclinical and clinical research in neuroscience. Journal of Neuroscience Research, 92(9), 1150–1152. https://onlinelibrary.wiley.com/doi/full/10.1002/jnr.24340
7. Shane, E., Burr, D., Abrahamsen, B., Adler, R. A., Brown, T. D., Cheung, A. M., … & Watts, N. B. (2019). Atypical subtrochanteric and diaphyseal femoral fractures: Second report of a task force of the American Society for Bone and Mineral Research. Journal of Bone and Mineral Research, 34(11), 1985–2012. https://academic.oup.com/jbmr/article-abstract/34/11/1981/7606045