Murine genetic models of obesity: type I error rates and the power of commonly used analyses as assessed by plasmode-based simulation
| dc.contributor.author | Ejima, Keisuke | |
| dc.contributor.author | Brown, Andrew W. | |
| dc.contributor.author | Smith, Daniel L., Jr. | |
| dc.contributor.author | Beyaztas, Ufuk | |
| dc.contributor.author | Allison, David B. | |
| dc.date.accessioned | 2025-10-18T13:23:01Z | |
| dc.date.created | 2020 | |
| dc.date.issued | 2020 | |
| dc.department | Fakülteler, Fen Fakültesi, Matematik Bölümü | |
| dc.description.abstract | Background/Objectives Genetic contributors to obesity are frequently studied in murine models. However, the sample sizes of these studies are often small, and the data may violate assumptions of common statistical tests, such as normality of distributions. We examined whether, in these cases, type I error rates and power are affected by the choice of statistical test. Subjects/Methods We conducted plasmode-based simulation using empirical data on body mass (weight) from murine genetic models of obesity. For the type I error simulation, the weight distributions were adjusted to ensure no difference in means between control and mutant groups. For the power simulation, the distributions of the mutant groups were shifted to ensure specific effect sizes. Three to twenty mice were resampled from the empirical distributions to create a plasmode. We then computed type I error rates and power for five common tests on the plasmodes: Student's t test, Welch's t test, Wilcoxon rank sum test (aka, Mann-Whitney U test), permutation test, and bootstrap test. Results We observed type I error inflation for all tests, except the bootstrap test, with small samples (<= 5). Type I error inflation decreased as sample size increased (>= 8) but remained. The Wilcoxon test should be avoided because of heterogeneity of distributions. For power, a departure from the reference was observed with small samples for all tests. Compared with the other tests, the bootstrap test had less power with small samples. Conclusions Overall, the bootstrap test is recommended for small samples to avoid type I error inflation, but this benefit comes at the cost of lower power. When sample size is large enough, Welch's t test is recommended because of high power with minimal type I error inflation. | |
| dc.description.sponsorship | NIH [3P30DK056336, R25DK099080, R25HL124208]; Japan Society for Promotion of Science (JSPS) KAKENHI grant [18K18146]; Lilly Endowment, Inc.; Indiana METACyt Initiative; Grants-in-Aid for Scientific Research [18K18146] Funding Source: KAKEN | |
| dc.description.sponsorship | This study was supported in part by NIH grants 3P30DK056336 (DBA), R25DK099080 (DBA), R25HL124208 (DBA) and Japan Society for Promotion of Science (JSPS) KAKENHI grant 18K18146 (KE). The data analyses and simulation were performed using a supercomputer, Karst, which was supported in part by Lilly Endowment, Inc., through its support for the Indiana University Pervasive Technology Institute, and in part by the Indiana METACyt Initiative. The Indiana METACyt Initiative at IU was also supported in part by Lilly Endowment, Inc. The opinions expressed are those of the authors and do not necessarily represent those of the NIH or any other organization. All the code which was used in this study will be available through the following webpage: https://doi.org/10.5281/zenodo.1488359.Supplementary information is available at the International Journal of Obesity's website. | |
| dc.identifier.doi | 10.1038/s41366-020-0554-2 | |
| dc.identifier.endpage | 1449 | |
| dc.identifier.issn | 0307-0565 | |
| dc.identifier.issn | 1476-5497 | |
| dc.identifier.issue | 6 | |
| dc.identifier.orcid | Beyaztas, Ufuk/0000-0002-5208-4950 | |
| dc.identifier.orcid | Brown, Andrew/0000-0002-1758-8205 | |
| dc.identifier.orcid | Smith Jr, Daniel/0000-0002-1602-2023 | |
| dc.identifier.orcid | Ejima, Keisuke/0000-0002-1185-3987 | |
| dc.identifier.pmid | 32099106 | |
| dc.identifier.scopus | 2-s2.0-85080028487 | |
| dc.identifier.scopusquality | Q1 | |
| dc.identifier.startpage | 1440 | |
| dc.identifier.uri | https://doi.org/10.1038/s41366-020-0554-2 | |
| dc.identifier.uri | https://hdl.handle.net/11772/22637 | |
| dc.identifier.volume | 44 | |
| dc.identifier.wos | WOS:000516439000001 | |
| dc.identifier.wosquality | Q2 | |
| dc.indekslendigikaynak | Web of Science | |
| dc.indekslendigikaynak | Scopus | |
| dc.indekslendigikaynak | PubMed | |
| dc.language.iso | en | |
| dc.publisher | Springernature | |
| dc.relation.ispartof | International Journal of Obesity | |
| dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | |
| dc.relation.sdg | Goal-03: Good Health and Well-Being | |
| dc.rights | info:eu-repo/semantics/openAccess | |
| dc.snmz | WoS_20251016 | |
| dc.subject | Student T-Test | |
| dc.subject | Mann-Whitney | |
| dc.subject | Sample Sizes | |
| dc.subject | Guidelines | |
| dc.subject | Violation | |
| dc.subject | Tests | |
| dc.title | Murine genetic models of obesity: type I error rates and the power of commonly used analyses as assessed by plasmode-based simulation | |
| dc.type | Article | |
| dspace.entity.type | Publication |










