The authors of this paper present a novel R software package, called MPrESS, that enables researchers to determine the minimum number of samples required to address a given study hypothesis using 16S rRNA gene microbiome data with sufficient power and allows users to compute power calculations based only on a subset of DESeq2 identified taxa.
Deep sequencing has revealed that the 16S rRNA gene composition of the human microbiome can vary between populations. However, when existing data are insufficient to address the desired study questions due to limited sample sizes, Dirichlet mixture modeling (DMM) can simulate 16S rRNA gene predictions from experimental microbiome data. The authors examined the extent to which simulated 16S rRNA gene microbiome data can accurately reflect the diversity within that identified from experimental data and calculate the power. Even when experimental and simulated datasets differed by less than 10 percent, simulation by DMM consistently overestimates power, except when using only highly discriminating taxa. Admixtures of DMM with experimental data performed poorly compared to pure simulation and did not show the same correlation with experimental data p-value and power values. While multiple replications of random sampling remain the favored method of determining the power, when the estimated sample size required to achieve a certain power exceeds the sample number, then simulated samples based on DMM can be used. The authors introduce an R-Package, MPrESS, to assist in power calculation and sample size estimation for a 16S rRNA gene microbiome dataset to detect a difference between populations. MPrESS can be downloaded from GitHub.(Publisher Abstract Provided)
Downloads
Similar Publications
- Improved Nucleic Acid Recovery From Trace and Degraded Samples Using Affinity Purification
- "It's the Best Thing in the World Around Here": The Potential for Protective Places in a High Crime Neighborhood
- SAVRY Predictive Validity of Mississippi Justice-Involved Youth Recidivism: A Latent Variable Approach