
SimCorrMix  

Simulation of Correlated Data with Multiple Variable Types Including Continuous and Count Mixture Distributions


Download and install the SimCorrMix package from the R console
Install from CRAN:
install.packages("SimCorrMix")

Install from GitHub:
library("remotes")
install_github("cran/SimCorrMix")

Install by package version:
library("remotes")
install_version("SimCorrMix", "0.1.1")



Attach the package and use:
library("SimCorrMix")
Maintained by
Allison Cynthia Fialkowski
First Published: 2018-02-26
Latest Update: 2018-07-01
Description:
Generate continuous (normal, non-normal, or mixture distributions), binary, ordinal, and count (regular or zero-inflated, Poisson or Negative Binomial) variables with a specified correlation matrix, or one continuous variable with a mixture distribution. This package can be used to simulate data sets that mimic real-world clinical or genetic data sets (i.e., plasmodes, as in Vaughan et al., 2009 <doi:10.1016/j.csda.2008.02.032>). The methods extend those found in the 'SimMultiCorrData' R package.
Standard normal variables with an imposed intermediate correlation matrix are transformed to generate the desired distributions. Continuous variables are simulated using either Fleishman's (1978) third-order <doi:10.1007/BF02293811> or Headrick's (2002) fifth-order <doi:10.1016/S0167-9473(02)00072-5> polynomial transformation method (the power method transformation, PMT). Non-mixture distributions require the user to specify mean, variance, skewness, standardized kurtosis, and standardized fifth and sixth cumulants. Mixture distributions require these inputs for the component distributions plus the mixing probabilities.
Simulation occurs at the component level for continuous mixture distributions. The target correlation matrix is specified in terms of correlations with components of continuous mixture variables. These components are transformed into the desired mixture variables using random multinomial variables based on the mixing probabilities. However, the package provides functions to approximate expected correlations with continuous mixture variables given target correlations with the components.
Binary and ordinal variables are simulated using a modification of ordsample() in package 'GenOrd'. Count variables are simulated using the inverse CDF method. There are two simulation pathways which calculate intermediate correlations involving count variables differently. Correlation Method 1 adapts Yahav and Shmueli's 2012 method <doi:10.1002/asmb.901> and performs best with large count variable means and positive correlations or small means and negative correlations. Correlation Method 2 adapts Barbiero and Ferrari's 2015 modification of the 'GenOrd' package <doi:10.1002/asmb.2072> and performs best under the opposite scenarios.
The optional error loop may be used to improve the accuracy of the final correlation matrix. The package also contains functions to calculate the standardized cumulants of continuous mixture distributions, check parameter inputs, calculate feasible correlation boundaries, and summarize and plot simulated variables.
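The general idea in the description (correlated standard normals, inverse CDF for counts, multinomial mixing of components) can be illustrated with a minimal base R sketch. It does not call any SimCorrMix function and is not the package's implementation. The intermediate correlation of 0.5, the Poisson mean, the mixing probabilities, and the use of plain normal components (instead of the power method transformation) are all assumptions chosen for illustration, as is treating the two mixture components as uncorrelated with each other.

```r
set.seed(1)
n <- 10000

# Step 1: standard normal variables with an imposed intermediate correlation.
# Column 1 drives the count variable; columns 2 and 3 drive the two mixture
# components. The 0.5 entries are illustrative, not values the package computes.
Sigma <- matrix(c(1.0, 0.5, 0.5,
                  0.5, 1.0, 0.0,
                  0.5, 0.0, 1.0), nrow = 3, byrow = TRUE)
Z <- matrix(rnorm(n * 3), ncol = 3) %*% chol(Sigma)

# Step 2: inverse CDF method for a count variable.
# pnorm() maps the normal scores to (0, 1); qpois() inverts the Poisson CDF.
lambda <- 4
count <- qpois(pnorm(Z[, 1]), lambda = lambda)

# Step 3: simulate at the component level, then mix.
# Each row picks one component according to the mixing probabilities
# (a multinomial draw with a single trial).
mix_probs <- c(0.3, 0.7)
components <- cbind(-2 + Z[, 2], 3 + Z[, 3])
pick <- sample(1:2, n, replace = TRUE, prob = mix_probs)
mix <- components[cbind(seq_len(n), pick)]

# Inspect the result: sample means and the achieved correlation.
c(mean_count = mean(count), mean_mix = mean(mix), cor_count_mix = cor(count, mix))
```

The achieved correlation between the count and the mixture variable is smaller than the 0.5 imposed on the normal scores, which is why the package distinguishes intermediate from target correlations and why correlations with components differ from correlations with the mixture variable itself.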
How to cite:
Allison Cynthia Fialkowski (2018). SimCorrMix: Simulation of Correlated Data with Multiple Variable Types Including Continuous and Count Mixture Distributions. R package version 0.1.1, https://cran.r-project.org/web/packages/SimCorrMix. Accessed 07 Apr. 2025.
Previous versions and publish date:
0.1.0 (2018-02-26 20:04)
Downloads during the last 30 days
[Chart: daily downloads for SimCorrMix, 03/08 through 04/06; vertical axis 0 to 14]

© Copyright since 2022. All rights reserved, rpkg.net. Based in Cambridge, Massachusetts, USA