Other packages > Find by keyword >

smallsets  

Visual Documentation for Data Preprocessing
View on CRAN: Click here


Download and install smallsets package within the R console
Install from CRAN:
install.packages("smallsets")

Install from Github:
library("remotes")
install_github("cran/smallsets")

Install by package version:
library("remotes")
install_version("smallsets", "2.0.0")



Attach the package and use:
library("smallsets")
Maintained by
Lydia R. Lucchesi
[Scholar Profile | Author Map]
All associated links for this package
First Published: 2023-02-03
Latest Update: 2023-12-05
Description:
Data practitioners regularly use the 'R' and 'Python' programming languages to prepare data for analyses. Thus, they encode important data preprocessing decisions in 'R' and 'Python' code. The 'smallsets' package subsequently decodes these decisions into a Smallset Timeline, a static, compact visualisation of data preprocessing decisions (Lucchesi et al. (2022) <doi:10.1145/3531146.3533175>). The visualisation consists of small data snapshots of different preprocessing steps. The 'smallsets' package builds this visualisation from a user's dataset and preprocessing code located in an 'R', 'R Markdown', 'Python', or 'Jupyter Notebook' file. Users simply add structured comments with snapshot instructions to the preprocessing code. One optional feature in 'smallsets' requires installation of the 'Gurobi' optimisation software and 'gurobi' 'R' package, available from <https://www.gurobi.com>. More information regarding the optional feature and 'gurobi' installation can be found in the 'smallsets' vignette.
How to cite:
Lydia R. Lucchesi (2023). smallsets: Visual Documentation for Data Preprocessing. R package version 2.0.0, https://cran.r-project.org/web/packages/smallsets. Accessed 05 Mar. 2026.
Previous versions and publish date:
1.0.0 (2023-02-03 11:10)
Other packages that cited smallsets R package
View smallsets citation profile
Other R packages that smallsets depends, imports, suggests or enhances
Complete documentation for smallsets
Downloads during the last 30 days

Today's Hot Picks in Authors and Packages

diffIRT  
Diffusion IRT Models for Response and Response Time Data
Package to fit diffusion-based IRT models to response and response time data. Models are fit using ...
Download / Learn more Package Citations See dependency  
imagefx  
Extract Features from Images
Synthesize images into characteristic features for time-series analysis or machine learning applicat ...
Download / Learn more Package Citations See dependency  
ClimClass  
Climate Classification According to Several Indices
Classification of climate according to Koeppen - Geiger, of aridity indices, of continentality indi ...
Download / Learn more Package Citations See dependency  
pinp  
'pinp' is not 'PNAS'
A 'PNAS'-alike style for 'rmarkdown', derived from the 'Proceedings of the National Academy of Scie ...
Download / Learn more Package Citations See dependency  
neat  
Efficient Network Enrichment Analysis Test
Includes functions and examples to compute NEAT, the Network Enrichment Analysis Test described in ...
Download / Learn more Package Citations See dependency  
nextGenShinyApps  
Craft Exceptional 'R Shiny' Applications and Dashboards with Novel Responsive Tools
Nove responsive tools for designing and developing 'Shiny' dashboards and applications. The scripts ...
Download / Learn more Package Citations See dependency  

26,264

R Packages

223,360

Dependencies

70,244

Author Associations

26,265

Publication Badges

© Copyright since 2022. All right reserved, rpkg.net.  Based in Cambridge, Massachusetts, USA