Other packages > Find by keyword >

dataprep  

Efficient and Flexible Data Preprocessing Tools
View on CRAN: Click here


Download and install dataprep package within the R console
Install from CRAN:
install.packages("dataprep")

Install from Github:
library("remotes")
install_github("cran/dataprep")

Install by package version:
library("remotes")
install_version("dataprep", "0.1.5")



Attach the package and use:
library("dataprep")
Maintained by
Chun-Sheng Liang
[Scholar Profile | Author Map]
All associated links for this package
First Published: 2021-01-11
Latest Update: 2022-01-15
Description:
Efficiently and flexibly preprocess data using a set of data filtering, deletion, and interpolation tools. These data preprocessing methods are developed based on the principles of completeness, accuracy, threshold method, and linear interpolation and through the setting of constraint conditions, time completion & recovery, and fast & efficient calculation and grouping. Key preprocessing steps include deletions of variables and observations, outlier removal, and missing values (NA) interpolation, which are dependent on the incomplete and dispersed degrees of raw data. They clean data more accurately, keep more samples, and add no outliers after interpolation, compared with ordinary methods. Auto-identification of consecutive NA via run-length based grouping is used in observation deletion, outlier removal, and NA interpolation; thus, new outliers are not generated in interpolation. Conditional extremum is proposed to realize point-by-point weighed outlier removal that saves non-outliers from being removed. Plus, time series interpolation with values to refer to within short periods further ensures reliable interpolation. These methods are based on and improved from the reference: Liang, C.-S., Wu, H., Li, H.-Y., Zhang, Q., Li, Z. & He, K.-B. (2020) .
How to cite:
Chun-Sheng Liang (2021). dataprep: Efficient and Flexible Data Preprocessing Tools. R package version 0.1.5, https://cran.r-project.org/web/packages/dataprep. Accessed 22 Dec. 2024.
Previous versions and publish date:
0.1.0 (2021-01-11 11:00), 0.1.1 (2021-04-04 16:30), 0.1.2 (2021-05-25 06:50), 0.1.3 (2021-05-28 20:10), 0.1.4 (2021-07-04 16:50)
Other packages that cited dataprep R package
View dataprep citation profile
Other R packages that dataprep depends, imports, suggests or enhances
Complete documentation for dataprep
Functions, R codes and Examples using the dataprep R package
Some associated functions: condextr . data . data1 . dataprep . descdata . descplot . melt . obsedele . optisolu . percdata . percoutl . percplot . shorvalu . varidele . zerona . 
Some associated R codes: data.R . function.R . globals.R .  Full dataprep package functions and examples
Downloads during the last 30 days
Get rewarded with contribution points by helping add
Reviews / comments / questions /suggestions ↴↴↴

Today's Hot Picks in Authors and Packages

quickcode  
Quick and Essential 'R' Tricks for Better Scripts
The NOT functions, 'R' tricks and a compilation of some simple quick plus often used 'R' codes to im ...
Download / Learn more Package Citations See dependency  
elect  
Estimation of Life Expectancies Using Multi-State Models
Functions to compute state-specific and marginal life expectancies. The computation is based on a fi ...
Download / Learn more Package Citations See dependency  
Rfast2  
A Collection of Efficient and Extremely Fast R Functions II
A collection of fast statistical and utility functions for data analysis. Functions for regression, ...
Download / Learn more Package Citations See dependency  
wordspace  
Distributional Semantic Models in R
An interactive laboratory for research on distributional semantic models ('DSM', see < ...
Download / Learn more Package Citations See dependency  
LOGANTree  
Tree-Based Models for the Analysis of Log Files from Computer-Based Assessments
Enables researchers to model log-file data from computer-based assessments using machine-learning te ...
Download / Learn more Package Citations See dependency  
Maintainer: Qi Qin (view profile)
composits  
Compositional, Multivariate and Univariate Time Series Outlier Ensemble
A compositional multivariate and univariate time series outlier ensemble.It uses the four R packages ...
Download / Learn more Package Citations See dependency  

23,394

R Packages

201,798

Dependencies

63,416

Author Associations

23,395

Publication Badges

© Copyright 2022 - present. All right reserved, rpkg.net.  Based in Cambridge, Massachusetts, USA