Other packages > Find by keyword >

dataprep  

Efficient and Flexible Data Preprocessing Tools
View on CRAN: Click here


Download and install dataprep package within the R console
Install from CRAN:
install.packages("dataprep")

Install from Github:
library("remotes")
install_github("cran/dataprep")

Install by package version:
library("remotes")
install_version("dataprep", "0.1.5")



Attach the package and use:
library("dataprep")
Maintained by
Chun-Sheng Liang
[Scholar Profile | Author Map]
All associated links for this package
First Published: 2021-01-11
Latest Update: 2022-01-15
Description:
Efficiently and flexibly preprocess data using a set of data filtering, deletion, and interpolation tools. These data preprocessing methods are developed based on the principles of completeness, accuracy, threshold method, and linear interpolation and through the setting of constraint conditions, time completion & recovery, and fast & efficient calculation and grouping. Key preprocessing steps include deletions of variables and observations, outlier removal, and missing values (NA) interpolation, which are dependent on the incomplete and dispersed degrees of raw data. They clean data more accurately, keep more samples, and add no outliers after interpolation, compared with ordinary methods. Auto-identification of consecutive NA via run-length based grouping is used in observation deletion, outlier removal, and NA interpolation; thus, new outliers are not generated in interpolation. Conditional extremum is proposed to realize point-by-point weighed outlier removal that saves non-outliers from being removed. Plus, time series interpolation with values to refer to within short periods further ensures reliable interpolation. These methods are based on and improved from the reference: Liang, C.-S., Wu, H., Li, H.-Y., Zhang, Q., Li, Z. & He, K.-B. (2020) .
How to cite:
Chun-Sheng Liang (2021). dataprep: Efficient and Flexible Data Preprocessing Tools. R package version 0.1.5, https://cran.r-project.org/web/packages/dataprep. Accessed 21 Nov. 2024.
Previous versions and publish date:
0.1.0 (2021-01-11 11:00), 0.1.1 (2021-04-04 16:30), 0.1.2 (2021-05-25 06:50), 0.1.3 (2021-05-28 20:10), 0.1.4 (2021-07-04 16:50)
Other packages that cited dataprep R package
View dataprep citation profile
Other R packages that dataprep depends, imports, suggests or enhances
Complete documentation for dataprep
Functions, R codes and Examples using the dataprep R package
Some associated functions: condextr . data . data1 . dataprep . descdata . descplot . melt . obsedele . optisolu . percdata . percoutl . percplot . shorvalu . varidele . zerona . 
Some associated R codes: data.R . function.R . globals.R .  Full dataprep package functions and examples
Downloads during the last 30 days
Get rewarded with contribution points by helping add
Reviews / comments / questions /suggestions ↴↴↴

Today's Hot Picks in Authors and Packages

deductive  
Data Correction and Imputation Using Deductive Methods
Attempt to repair inconsistencies and missing values in data records by using information from vali ...
Download / Learn more Package Citations See dependency  
pkgdepends  
Package Dependency Resolution and Downloads
Find recursive dependencies of 'R' packages from various sources. Solve the dependencies to obtain ...
Download / Learn more Package Citations See dependency  
SCBiclust  
Identifies Mean, Variance, and Hierarchically Clustered Biclusters
Identifies a bicluster, a submatrix of the data such that the features and observations within the s ...
Download / Learn more Package Citations See dependency  
RcppHNSW  
'Rcpp' Bindings for 'hnswlib', a Library for Approximate Nearest Neighbors
'Hnswlib' is a C++ library for Approximate Nearest Neighbors. This package provides a minimal R int ...
Download / Learn more Package Citations See dependency  
crossrun  
Joint Distribution of Number of Crossings and Longest Run
Joint distribution of number of crossings and the longest run in a series of independent Bernoulli ...
Download / Learn more Package Citations See dependency  
kgschart  
KGS Rank Graph Parser
Restore underlining numeric data from rating history graph of KGS (an online platform of the game o ...
Download / Learn more Package Citations See dependency  

23,229

R Packages

199,929

Dependencies

62,984

Author Associations

23,230

Publication Badges

© Copyright 2022 - present. All right reserved, rpkg.net.  Based in Cambridge, Massachusetts, USA