Other packages > Find by keyword >

koRpus  

Text Analysis with Emphasis on POS Tagging, Readability, and Lexical Diversity
View on CRAN: Click here


Download and install koRpus package within the R console
Install from CRAN:
install.packages("koRpus")

Install from Github:
library("remotes")
install_github("cran/koRpus")

Install by package version:
library("remotes")
install_version("koRpus", "0.13-8")



Attach the package and use:
library("koRpus")
Maintained by
Meik Michalke
[Scholar Profile | Author Map]
All associated links for this package
First Published: 2012-03-08
Latest Update: 2021-05-17
Description:
A set of tools to analyze texts. Includes, amongst others, functions for automatic language detection, hyphenation, several indices of lexical diversity (e.g., type token ratio, HD-D/vocd-D, MTLD) and readability (e.g., Flesch, SMOG, LIX, Dale-Chall). Basic import functions for language corpora are also provided, to enable frequency analyses (supports Celex and Leipzig Corpora Collection file formats) and measures like tf-idf. Note: For full functionality a local installation of TreeTagger is recommended. It is also recommended to not load this package directly, but by loading one of the available language support packages from the 'l10n' repository . 'koRpus' also includes a plugin for the R GUI and IDE RKWard, providing graphical dialogs for its basic features. The respective R package 'rkward' cannot be installed directly from a repository, as it is a part of RKWard. To make full use of this feature, please install RKWard from (plugins are detected automatically). Due to some restrictions on CRAN, the full package sources are only available from the project homepage. To ask for help, report bugs, request features, or discuss the development of the package, please subscribe to the koRpus-dev mailing list ().
How to cite:
Meik Michalke (2012). koRpus: Text Analysis with Emphasis on POS Tagging, Readability, and Lexical Diversity. R package version 0.13-8, https://cran.r-project.org/web/packages/koRpus. Accessed 31 Jan. 2025.
Previous versions and publish date:
0.04-27 (2012-03-08 08:11), 0.04-36 (2012-08-28 07:32), 0.04-40 (2013-04-08 00:07), 0.05-3 (2013-12-21 21:21), 0.05-4 (2014-01-22 21:12), 0.05-5 (2014-03-20 03:17), 0.05-6 (2015-06-30 18:02), 0.06-4 (2016-03-08 06:22), 0.06-5 (2016-06-06 06:39), 0.10-1 (2017-03-02 14:37), 0.10-2 (2017-04-05 00:04), 0.11-5 (2018-10-28 08:50), 0.13-1 (2020-09-21 23:40), 0.13-2 (2020-09-24 10:00), 0.13-3 (2020-10-15 14:10), 0.13-4 (2020-12-11 23:50), 0.13-5 (2021-02-02 17:00), 0.13-6 (2021-05-09 07:10), 0.13-7 (2021-05-14 07:20)
Other packages that cited koRpus R package
View koRpus citation profile
Other R packages that koRpus depends, imports, suggests or enhances
Complete documentation for koRpus
Functions, R codes and Examples using the koRpus R package
Some associated functions: ARI . C.ld . CTTR . DRP . ELF . FOG . FORCAST . HDD . K.ld . LIX . MATTR . MSTTR . MTLD . R.ld . RIX . S.ld . SMOG . TRI . TTR . U.ld . available.koRpus.lang . bormuth . cTest-methods . clozeDelete-methods . coleman.liau . coleman . correct-methods . dale.chall . danielson.bryan . dickes.steiwer . docTermMatrix . farr.jenkins.paterson . filterByClass-methods . flesch.kincaid . flesch . freq.analysis-methods . fucks . get.kRp.env . guess.lang . gutierrez . harris.jacobson . hyphen-methods . install.koRpus.lang . jumbleWords-methods . kRp.POS.tags . kRp.TTR-class . kRp.cluster . kRp.corp.freq-class . kRp.lang-class . kRp.readability-class . kRp.text-class . kRp.text_get-methods . koRpus-deprecated . koRpus-package . lex.div-methods . lex.div.num . linsear.write . maas . nWS . pasteText-methods . plot-methods . query-methods . read.BAWL . read.corp.LCC . read.corp.celex . read.corp.custom-methods . readTagged-methods . readability-methods . readability.num . segment.optimizer . set.kRp.env . set.lang.support . show-methods . spache . split_by_doc_id . strain . summary-methods . textFeatures . textTransform-methods . tokenize-methods . traenkle.bailer . treetag-methods . tuldava . types.tokens-methods . wheeler.smith . 
Some associated R codes: 00_environment.R . 01_class_01_kRp.text.R . 01_class_02_kRp.TTR.R . 01_class_03_kRp.corp.freq.R . 01_class_04_kRp.lang.R . 01_class_05_kRp.readability.R . 01_class_81_kRp.connection_union.R . 01_class_90_deprecated_classes.R . 02_method_cTest.R . 02_method_clozeDelete.R . 02_method_correct.R . 02_method_docTermMatrix.R . 02_method_filterByClass.R . 02_method_freq.analysis.R . 02_method_get_set_kRp.text.R . 02_method_hyphen.R . 02_method_jumbleWords.R . 02_method_lex.div.R . 02_method_pasteText.R . 02_method_plot.kRp.text.R . 02_method_query.R . 02_method_read.corp.custom.R . 02_method_readTagged.R . 02_method_readability.R . 02_method_show.kRp.TTR.R . 02_method_show.kRp.corp.freq.R . 02_method_show.kRp.lang.R . 02_method_show.kRp.readability.R . 02_method_show.kRp.text.R . 02_method_split_by_doc_id.R . 02_method_summary.kRp.TTR.R . 02_method_summary.kRp.lang.R . 02_method_summary.kRp.readability.R . 02_method_summary.kRp.text.R . 02_method_textTransform.R . 02_method_tokenize.R . 02_method_treetag.R . 02_method_types_tokens.R . available.koRpus.lang.R . get.kRp.env.R . guess.lang.R . install.koRpus.lang.R . kRp.POS.tags.R . kRp.cluster.R . koRpus-internal.R . koRpus-internal.freq.analysis.R . koRpus-internal.import.R . koRpus-internal.lexdiv.formulae.R . koRpus-internal.rdb.formulae.R . koRpus-internal.rdb.params.grades.R . koRpus-internal.read.corp.custom.R . koRpus-package.R . lex.div.num.R . read.BAWL.R . read.corp.LCC.R . read.corp.celex.R . readability.num.R . segment.optimizer.R . set.kRp.env.R . set.lang.support.R . textFeatures.R . wrapper_functions_lex.div.R . wrapper_functions_readability.R .  Full koRpus package functions and examples
Downloads during the last 30 days
Get rewarded with contribution points by helping add
Reviews / comments / questions /suggestions ↴↴↴

Today's Hot Picks in Authors and Packages

pim  
Fit Probabilistic Index Models
Fit a probabilistic index model as described in Thas et al, 2012: ...
Download / Learn more Package Citations See dependency  
nlpred  
Estimators of Non-Linear Cross-Validated Risks Optimized for Small Samples
Methods for obtaining improved estimates of non-linear cross-validated risks are obtained using targ ...
Download / Learn more Package Citations See dependency  
nextGenShinyApps  
Craft Exceptional 'R Shiny' Applications and Dashboards with Novel Responsive Tools
Nove responsive tools for designing and developing 'Shiny' dashboards and applications. The scripts ...
Download / Learn more Package Citations See dependency  
GEEmediate  
Mediation Analysis for Generalized Linear Models Using the Difference Method
Causal mediation analysis for a single exposure/treatment and a single mediator, both allowed to be ...
Download / Learn more Package Citations See dependency  

23,580

R Packages

204,057

Dependencies

63,980

Author Associations

23,581

Publication Badges

© Copyright since 2022. All right reserved, rpkg.net.  Based in Cambridge, Massachusetts, USA