
udpipe  

Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the 'UDPipe' 'NLP' Toolkit


Download and install the udpipe package from within the R console
Install from CRAN:
install.packages("udpipe")

Install from GitHub:
library("remotes")
install_github("cran/udpipe")

Install by package version:
library("remotes")
install_version("udpipe", "0.8.11")



Attach the package and use:
library("udpipe")
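Once the package is attached, a basic annotation run might look like the sketch below. It assumes an internet connection the first time a pre-trained English model is fetched; the helper name `annotate_english` is hypothetical and not part of the package.

```r
library(udpipe)

# Hypothetical helper (not part of udpipe): download a pre-trained English
# model into `model_dir`, load it, and annotate a character vector.
annotate_english <- function(txt, model_dir = tempdir()) {
  # Needs an internet connection the first time the model is fetched.
  info  <- udpipe_download_model(language = "english", model_dir = model_dir)
  model <- udpipe_load_model(file = info$file_model)
  anno  <- udpipe_annotate(model, x = txt,
                           doc_id = paste0("doc", seq_along(txt)))
  # One row per token, with columns such as token, lemma, upos and dep_rel.
  as.data.frame(anno)
}
```

Calling `annotate_english("The quick brown fox jumps over the lazy dog.")` should return one row per token; the exact tags depend on the model version that gets downloaded.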
Maintained by
Jan Wijffels
First Published: 2017-09-01
Latest Update: 2023-01-06
Description:
This natural language processing toolkit provides language-agnostic 'tokenization', 'parts of speech tagging', 'lemmatization' and 'dependency parsing' of raw text. Next to text parsing, the package also allows you to train annotation models based on data of 'treebanks' in 'CoNLL-U' format as provided at <https://universaldependencies.org/format.html>. The techniques are explained in detail in the paper: 'Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe', available at <doi:10.18653/v1/K17-3009>. The toolkit also contains functionalities for commonly used data manipulations on texts which are enriched with the output of the parser: functionalities and algorithms for collocations, token co-occurrence, document term matrix handling, term frequency inverse document frequency calculations, information retrieval metrics (Okapi BM25), handling of multi-word expressions, keyword detection (Rapid Automatic Keyword Extraction, noun phrase extraction, syntactical patterns), sentiment scoring and semantic similarity analysis.
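Some of the data manipulations named in the description can be tried offline on the annotated example dataset that ships with the package. The sketch below is illustrative only: the choice of Dutch reviews and of the "NN"/"JJ" tags is an assumption made for the example, not a recommendation from the package author.

```r
library(udpipe)

# Bundled example data: already-annotated reviews, one row per token.
data(brussels_reviews_anno)
x <- subset(brussels_reviews_anno, language == "nl")

# Keyword detection with RAKE, restricted to nouns and adjectives.
rake <- keywords_rake(x = x, term = "lemma", group = "doc_id",
                      relevant = x$xpos %in% c("NN", "JJ"))

# Token co-occurrence of noun/adjective lemmas within the same sentence.
cooc <- cooccurrence(subset(x, xpos %in% c("NN", "JJ")),
                     term = "lemma", group = c("doc_id", "sentence_id"))

# Document-term matrix and a tf-idf score per term.
dtf   <- document_term_frequencies(x[, c("doc_id", "lemma")])
dtm   <- document_term_matrix(dtf)
tfidf <- dtm_tfidf(dtm)

head(rake)
head(cooc)
```

The resulting `rake` and `cooc` objects are data frames, and `dtm` is a sparse matrix, so they can be passed on to the usual filtering, plotting or modelling steps.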
How to cite:
Jan Wijffels (2017). udpipe: Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the 'UDPipe' 'NLP' Toolkit. R package version 0.8.11, https://cran.r-project.org/web/packages/udpipe. Accessed 06 Apr. 2025.
Previous versions and publish date:
0.1.0 (2017-09-01 19:06), 0.1.1 (2017-09-13 19:25), 0.2 (2017-11-13 11:29), 0.2.1 (2017-12-06 19:15), 0.2.2 (2017-12-07 14:08), 0.3 (2018-01-15 15:45), 0.4 (2018-02-07 14:28), 0.5 (2018-03-12 15:48), 0.6 (2018-05-14 11:25), 0.6.1 (2018-07-30 15:40), 0.7 (2018-09-10 11:50), 0.8 (2018-12-09 16:10), 0.8.1 (2019-02-15 15:00), 0.8.2 (2019-05-29 10:00), 0.8.3 (2019-07-06 00:00), 0.8.4 (2020-10-10 08:30), 0.8.4-1 (2020-10-12 06:40), 0.8.5 (2020-12-10 17:40), 0.8.6 (2021-06-01 15:20), 0.8.8 (2021-12-02 20:30), 0.8.9 (2022-03-24 10:30), 0.8.10 (2022-11-10 16:50)
Functions, R codes and Examples using the udpipe R package
Some associated functions: as.data.frame.udpipe_connlu . as.matrix.cooccurrence . as_conllu . as_cooccurrence . as_fasttext . as_phrasemachine . as_word2vec . brussels_listings . brussels_reviews . brussels_reviews_anno . brussels_reviews_w2v_embeddings_lemma_nl . cbind_dependencies . cbind_morphological . cooccurrence . document_term_frequencies . document_term_frequencies_statistics . document_term_matrix . dtm_align . dtm_bind . dtm_chisq . dtm_colsums . dtm_conform . dtm_cor . dtm_remove_lowfreq . dtm_remove_sparseterms . dtm_remove_terms . dtm_remove_tfidf . dtm_reverse . dtm_sample . dtm_svd_similarity . dtm_tfidf . keywords_collocation . keywords_phrases . keywords_rake . paste.data.frame . predict.LDA . strsplit.data.frame . syntaxpatterns . syntaxrelation . txt_collapse . txt_contains . txt_context . txt_count . txt_freq . txt_grepl . txt_highlight . txt_next . txt_nextgram . txt_overlap . txt_paste . txt_previous . txt_previousgram . txt_recode . txt_recode_ngram . txt_sample . txt_sentiment . txt_show . txt_tagsequence . udpipe . udpipe_accuracy . udpipe_annotate . udpipe_annotation_params . udpipe_download_model . udpipe_load_model . udpipe_read_conllu . udpipe_train . unique_identifier . unlist_tokens . 
Some associated R source files: RcppExports.R . datasets.R . nlp_collocation.R . nlp_cooccurrence.R . nlp_dependency_parsing.R . nlp_dependency_utlities.R . nlp_flow.R . nlp_morphological.R . nlp_phrase_sequences.R . nlp_rake.R . pkg.R . sentiment.R . topicmodels.R . udpipe_models.R . udpipe_parse.R . udpipe_reconstruct.R . udpipe_train.R . utils.R
Downloads during the last 30 days
[Chart: daily downloads for udpipe, 03/07 to 04/05; y-axis from 80 to 360]
