Other packages > Find by keyword >

udpipe  

Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the 'UDPipe' 'NLP' Toolkit
View on CRAN: Click here


Download and install udpipe package within the R console
Install from CRAN:
install.packages("udpipe")

Install from Github:
library("remotes")
install_github("cran/udpipe")

Install by package version:
library("remotes")
install_version("udpipe", "0.8.11")



Attach the package and use:
library("udpipe")
Maintained by
Jan Wijffels
[Scholar Profile | Author Map]
First Published: 2017-09-01
Latest Update: 2023-01-06
Description:
This natural language processing toolkit provides language-agnostic 'tokenization', 'parts of speech tagging', 'lemmatization' and 'dependency parsing' of raw text. Next to text parsing, the package also allows you to train annotation models based on data of 'treebanks' in 'CoNLL-U' format as provided at <https://universaldependencies.org/format.html>. The techniques are explained in detail in the paper: 'Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe', available at <doi:10.18653/v1/K17-3009>. The toolkit also contains functionalities for commonly used data manipulations on texts which are enriched with the output of the parser. Namely functionalities and algorithms for collocations, token co-occurrence, document term matrix handling, term frequency inverse document frequency calculations, information retrieval metrics (Okapi BM25), handling of multi-word expressions, keyword detection (Rapid Automatic Keyword Extraction, noun phrase extraction, syntactical patterns) sentiment scoring and semantic similarity analysis.
How to cite:
Jan Wijffels (2017). udpipe: Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the 'UDPipe' 'NLP' Toolkit. R package version 0.8.11, https://cran.r-project.org/web/packages/udpipe. Accessed 29 Apr. 2025.
Previous versions and publish date:
0.1.0 (2017-09-01 19:06), 0.1.1 (2017-09-13 19:25), 0.2.1 (2017-12-06 19:15), 0.2.2 (2017-12-07 14:08), 0.2 (2017-11-13 11:29), 0.3 (2018-01-15 15:45), 0.4 (2018-02-07 14:28), 0.5 (2018-03-12 15:48), 0.6.1 (2018-07-30 15:40), 0.6 (2018-05-14 11:25), 0.7 (2018-09-10 11:50), 0.8.1 (2019-02-15 15:00), 0.8.2 (2019-05-29 10:00), 0.8.3 (2019-07-06 00:00), 0.8.4-1 (2020-10-12 06:40), 0.8.4 (2020-10-10 08:30), 0.8.5 (2020-12-10 17:40), 0.8.6 (2021-06-01 15:20), 0.8.8 (2021-12-02 20:30), 0.8.9 (2022-03-24 10:30), 0.8.10 (2022-11-10 16:50), 0.8 (2018-12-09 16:10)
Other packages that cited udpipe R package
View udpipe citation profile
Other R packages that udpipe depends, imports, suggests or enhances
Complete documentation for udpipe
Functions, R codes and Examples using the udpipe R package
Some associated functions: as.data.frame.udpipe_connlu . as.matrix.cooccurrence . as_conllu . as_cooccurrence . as_fasttext . as_phrasemachine . as_word2vec . brussels_listings . brussels_reviews . brussels_reviews_anno . brussels_reviews_w2v_embeddings_lemma_nl . cbind_dependencies . cbind_morphological . cooccurrence . document_term_frequencies . document_term_frequencies_statistics . document_term_matrix . dtm_align . dtm_bind . dtm_chisq . dtm_colsums . dtm_conform . dtm_cor . dtm_remove_lowfreq . dtm_remove_sparseterms . dtm_remove_terms . dtm_remove_tfidf . dtm_reverse . dtm_sample . dtm_svd_similarity . dtm_tfidf . keywords_collocation . keywords_phrases . keywords_rake . paste.data.frame . predict.LDA . strsplit.data.frame . syntaxpatterns . syntaxrelation . txt_collapse . txt_contains . txt_context . txt_count . txt_freq . txt_grepl . txt_highlight . txt_next . txt_nextgram . txt_overlap . txt_paste . txt_previous . txt_previousgram . txt_recode . txt_recode_ngram . txt_sample . txt_sentiment . txt_show . txt_tagsequence . udpipe . udpipe_accuracy . udpipe_annotate . udpipe_annotation_params . udpipe_download_model . udpipe_load_model . udpipe_read_conllu . udpipe_train . unique_identifier . unlist_tokens . 
Some associated R codes: RcppExports.R . datasets.R . nlp_collocation.R . nlp_cooccurrence.R . nlp_dependency_parsing.R . nlp_dependency_utlities.R . nlp_flow.R . nlp_morphological.R . nlp_phrase_sequences.R . nlp_rake.R . pkg.R . sentiment.R . topicmodels.R . udpipe_models.R . udpipe_parse.R . udpipe_reconstruct.R . udpipe_train.R . utils.R .  Full udpipe package functions and examples
Downloads during the last 30 days
03/3003/3104/0104/0204/0304/0404/0504/0604/0704/0804/0904/1004/1104/1204/1304/1404/1504/1604/1704/1804/1904/2004/2104/2204/2304/2404/2504/2604/2704/28Downloads for udpipe100120140160180200220240260280300320340360TrendBars

Today's Hot Picks in Authors and Packages

NBBDesigns  
Neighbour Balanced Block Designs (NBBDesigns)
Neighbour-balanced designs ensure that no treatment is disadvantaged unfairly by its surroundings. T ...
Download / Learn more Package Citations See dependency  
bigGP  
Distributed Gaussian Process Calculations
Distributes Gaussian process calculations across nodes in a distributed memory setting, using Rmpi. ...
Download / Learn more Package Citations See dependency  
quickcode  
Quick and Essential 'R' Tricks for Better Scripts
The NOT functions, 'R' tricks and a compilation of some simple quick plus often used 'R' codes to im ...
Download / Learn more Package Citations See dependency  
PCAmatchR  
Match Cases to Controls Based on Genotype Principal Components
Matches cases to controls based on genotype principal components (PC). In order to produce better r ...
Download / Learn more Package Citations See dependency  
knockoff  
The Knockoff Filter for Controlled Variable Selection
The knockoff filter is a general procedure for controlling the false discovery rate (FDR) when perf ...
Download / Learn more Package Citations See dependency  
phecodemap  
Visualization for PheCode Mapping with ICD-9 and ICD-10-CM Codes
To build a shiny app for visualization of the hierarchy of PheCode Mapping with International Classi ...
Download / Learn more Package Citations See dependency  

24,142

R Packages

207,311

Dependencies

65,176

Author Associations

24,143

Publication Badges

© Copyright since 2022. All right reserved, rpkg.net.  Based in Cambridge, Massachusetts, USA