
udpipe  

Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the 'UDPipe' 'NLP' Toolkit


Download and install the udpipe package from within the R console
Install from CRAN:
install.packages("udpipe")

Install from GitHub:
library("remotes")
install_github("cran/udpipe")

Install by package version:
library("remotes")
install_version("udpipe", "0.8.11")



Attach the package and use:
library("udpipe")
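
Once the package is attached, a typical first step is to annotate some raw text. The following is a minimal sketch, assuming internet access to fetch a pre-trained English model; the example sentence and the variable names (`model_info`, `model`, `anno`) are illustrative and not part of the package.

```r
library(udpipe)

# Download a pre-trained English model into the working directory.
# Assumption: internet access is available; the download is only needed once.
model_info <- udpipe_download_model(language = "english")
model <- udpipe_load_model(file = model_info$file_model)

# Annotate raw text: tokenization, POS tagging, lemmatization, dependency parsing.
txt <- c("UDPipe tokenizes, tags and parses raw text.")
anno <- udpipe_annotate(model, x = txt, doc_id = "doc1")

# Convert the annotation to a data.frame with one row per token.
anno <- as.data.frame(anno)
anno[, c("doc_id", "token", "lemma", "upos", "head_token_id", "dep_rel")]
```

The convenience function `udpipe()` wraps these steps into a single call and returns the token-level data.frame directly.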
Maintained by
Jan Wijffels
First Published: 2017-09-01
Latest Update: 2023-01-06
Description:
This natural language processing toolkit provides language-agnostic 'tokenization', 'parts of speech tagging', 'lemmatization' and 'dependency parsing' of raw text. Next to text parsing, the package also allows you to train annotation models based on data of 'treebanks' in 'CoNLL-U' format as provided at <https://universaldependencies.org/format.html>. The techniques are explained in detail in the paper: 'Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe', available at <doi:10.18653/v1/K17-3009>. The toolkit also contains functionality for commonly used data manipulations on texts which are enriched with the output of the parser, namely functions and algorithms for collocations, token co-occurrence, document term matrix handling, term frequency inverse document frequency calculations, information retrieval metrics (Okapi BM25), handling of multi-word expressions, keyword detection (Rapid Automatic Keyword Extraction, noun phrase extraction, syntactical patterns), sentiment scoring and semantic similarity analysis.
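
Several of the data manipulations named in the description can be tried without downloading a model, because the package ships an already-annotated example dataset (`brussels_reviews_anno`). The sketch below assumes that this dataset uses Penn-style tags such as "NN" and "JJ" in its `xpos` column, as the package's own examples do; the variable names and the `minfreq` threshold are illustrative choices.

```r
library(udpipe)

# Example data shipped with the package: annotated reviews in several languages.
data(brussels_reviews_anno)
x <- subset(brussels_reviews_anno, language == "nl")

# Keyword detection with RAKE, restricted to nouns and adjectives.
# Assumption: "NN" and "JJ" are the relevant xpos tags in this dataset.
keywords <- keywords_rake(x = x, term = "lemma", group = "doc_id",
                          relevant = x$xpos %in% c("NN", "JJ"))
head(keywords)

# Token co-occurrence: how often two noun lemmas occur in the same review.
nouns <- subset(x, xpos %in% c("NN"))
cooc <- cooccurrence(nouns, group = "doc_id", term = "lemma")
head(cooc)

# Document-term matrix handling and tf-idf.
dtf <- document_term_frequencies(x[, c("doc_id", "lemma")])
dtm <- document_term_matrix(dtf)
dtm <- dtm_remove_lowfreq(dtm, minfreq = 5)  # drop rare terms (threshold is arbitrary)
tfidf <- dtm_tfidf(dtm)
head(sort(tfidf, decreasing = TRUE))
```

The same calls apply to the data.frame produced by annotating your own text, since they only rely on columns such as `doc_id`, `lemma` and `xpos`.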
How to cite:
Jan Wijffels (2017). udpipe: Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the 'UDPipe' 'NLP' Toolkit. R package version 0.8.11, https://cran.r-project.org/web/packages/udpipe. Accessed 10 Jan. 2025.
Previous versions and publish date:
0.1.0 (2017-09-01 19:06), 0.1.1 (2017-09-13 19:25), 0.2 (2017-11-13 11:29), 0.2.1 (2017-12-06 19:15), 0.2.2 (2017-12-07 14:08), 0.3 (2018-01-15 15:45), 0.4 (2018-02-07 14:28), 0.5 (2018-03-12 15:48), 0.6 (2018-05-14 11:25), 0.6.1 (2018-07-30 15:40), 0.7 (2018-09-10 11:50), 0.8 (2018-12-09 16:10), 0.8.1 (2019-02-15 15:00), 0.8.2 (2019-05-29 10:00), 0.8.3 (2019-07-06 00:00), 0.8.4 (2020-10-10 08:30), 0.8.4-1 (2020-10-12 06:40), 0.8.5 (2020-12-10 17:40), 0.8.6 (2021-06-01 15:20), 0.8.8 (2021-12-02 20:30), 0.8.9 (2022-03-24 10:30), 0.8.10 (2022-11-10 16:50)
Functions, R code and examples using the udpipe R package
Some associated functions: as.data.frame.udpipe_connlu . as.matrix.cooccurrence . as_conllu . as_cooccurrence . as_fasttext . as_phrasemachine . as_word2vec . brussels_listings . brussels_reviews . brussels_reviews_anno . brussels_reviews_w2v_embeddings_lemma_nl . cbind_dependencies . cbind_morphological . cooccurrence . document_term_frequencies . document_term_frequencies_statistics . document_term_matrix . dtm_align . dtm_bind . dtm_chisq . dtm_colsums . dtm_conform . dtm_cor . dtm_remove_lowfreq . dtm_remove_sparseterms . dtm_remove_terms . dtm_remove_tfidf . dtm_reverse . dtm_sample . dtm_svd_similarity . dtm_tfidf . keywords_collocation . keywords_phrases . keywords_rake . paste.data.frame . predict.LDA . strsplit.data.frame . syntaxpatterns . syntaxrelation . txt_collapse . txt_contains . txt_context . txt_count . txt_freq . txt_grepl . txt_highlight . txt_next . txt_nextgram . txt_overlap . txt_paste . txt_previous . txt_previousgram . txt_recode . txt_recode_ngram . txt_sample . txt_sentiment . txt_show . txt_tagsequence . udpipe . udpipe_accuracy . udpipe_annotate . udpipe_annotation_params . udpipe_download_model . udpipe_load_model . udpipe_read_conllu . udpipe_train . unique_identifier . unlist_tokens . 
Some associated R code files: RcppExports.R . datasets.R . nlp_collocation.R . nlp_cooccurrence.R . nlp_dependency_parsing.R . nlp_dependency_utlities.R . nlp_flow.R . nlp_morphological.R . nlp_phrase_sequences.R . nlp_rake.R . pkg.R . sentiment.R . topicmodels.R . udpipe_models.R . udpipe_parse.R . udpipe_reconstruct.R . udpipe_train.R . utils.R
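
Many of the `txt_*` helpers in the function list above work on plain character vectors, so they can be explored without a model or annotated data. A small sketch, using a made-up token vector (`tokens`) purely for illustration:

```r
library(udpipe)

# Hypothetical token vector, for illustration only.
tokens <- c("the", "quick", "fox", "sees", "the", "dog")

# Frequency table of the tokens (columns: key, freq, freq_pct).
freq <- txt_freq(tokens)
freq

# Bigrams: each token pasted together with the token that follows it.
# The last element has no following token and is NA.
bigrams <- txt_nextgram(tokens, n = 2, sep = " ")
bigrams

# Recode selected values, leaving all other tokens unchanged.
recoded <- txt_recode(tokens, from = c("fox", "dog"), to = c("animal", "animal"))
recoded
```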
Downloads during the last 30 days
[Chart: daily downloads for udpipe, 12/11 to 01/08, shown as bars with a trend line]

© Copyright since 2022. All rights reserved, rpkg.net. Based in Cambridge, Massachusetts, USA