R package citation, R package reverse dependencies, R package scholars, install an r package from GitHub hy is package acceptance pending why is package undeliverable amazon why is package on hold dhl tour packages why in r package r and r package full form why is r free why r is bad which r package to install which r package has which r package which r package version which r package readxl which r package ggplot which r package fread which r package license where is package.json where is package-lock.json where is package.swift where is package explorer in eclipse where is package where is package manager unity where is package installer android where is package manager console in visual studio who r package which r package to install which r package version who is package who is package deal who is package design r and r package full form r and r package meaning what r package has what package r what is package in java what is package what is package-lock.json what is package in python what is package.json what is package installer do r package can't install r packages r can't find package r can't load package can't load xlsx package r can't install psych package r can't install sf package r Write if else in NONMEM pk pd
sentencepiece
View on CRAN: Click
here
Download and install sentencepiece package within the R console
Install from CRAN:
install.packages("sentencepiece")
Install from Github:
library("remotes")
install_github("cran/sentencepiece") Install by package version:
library("remotes")
install_version("sentencepiece", "0.2.4") Attach the package and use:
library("sentencepiece")
Maintained by
Jan Wijffels
[Scholar Profile | Author Map]
[Scholar Profile | Author Map]
All associated links for this package
First Published: 2020-06-04
Latest Update: 2022-11-13
Description:
Unsupervised text tokenizer allowing to perform byte pair encoding and unigram modelling.
Wraps the 'sentencepiece' library which provides a language independent tokenizer to split text in words and smaller subword units.
The techniques are explained in the paper "SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing" by Taku Kudo and John Richardson (2018) .
Provides as well straightforward access to pretrained byte pair encoding models and subword embeddings trained on Wikipedia using 'word2vec',
as described in "BPEmb: Tokenization-free Pre-trained Subword Embeddings in 275 Languages" by Benjamin Heinzerling and Michael Strube (2018) .
How to cite:
Jan Wijffels (2020). sentencepiece: Text Tokenization using Byte Pair Encoding and Unigram Modelling. R package version 0.2.4, https://cran.r-project.org/web/packages/sentencepiece. Accessed 05 Mar. 2026.
Previous versions and publish date:
Other packages that cited sentencepiece R package
View sentencepiece citation profile
Other R packages that sentencepiece depends,
imports, suggests or enhances
Complete documentation for sentencepiece
Functions, R codes and Examples using
the sentencepiece R package
Some associated functions: BPEembed . BPEembedder . predict.BPEembed . read_word2vec . sentencepiece . sentencepiece_decode . sentencepiece_download_model . sentencepiece_encode . sentencepiece_load_model . txt_remove_ . wordpiece_encode .
Some associated R codes: AAA.R . RcppExports.R . bpemb.R . pkg.R . sentencepiece.R . utils.R . word2vec.R . wordpiece.R . Full sentencepiece package functions and examples
Downloads during the last 30 days
Today's Hot Picks in Authors and Packages
neat
Includes functions and examples to compute NEAT, the Network
Enrichment Analysis Test described in ...
Download / Learn more Package Citations See dependency
Download / Learn more Package Citations See dependency
Maintainer: Mirko Signorelli (view profile)
diffIRT
Package to fit diffusion-based IRT models to response and
response time data. Models are fit using ...
Download / Learn more Package Citations See dependency
Download / Learn more Package Citations See dependency
Maintainer: Dylan Molenaar (view profile)
ClimClass
Classification of climate according to Koeppen - Geiger, of aridity
indices, of continentality indi ...
Download / Learn more Package Citations See dependency
Download / Learn more Package Citations See dependency
Maintainer: Emanuele Eccel (view profile)
nextGenShinyApps
Nove responsive tools for designing and developing 'Shiny' dashboards and applications. The scripts ...
Download / Learn more Package Citations See dependency
Download / Learn more Package Citations See dependency
Maintainer: Obinna Obianom (view profile)
pinp
A 'PNAS'-alike style for 'rmarkdown', derived from the
'Proceedings of the National Academy of Scie ...
Download / Learn more Package Citations See dependency
Download / Learn more Package Citations See dependency
Maintainer: Dirk Eddelbuettel (view profile)
roccv
Cross validate large genetic data while specifying clinical variables that should always be in the m ...
Download / Learn more Package Citations See dependency
Download / Learn more Package Citations See dependency
Maintainer: Ben Sherwood (view profile)
26,264
R Packages
223,360
Dependencies
70,244
Author Associations
26,265
Publication Badges
