
AhoCorasickTrie  

Fast Searching for Multiple Keywords in Multiple Texts


Download and install AhoCorasickTrie package within the R console
Install from CRAN:
install.packages("AhoCorasickTrie")

Install from Github:
library("remotes")
install_github("cran/AhoCorasickTrie")

Install by package version:
library("remotes")
install_version("AhoCorasickTrie", "0.1.2")



Attach the package and use:
library("AhoCorasickTrie")
Maintained by
Matt Chambers
First Published: 2016-07-29
Latest Update: 2020-09-29
Description:
Aho-Corasick is an optimal algorithm for finding many keywords in a text. It can locate all matches in a text in O(N+M) time; i.e., the time needed scales linearly with the number of keywords (N) and the size of the text (M). Compare this to the naive approach, which takes O(N*M) time to loop through each pattern and scan for it in the text. This implementation builds the trie (the generic name of the data structure) and runs the search in a single function call. To search multiple texts with the same trie, pass the function a list or vector of texts; it returns a list of matches for each text. By default, all 128 ASCII characters are allowed in both the keywords and the text. A more efficient trie is possible if the alphabet size can be reduced. For example, DNA sequences use at most 19 distinct characters and usually only 4; protein sequences use at most 26 distinct characters and usually only 20. UTF-8 (Unicode) matching is not currently supported.
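The description above can be illustrated with a short sketch. It uses AhoCorasickSearch, one of the functions listed for this package; the argument order (keywords first, then text) and the alphabet = "nucleicacid" option are written from memory of the package documentation and should be treated as assumptions to check against ?AhoCorasickSearch. The keywords and texts are made-up sample data.

```r
# Sketch only: verify argument names and alphabet values with ?AhoCorasickSearch.
library("AhoCorasickTrie")

# Sample keywords (illustrative data, not from the package page)
keywords <- c("Abra", "cadabra", "is", "the", "Magic", "Word")

# One text: the trie is built and the search is run in a single call.
one_text <- AhoCorasickSearch(keywords, "Is Abracadabra the Magic Word?")
str(one_text)  # inspect the returned list of matches

# Several texts: pass a named character vector to get matches per text.
texts <- c(first = "the Magic Word", second = "Abracadabra is it")
many_texts <- AhoCorasickSearch(keywords, texts)
str(many_texts)

# Reduced alphabet (assumed option name) for DNA-only input.
# Keywords and text must then contain only characters from that alphabet.
dna_hits <- AhoCorasickSearch(c("GATTACA", "TATA"),
                              "CCGATTACATATAGG",
                              alphabet = "nucleicacid")
str(dna_hits)
```

Because the trie is rebuilt on every call, passing all texts in one call, as in the second example, is likely cheaper than looping over the texts one at a time.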
How to cite:
Matt Chambers (2016). AhoCorasickTrie: Fast Searching for Multiple Keywords in Multiple Texts. R package version 0.1.2, https://cran.r-project.org/web/packages/AhoCorasickTrie
Previous versions and publish date:
0.1.0 (2016-07-29 06:40)
Functions, R code, and examples for the AhoCorasickTrie R package
Associated functions: AhoCorasickSearch, AhoCorasickSearchList, AhoCorasickTrie
Associated R source files: AhoCorasickTrie.R, RcppExports.R