Package: sbo 0.5.0

Valerio Gherardi

sbo: Text Prediction via Stupid Back-Off N-Gram Models

Utilities for training and evaluating text predictors based on Stupid Back-Off N-gram models (Brants et al., 2007, <https://www.aclweb.org/anthology/D07-1090/>).
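
As a rough illustration of the scoring rule behind Stupid Back-Off (Brants et al., 2007), the base R sketch below scores a candidate word by its relative k-gram frequency and, when the k-gram was never observed, backs off to a shorter context with a fixed penalty. It is a toy written for this page, not the package's implementation: the helper names `kgram_counts`, `get_count`, and `sbo_score` are hypothetical, tokens are assumed to contain no spaces, and `lambda = 0.4` is the back-off factor suggested in the cited paper.

```r
# Toy Stupid Back-Off scorer in base R (illustrative only; hypothetical helpers,
# not functions exported by the sbo package).

# Count every k-gram for k = 1..N; keys are tokens joined by single spaces.
kgram_counts <- function(tokens, N) {
  grams <- character(0)
  for (k in seq_len(min(N, length(tokens)))) {
    starts <- seq_len(length(tokens) - k + 1)
    grams <- c(grams, vapply(
      starts,
      function(i) paste(tokens[i:(i + k - 1)], collapse = " "),
      character(1)
    ))
  }
  tab <- table(grams)
  setNames(as.integer(tab), names(tab))
}

# Look up a count, treating unseen k-grams as zero.
get_count <- function(counts, key) {
  n <- unname(counts[key])
  if (is.na(n)) 0 else n
}

# Score of `word` given `context` (a character vector of preceding tokens).
sbo_score <- function(word, context, counts, total, lambda = 0.4) {
  if (length(context) == 0) {
    # Base case: unigram relative frequency.
    return(get_count(counts, word) / total)
  }
  n_full <- get_count(counts, paste(c(context, word), collapse = " "))
  if (n_full > 0) {
    # Observed k-gram: relative frequency within this context.
    return(n_full / get_count(counts, paste(context, collapse = " ")))
  }
  # Unseen k-gram: drop the oldest context word and apply the penalty.
  lambda * sbo_score(word, context[-1], counts, total, lambda)
}

tokens <- c("the", "cat", "sat", "on", "the", "mat")
counts <- kgram_counts(tokens, N = 3)
total  <- length(tokens)

sbo_score("mat", c("on", "the"), counts, total)  # trigram observed: 1
sbo_score("cat", c("on", "the"), counts, total)  # one back-off step: 0.4 * 1/2 = 0.2
```

The scores are not normalized probabilities, which is the defining simplification of the method: they are only used to rank candidate next words.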

Authors: Valerio Gherardi

sbo_0.5.0.tar.gz
sbo_0.5.0.zip (r-4.7), sbo_0.5.0.zip (r-4.6), sbo_0.5.0.zip (r-4.5)
sbo_0.5.0.tgz (r-4.6-x86_64), sbo_0.5.0.tgz (r-4.6-arm64), sbo_0.5.0.tgz (r-4.5-x86_64), sbo_0.5.0.tgz (r-4.5-arm64)
sbo_0.5.0.tar.gz (r-4.7-arm64), sbo_0.5.0.tar.gz (r-4.7-x86_64), sbo_0.5.0.tar.gz (r-4.6-arm64), sbo_0.5.0.tar.gz (r-4.6-x86_64)
sbo_0.5.0.tgz (r-4.6-emscripten)
manual.pdf | manual.html
card.svg | card.png
sbo/json (API)
NEWS

# Install 'sbo' in R:
install.packages('sbo', repos = c('https://vgherard.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker: https://github.com/vgherard/sbo/issues

Pkgdown/docs site: https://vgherard.github.io

Uses libs:
  • c++ - GNU Standard C++ Library v3

Topics: natural-language-processing, ngram-models, predictive-text, sbo, cpp

2.74 score, 10 stars, 11 scripts, 33 downloads, 17 exports, 37 dependencies

Last updated from: 75374a5bf5 (on v0.5.0). Checks: 12 ERROR, 1 OK. Indexed: yes.

Target                 Result  Time
linux-devel-arm64      ERROR   206
linux-devel-x86_64     ERROR   203
source / vignettes     ERROR   258
linux-release-arm64    ERROR   184
linux-release-x86_64   ERROR   178
macos-release-arm64    ERROR   117
macos-release-x86_64   ERROR   289
macos-oldrel-arm64     ERROR   113
macos-oldrel-x86_64    ERROR   215
windows-devel          ERROR   205
windows-release        ERROR   172
windows-oldrel         ERROR   144
wasm-release           OK      133

Exports: as_sbo_dictionary, babble, dictionary, eval_sbo_predictor, kgram_freqs, kgram_freqs_fast, predictor, predtable, preprocess, prune, sbo_dictionary, sbo_kgram_freqs, sbo_kgram_freqs_fast, sbo_predictor, sbo_predtable, tokenize_sentences, word_coverage

Dependencies: brio, callr, cli, cpp11, crayon, desc, diffobj, dplyr, evaluate, fs, generics, glue, jsonlite, lifecycle, magrittr, pillar, pkgbuild, pkgconfig, pkgload, praise, processx, ps, purrr, R6, Rcpp, rlang, rprojroot, stringi, stringr, testthat, tibble, tidyr, tidyselect, utf8, vctrs, waldo, withr

Readme and manuals

Help Manual

Help page | Topics
Coerce to dictionary | as_sbo_dictionary as_sbo_dictionary.character
Babble! | babble
Evaluate Stupid Back-off next-word predictions | eval_sbo_predictor
k-gram frequency tables | kgram_freqs kgram_freqs_fast sbo_kgram_freqs sbo_kgram_freqs_fast
Plot method for word_coverage objects | plot.word_coverage
Predict method for k-gram frequency tables | predict.sbo_kgram_freqs
Predict method for Stupid Back-off text predictor | predict.sbo_predictor
Preprocess text corpus | preprocess
Prune k-gram objects | prune prune.sbo_kgram_freqs prune.sbo_predtable
Dictionaries | dictionary sbo_dictionary
Stupid Back-off text predictions | predictor predtable sbo_predictions sbo_predictor sbo_predictor.character sbo_predictor.sbo_kgram_freqs sbo_predictor.sbo_predtable sbo_predtable sbo_predtable.character sbo_predtable.sbo_kgram_freqs
Sentence tokenizer | tokenize_sentences
Top 1000 dictionary from Twitter training set | twitter_dict
k-gram frequencies from Twitter training set | twitter_freqs
Next-word prediction tables from 3-gram model trained on Twitter training set | twitter_predtable
Twitter test set | twitter_test
Twitter training set | twitter_train
Word coverage fraction | word_coverage word_coverage.character word_coverage.sbo_dictionary word_coverage.sbo_kgram_freqs word_coverage.sbo_predictions