vtreat: A Statistically Sound 'data.frame' Processor/Conditioner

A 'data.frame' processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. 'vtreat' prepares variables so that data has fewer exceptional cases, making it easier to safely use models in production. Common problems 'vtreat' defends against: 'Inf', 'NA', too many categorical levels, rare categorical levels, and new categorical levels (levels seen during application, but not during training). Reference: "'vtreat': a data.frame Processor for Predictive Modeling", Zumel, Mount, 2016, <doi:10.5281/zenodo.1173313>.

Version: 1.6.5
Depends: R (≥ 3.4.0), wrapr (≥ 2.1.0)
Imports: stats, digest
Suggests: rquery (≥ 1.4.99), rqdatatable (≥ 1.3.3), data.table (≥ 1.12.2), knitr, rmarkdown, parallel, DBI, RSQLite, datasets, R.rsp, tinytest
Published: 2024-06-12
DOI: 10.32614/CRAN.package.vtreat
Author: John Mount [aut, cre], Nina Zumel [aut], Win-Vector LLC [cph]
Maintainer: John Mount <jmount at win-vector.com>
BugReports: https://github.com/WinVector/vtreat/issues
License: GPL-2 | GPL-3
URL: https://github.com/WinVector/vtreat/, https://winvector.github.io/vtreat/
NeedsCompilation: no
Materials: README, NEWS
CRAN checks: vtreat results

Documentation:

Reference manual: vtreat.html , vtreat.pdf
Vignettes: Multi Class vtreat (source, R code)
Saving Treatment Plans (source, R code)
vtreat Variable Importance (source, R code)
vtreat package (source, R code)
vtreat cross frames (source, R code)
vtreat grouping example (source, R code)
vtreat overfit (source, R code)
vtreat Rare Levels (source, R code)
vtreat scale mode (source, R code)
vtreat significance (source, R code)
vtreat data splitting (source, R code)
Variable Types (source, R code)
vtreat Formal Article (source)

Downloads:

Package source: vtreat_1.6.5.tar.gz
Windows binaries: r-devel: vtreat_1.6.5.zip, r-release: vtreat_1.6.5.zip, r-oldrel: vtreat_1.6.5.zip
macOS binaries: r-release (arm64): vtreat_1.6.5.tgz, r-oldrel (arm64): vtreat_1.6.5.tgz, r-release (x86_64): vtreat_1.6.5.tgz, r-oldrel (x86_64): vtreat_1.6.5.tgz
Old sources: vtreat archive

Reverse dependencies:

Reverse imports: crispRdesignR
Reverse suggests: mlr3pipelines

Linking:

Please use the canonical form https://CRAN.R-project.org/package=vtreat to link to this page.