Fork of varPro dev version at `kogalur/varPro`

ehrlinger/varPro

varPro - Model independent variable selection via rule based variable priority

VarPro is a new framework for variable selection. Instead of generating artificial covariates, as permutation importance and knockoffs do, it creates release rules to examine the effect of each covariate on the response. The conditional distribution of the response variable can be arbitrary and unknown.

While achieving high prediction accuracy is a fundamental goal in machine learning, an equally important task is finding a small number of features with high explanatory power. One popular selection technique is permutation importance, which assesses a variable’s impact by measuring the change in prediction error after permuting the variable.
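To make the permutation idea concrete, here is a minimal sketch in base R. It is not part of varPro and is not how VarPro works; it only illustrates the baseline technique described above. The linear model, the built-in `mtcars` data, the in-sample mean squared error, and the helper names `mse` and `perm_importance` are all assumptions chosen for illustration.

```r
# Illustrative permutation importance (NOT the VarPro method).
# Assumptions: a linear model on the built-in mtcars data, with
# in-sample mean squared error as the prediction error.
set.seed(1)

fit <- lm(mpg ~ wt + hp + qsec, data = mtcars)

# Mean squared prediction error of a fitted model on a data frame.
mse <- function(model, data) {
  mean((data$mpg - predict(model, newdata = data))^2)
}

baseline <- mse(fit, mtcars)

# Average increase in error after shuffling one variable, over n_rep shuffles.
perm_importance <- function(var, n_rep = 50) {
  increases <- replicate(n_rep, {
    shuffled <- mtcars
    shuffled[[var]] <- sample(shuffled[[var]])  # break the link to the response
    mse(fit, shuffled) - baseline
  })
  mean(increases)
}

vars <- c("wt", "hp", "qsec")
imp <- sapply(vars, perm_importance)

# Larger values mean the model relies more heavily on that variable.
print(sort(imp, decreasing = TRUE))
```

Each shuffle creates an artificial copy of the data and requires re-evaluating prediction error, which is the overhead that a rule-based approach is designed to avoid.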

Variable Priority (VarPro) works by utilizing rules without the need to generate artificial data or evaluate prediction error. The method only requires the calculation of sample averages of simple statistics, and can be applied to many data settings, including regression, classification, and survival.

Installation

Install the development version into your R environment using the devtools package:

install.packages("devtools") # If you don't have it.
devtools::install_github("kogalur/varpro")

References

Lu M., Ishwaran H. Model-independent variable selection via the rule-based variable priority. arXiv:2409.09003 [stat.ML].
