Hal9001
🤠 📿 The Highly Adaptive Lasso
Install / Use
/learn @tlverse/Hal9001README
R/hal9001
The Scalable Highly Adaptive Lasso
Authors: Jeremy Coyle, Nima Hejazi, Rachael Phillips, Lars van der Laan, and Mark van der Laan
What’s hal9001?
hal9001 is an R package providing an implementation of the scalable
highly adaptive lasso (HAL), a nonparametric regression estimator that
applies L1-regularized lasso regression to a design matrix composed of
indicator functions corresponding to the support of the functional over
a set of covariates and interactions thereof. HAL regression allows for
arbitrarily complex functional forms to be estimated at fast
(near-parametric) convergence rates under only global smoothness
assumptions (van der Laan 2017a; Bibaut and van der Laan 2019). For
detailed theoretical discussions of the highly adaptive lasso estimator,
consider consulting, for example, van der Laan (2017a), van der Laan
(2017b), and van der Laan and Bibaut (2017). For a computational
demonstration of the versatility of HAL regression, see Benkeser and van
der Laan (2016). Recent theoretical works have demonstrated success in
building efficient estimators of complex parameters when particular
variations of HAL regression are used to estimate nuisance parameters
(e.g., van der Laan, Benkeser, and Cai 2019; Ertefaie, Hejazi, and van
der Laan 2020).
Installation
For standard use, we recommend installing the package from CRAN via
install.packages("hal9001")
To contribute, install the development version of hal9001 from
GitHub via remotes:
remotes::install_github("tlverse/hal9001")
Issues
If you encounter any bugs or have any specific feature requests, please file an issue.
Example
Consider the following minimal example in using hal9001 to generate
predictions via Highly Adaptive Lasso regression:
# load the package and set a seed
library(hal9001)
#> Loading required package: Rcpp
#> hal9001 v0.4.6: The Scalable Highly Adaptive Lasso
#> note: fit_hal defaults have changed. See ?fit_hal for details
set.seed(385971)
# simulate data
n <- 100
p <- 3
x <- matrix(rnorm(n * p), n, p)
y <- x[, 1] * sin(x[, 2]) + rnorm(n, mean = 0, sd = 0.2)
# fit the HAL regression
hal_fit <- fit_hal(X = x, Y = y, yolo = TRUE)
#> [1] "I'm sorry, Dave. I'm afraid I can't do that."
hal_fit$times
#> user.self sys.self elapsed user.child sys.child
#> enumerate_basis 0.003 0.000 0.002 0 0
#> design_matrix 0.000 0.000 0.000 0 0
#> reduce_basis 0.000 0.000 0.000 0 0
#> remove_duplicates 0.000 0.000 0.000 0 0
#> lasso 0.362 0.002 0.365 0 0
#> total 0.365 0.002 0.367 0 0
# training sample prediction
preds <- predict(hal_fit, new_data = x)
mean(hal_mse <- (preds - y)^2)
#> [1] 0.03587941
Contributions
Contributions are very welcome. Interested contributors should consult our contribution guidelines prior to submitting a pull request.
Citation
After using the hal9001 R package, please cite both of the following:
@software{coyle2022hal9001-rpkg,
author = {Coyle, Jeremy R and Hejazi, Nima S and Phillips, Rachael V
and {van der Laan}, Lars and {van der Laan}, Mark J},
title = {{hal9001}: The scalable highly adaptive lasso},
year = {2022},
url = {https://doi.org/10.5281/zenodo.3558313},
doi = {10.5281/zenodo.3558313}
note = {{R} package version 0.4.2}
}
@article{hejazi2020hal9001-joss,
author = {Hejazi, Nima S and Coyle, Jeremy R and {van der Laan}, Mark
J},
title = {{hal9001}: Scalable highly adaptive lasso regression in
{R}},
year = {2020},
url = {https://doi.org/10.21105/joss.02526},
doi = {10.21105/joss.02526},
journal = {Journal of Open Source Software},
publisher = {The Open Journal}
}
License
© 2017-2024 Jeremy Coyle and Nima Hejazi
The contents of this repository are distributed under the GPL-3 license.
See file LICENSE for details.
References
<div id="refs" class="references csl-bib-body hanging-indent" entry-spacing="0"> <div id="ref-benkeser2016hal" class="csl-entry">Benkeser, David, and Mark J van der Laan. 2016. “The Highly Adaptive Lasso Estimator.” In 2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA). IEEE. https://doi.org/10.1109/dsaa.2016.93.
</div> <div id="ref-bibaut2019fast" class="csl-entry">Bibaut, Aurélien F, and Mark J van der Laan. 2019. “Fast Rates for Empirical Risk Minimization over Càdlàg Functions with Bounded Sectional Variation Norm.” https://arxiv.org/abs/1907.09244.
</div> <div id="ref-ertefaie2020nonparametric" class="csl-entry">Ertefaie, Ashkan, Nima S Hejazi, and Mark J van der Laan. 2020. “Nonparametric Inverse Probability Weighted Estimators Based on the Highly Adaptive Lasso.” https://arxiv.org/abs/2005.11303.
</div> <div id="ref-vdl2017generally" class="csl-entry">van der Laan, Mark J. 2017a. “A Generally Efficient Targeted Minimum Loss Based Estimator Based on the Highly Adaptive Lasso.” The International Journal of Biostatistics. https://doi.org/10.1515/ijb-2015-0097.
</div> <div id="ref-vdl2017finite" class="csl-entry">———. 2017b. “Finite Sample Inference for Targeted Learning.” https://arxiv.org/abs/1708.09502.
</div> <div id="ref-vdl2019efficient" class="csl-entry">van der Laan, Mark J, David Benkeser, and Weixin Cai. 2019. “Efficient Estimation of Pathwise Differentiable Target Parameters with the Undersmoothed Highly Adaptive Lasso.” https://arxiv.org/abs/1908.05607.
</div> <div id="ref-vdl2017uniform" class="csl-entry">van der Laan, Mark J, and Aurélien F Bibaut. 2017. “Uniform Consistency of the Highly Adaptive Lasso Estimator of Infinite-Dimensional Parameters.” https://arxiv.org/abs/1709.06256.
</div> </div>Related Skills
proje
Interactive vocabulary learning platform with smart flashcards and spaced repetition for effective language acquisition.
YC-Killer
2.7kA library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.
best-practices-researcher
The most comprehensive Claude Code skills registry | Web Search: https://skills-registry-web.vercel.app
groundhog
401Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).
