Prediction: Difference between revisions
Jump to navigation
Jump to search
No edit summary |
|||
Line 1: | Line 1: | ||
= Feature selection = | |||
* [http://r-statistics.co/Variable-Selection-and-Importance-With-R.html Feature Selection Approaches] from http://r-statistics.co | |||
== Boruta == | |||
* [https://towardsdatascience.com/boruta-explained-the-way-i-wish-someone-explained-it-to-me-4489d70e154a Boruta Explained Exactly How You Wished Someone Explained to You] | |||
* [https://www.r-bloggers.com/2021/05/random-forest-feature-selection/ Random Forest Feature Selection] | |||
= Random forest = | = Random forest = | ||
* https://en.wikipedia.org/wiki/Random_forest | * https://en.wikipedia.org/wiki/Random_forest | ||
Line 14: | Line 21: | ||
** [https://avikarn.com/2020-06-26-MachineLearning_rf_glmnet/ Utilizing Machine Learning algorithms (GLMnet and Random Forest models) for Genomic Prediction of a Quantitative trait] | ** [https://avikarn.com/2020-06-26-MachineLearning_rf_glmnet/ Utilizing Machine Learning algorithms (GLMnet and Random Forest models) for Genomic Prediction of a Quantitative trait] | ||
* [https://link.springer.com/article/10.1186/1471-2105-7-3 Gene selection and classification of microarray data using random forest] 2006, and the R package [https://cran.r-project.org/web/packages/varSelRF/ varSelRF] | * [https://link.springer.com/article/10.1186/1471-2105-7-3 Gene selection and classification of microarray data using random forest] 2006, and the R package [https://cran.r-project.org/web/packages/varSelRF/ varSelRF] | ||
* [https://www.r-bloggers.com/2021/07/feature-importance-in-random-forest/ Feature Importance in Random Forest] | |||
= Gradient boost = | = Gradient boost = |
Revision as of 10:16, 10 October 2022
Feature selection
Boruta
Random forest
- https://en.wikipedia.org/wiki/Random_forest
- randomForest package
- Error: protect(): protection stack overflow. The trick works on my data with 26 obs and 28110 variables.
- A complete guide to Random Forest in R
- 8.5 Permutation Feature Importance from Interpretable Machine Learning by Christoph Molnar
- 15 Variable Importance from caret
- How to find the most important variables in R
- ?importance
- Random Forest Variable Importance and a simple example with 2 correlated predictors.
- If you just print the importance object from the model they are the raw importance values. However, when you use the importance function, the default for the scale argument is TRUE which returns the importance values divided by the standard error.
- Measures of variable importance in random forests
- Difference between varImp (caret) and importance (randomForest) for Random Forest
- Utilizing Machine Learning algorithms (GLMnet and Random Forest models) for Genomic Prediction of a Quantitative trait
- Gene selection and classification of microarray data using random forest 2006, and the R package varSelRF
- Feature Importance in Random Forest