
Found 1823 packages in 0.05 seconds

hedgedrf — by Elliot Beck, 10 months ago

An Implementation of the Hedged Random Forest Algorithm

This algorithm is described in detail in the paper "Hedging Forecast Combinations With an Application to the Random Forest" by Beck et al. (2024) <https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5032102>. The package provides a function hedgedrf() to train a hedged random forest model on a dataset and a function predict.hedgedrf() to make predictions with the trained model.
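
A minimal usage sketch (assuming hedgedrf() follows a ranger-style formula/data interface; the exact argument names are not shown in the description and may differ):

    library(hedgedrf)
    train <- mtcars[1:22, ]
    test  <- mtcars[23:32, ]
    fit  <- hedgedrf(mpg ~ ., data = train)  # assumed formula interface
    pred <- predict(fit, test)               # dispatches to predict.hedgedrf()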

hypoRF — by Simon Hediger, a year ago

Random Forest Two-Sample Tests

An implementation of Random Forest-based two-sample tests, as introduced in Hediger, Michel, and Naef (2022).
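
A hedged sketch of the test (assuming the main function hypoRF() accepts the two samples as data frames and returns a pvalue component; consult the reference manual for the exact signature):

    library(hypoRF)
    set.seed(1)
    x <- data.frame(matrix(rnorm(100 * 5), ncol = 5))             # sample 1
    y <- data.frame(matrix(rnorm(100 * 5, mean = 0.5), ncol = 5)) # sample 2
    res <- hypoRF(x, y)  # assumed signature: hypoRF(sample1, sample2, ...)
    res$pvalue           # a small p-value suggests the distributions differ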

CompositionalRF — by Michail Tsagris, 4 months ago

Multivariate Random Forest with Compositional Responses

Performs multivariate random forests with compositional responses and Euclidean predictors. The compositional data are first transformed using the additive log-ratio transformation or the alpha-transformation of Tsagris, Preston and Wood (2011), and then the multivariate random forest of Rahman R., Otridge J. and Pal R. (2017) is applied.
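
The additive log-ratio step mentioned above is easy to sketch in base R; this illustrates only the transformation, not the package's own fitting interface:

    # alr: log-ratios of the first D-1 parts against the last part.
    alr <- function(x) log(x[, -ncol(x), drop = FALSE] / x[, ncol(x)])
    comp <- matrix(c(0.2, 0.3, 0.5,
                     0.1, 0.6, 0.3), ncol = 3, byrow = TRUE)
    alr(comp)  # Euclidean coordinates, ready for a multivariate forest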

outqrf — by Tengfei Xu, a year ago

Find the Outlier by Quantile Random Forests

Provides a method to find outliers in data using quantile random forests, as introduced by Meinshausen (2006) <https://dl.acm.org/doi/10.5555/1248547.1248582>. It calls the ranger() function of the 'ranger' package directly to perform model fitting and prediction, and also implements evaluation of the outlier predictions. Compared with standard random forest outlier detection, this method has higher accuracy and stability on large datasets.
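
Since outqrf() builds on ranger()'s quantile regression mode, the underlying idea can be sketched with 'ranger' directly (this illustrates the mechanism, not outqrf's exact interface):

    library(ranger)
    fit <- ranger(Sepal.Length ~ ., data = iris, quantreg = TRUE)
    q <- predict(fit, iris, type = "quantiles",
                 quantiles = c(0.025, 0.975))$predictions
    # Flag observations falling outside their own 95% quantile interval.
    which(iris$Sepal.Length < q[, 1] | iris$Sepal.Length > q[, 2])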

IPMRF — by Irene Epifanio, 4 months ago

Intervention in Prediction Measure for Random Forests

Computes the intervention in prediction measure (IPM) for assessing variable importance in random forests. See details in I. Epifanio (2017).

RFCCA — by Cansu Alakus, 2 years ago

Random Forest with Canonical Correlation Analysis

Random Forest with Canonical Correlation Analysis (RFCCA) is a random forest method for estimating the canonical correlations between two sets of variables, conditional on subject-related covariates. The trees are built with a splitting rule specifically designed to partition the data so as to maximize the canonical-correlation heterogeneity between child nodes. The method is described in Alakus et al. (2021). 'RFCCA' uses the 'randomForestSRC' package (Ishwaran and Kogalur, 2020), frozen at version 2.9.3; its custom splitting rule feature is used to apply the proposed splitting rule. The 'randomForestSRC' package enables 'OpenMP' by default, contingent on support from the target architecture and operating system. 'LAPACK' and 'BLAS' libraries are used for the matrix decompositions.
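
A hedged sketch (assuming the main fitting function rfcca() takes the two variable sets X and Y plus the covariates Z; the argument names here are assumptions, not confirmed by the description):

    library(RFCCA)
    set.seed(1)
    n <- 100
    X <- matrix(rnorm(n * 3), n, 3)          # first variable set
    Y <- matrix(rnorm(n * 2), n, 2)          # second variable set
    Z <- data.frame(age = runif(n, 20, 70))  # subject-related covariates
    fit <- rfcca(X = X, Y = Y, Z = Z, ntree = 100)  # assumed signature
    # Covariate-dependent canonical-correlation estimates live in the
    # returned object; see ?rfcca for the exact components.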

spatialRF — by Blas M. Benito, a month ago

Easy Spatial Modeling with Random Forest

Automatic generation and selection of spatial predictors for Random Forest models fitted to spatially structured data. Spatial predictors are constructed from a distance matrix among training samples using Moran's Eigenvector Maps (MEMs; Dray, Legendre, and Peres-Neto 2006) or the RFsp approach (Hengl et al.). These predictors are used alongside user-supplied explanatory variables in Random Forest models. The package provides functions for model fitting, multicollinearity reduction, interaction identification, hyperparameter tuning, evaluation via spatial cross-validation, and result visualization using partial dependence and interaction plots. Model fitting relies on the 'ranger' package (Wright and Ziegler 2017).
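
A hedged sketch of a spatial model fit (the argument names dependent.variable.name, predictor.variable.names, and distance.matrix follow our reading of the package README and may vary across versions):

    library(spatialRF)
    set.seed(1)
    n  <- 80
    xy <- matrix(runif(n * 2), n, 2)   # sample coordinates
    dm <- as.matrix(dist(xy))          # distance matrix among training samples
    df <- data.frame(x1 = rnorm(n), x2 = rnorm(n))
    df$y <- df$x1 + 0.5 * df$x2 + rnorm(n)
    model <- rf_spatial(               # adds MEM/RFsp spatial predictors
      data = df,
      dependent.variable.name = "y",
      predictor.variable.names = c("x1", "x2"),
      distance.matrix = dm
    )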

varSelRF — by Ramon Diaz-Uriarte, 9 years ago

Variable Selection using Random Forests

Variable selection from random forests using both backwards variable elimination (for the selection of small sets of non-redundant variables) and selection based on the importance spectrum (somewhat similar to scree plots; for the selection of large sets of potentially highly-correlated variables). Main applications are in high-dimensional data (e.g., microarray data, and other genomics and proteomics applications).
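
A short sketch of backwards elimination on a classification problem (the signature varSelRF(xdata, Class, ...) and the selected.vars component follow our reading of the package manual; treat details as assumptions):

    library(varSelRF)
    set.seed(1)
    sel <- varSelRF(iris[, 1:4], iris$Species,
                    ntree = 500, vars.drop.frac = 0.2)
    sel$selected.vars  # variables retained after backwards elimination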

rfVarImpOOB — by Markus Loecher, 4 years ago

Unbiased Variable Importance for Random Forests

Computes a novel variable importance for random forests: impurity reduction importance scores for out-of-bag (OOB) data, complementing the existing in-bag Gini importance. The Gini impurities for in-bag and OOB data are combined in three different ways, after which the information gain is computed at each split. This gain is aggregated for each split variable in a tree and averaged across trees.

RFpredInterval — by Cansu Alakus, 2 years ago

Prediction Intervals with Random Forests and Boosted Forests

Implements various prediction interval methods with random forests and boosted forests. The package has two main functions: pibf(), which produces prediction intervals with boosted forests (PIBF) as described in Alakus et al. (2022), and rfpi(), which builds 15 distinct variations of prediction intervals with random forests (RFPI) as proposed by Roy and Larocque (2020).
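
A hedged sketch of 95% prediction intervals with pibf() (the traindata/testdata argument names and the pred_interval component are assumptions based on our reading of the documentation):

    library(RFpredInterval)
    train <- mtcars[1:22, ]
    test  <- mtcars[23:32, ]
    out <- pibf(mpg ~ ., traindata = train, testdata = test, alpha = 0.05)
    out$pred_interval  # assumed: lower/upper bounds per test observation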