Examples: visualization, C++, networks, data cleaning, html widgets, ropensci.

Found 59 packages in 0.01 seconds

sejmRP — by Piotr Smuda, 9 years ago

An Information About Deputies and Votings in Polish Diet from Seventh to Eighth Term of Office

Set of functions that access information about deputies and votings in Polish diet from webpage < http://www.sejm.gov.pl>. The package was developed as a result of an internship in MI2 Group - < http://mi2.mini.pw.edu.pl>, Faculty of Mathematics and Information Science, Warsaw University of Technology.

FuzzyResampling — by Maciej Romaniuk, 2 years ago

Resampling Methods for Triangular and Trapezoidal Fuzzy Numbers

The classical (i.e. Efron's, see Efron and Tibshirani (1994, ISBN:978-0412042317) "An Introduction to the Bootstrap") bootstrap is widely used for both the real (i.e. "crisp") and fuzzy data. The main aim of the algorithms implemented in this package is to overcome a problem with repetition of a few distinct values and to create fuzzy numbers, which are "similar" (but not the same) to values from the initial sample. To do this, different characteristics of triangular/trapezoidal numbers are kept (like the value, the ambiguity, etc., see Grzegorzewski et al. , Grzegorzewski et al. (2020) , Grzegorzewski et al. (2020) , Grzegorzewski and Romaniuk (2022) , Romaniuk and Hryniewicz (2019) ). Some additional procedures related to these resampling methods are also provided, like calculation of the Bertoluzza et al.'s distance (aka the mid/spread distance, see Bertoluzza et al. (1995) "On a new class of distances between fuzzy numbers") and estimation of the p-value of the one- and two- sample bootstrapped test for the mean (see Lubiano et al. (2016, )). Additionally, there are procedures which randomly generate trapezoidal fuzzy numbers using some well-known statistical distributions.

arenar — by Piotr Piątyszek, 6 years ago

Arena for the Exploration and Comparison of any ML Models

Generates data for challenging machine learning models in 'Arena' < https://arena.drwhy.ai> - an interactive web application. You can start the server with XAI (Explainable Artificial Intelligence) plots to be generated on-demand or precalculate and auto-upload data file beside shareable 'Arena' URL.

triplot — by Katarzyna Pekala, 6 years ago

Explaining Correlated Features in Machine Learning Models

Tools for exploring effects of correlated features in predictive models. The predict_triplot() function delivers instance-level explanations that calculate the importance of the groups of explanatory variables. The model_triplot() function delivers data-level explanations. The generic plot function visualises in a concise way importance of hierarchical groups of predictors. All of the the tools are model agnostic, therefore works for any predictive machine learning models. Find more details in Biecek (2018) .

EIX — by Szymon Maksymiuk, 4 months ago

Explain Interactions in 'XGBoost'

Structure mining from 'XGBoost' and 'LightGBM' models. Key functionalities of this package cover: visualisation of tree-based ensembles models, identification of interactions, measuring of variable importance, measuring of interaction importance, explanation of single prediction with break down plots (based on 'xgboostExplainer' and 'iBreakDown' packages). To download the 'LightGBM' use the following link: < https://github.com/Microsoft/LightGBM>. 'EIX' is a part of the 'DrWhy.AI' universe.

factorMerger — by Tomasz Mikołajczyk, 7 years ago

The Merging Path Plot

The Merging Path Plot is a methodology for adaptive fusing of k-groups with likelihood-based model selection. This package contains tools for exploration and visualization of k-group dissimilarities. Comparison of k-groups is one of the most important issues in exploratory analyses and it has zillions of applications. The traditional approach is to use pairwise post hoc tests in order to verify which groups differ significantly. However, this approach fails with a large number of groups in both interpretation and visualization layer. The Merging Path Plot solves this problem by using an easy-to-understand description of dissimilarity among groups based on Likelihood Ratio Test (LRT) statistic (Sitko, Biecek 2017) . 'factorMerger' is a part of the 'DrWhy.AI' universe (Biecek 2018) . Work on this package was financially supported by the 'NCN Opus grant 2016/21/B/ST6/02176'.

modelDown — by Kamil Romaszko, 6 years ago

Make Static HTML Website for Predictive Models

Website generator with HTML summaries for predictive models. This package uses 'DALEX' explainers to describe global model behavior. We can see how well models behave (tabs: Model Performance, Auditor), how much each variable contributes to predictions (tabs: Variable Response) and which variables are the most important for a given model (tabs: Variable Importance). We can also compare Concept Drift for pairs of models (tabs: Drifter). Additionally, data available on the website can be easily recreated in current R session. Work on this package was financially supported by the NCN Opus grant 2017/27/B/ST6/01307 at Warsaw University of Technology, Faculty of Mathematics and Information Science.

CEC — by Simon Garnier, 3 months ago

Cross-Entropy Clustering

Splits data into Gaussian type clusters using the Cross-Entropy Clustering ('CEC') method. This method allows for the simultaneous use of various types of Gaussian mixture models, for performing the reduction of unnecessary clusters, and for discovering new clusters by splitting them. 'CEC' is based on the work of Spurek, P. and Tabor, J. (2014) .

randomForestExplainer — by Yue Jiang, 6 years ago

Explaining and Visualizing Random Forests in Terms of Variable Importance

A set of tools to help explain which variables are most important in a random forests. Various variable importance measures are calculated and visualized in different settings in order to get an idea on how their importance changes depending on our criteria (Hemant Ishwaran and Udaya B. Kogalur and Eiran Z. Gorodeski and Andy J. Minn and Michael S. Lauer (2010) , Leo Breiman (2001) ).

hstats — by Michael Mayer, 7 months ago

Interaction Statistics

Fast, model-agnostic implementation of different H-statistics introduced by Jerome H. Friedman and Bogdan E. Popescu (2008) . These statistics quantify interaction strength per feature, feature pair, and feature triple. The package supports multi-output predictions and can account for case weights. In addition, several variants of the original statistics are provided. The shape of the interactions can be explored through partial dependence plots or individual conditional expectation plots. 'DALEX' explainers, meta learners ('mlr3', 'tidymodels', 'caret') and most other models work out-of-the-box.