Found 58 packages in 0.01 seconds
Model Audit - Verification, Validation, and Error Analysis
Provides an easy to use unified interface for creating validation plots for any model. The 'auditor' helps to avoid repetitive work consisting of writing code needed to create residual plots. This visualizations allow to asses and compare the goodness of fit, performance, and similarity of models.
Resampling Methods for Triangular and Trapezoidal Fuzzy Numbers
The classical (i.e. Efron's, see Efron and Tibshirani (1994, ISBN:978-0412042317) "An Introduction to the Bootstrap") bootstrap is widely used for both the real (i.e. "crisp") and fuzzy data.
The main aim of the algorithms implemented in this package is to overcome a problem with repetition of a few distinct values and to create fuzzy numbers, which are "similar" (but not the same) to values from the initial sample.
To do this, different characteristics of triangular/trapezoidal numbers are kept (like the value, the ambiguity, etc., see Grzegorzewski et al.
An Information About Deputies and Votings in Polish Diet from Seventh to Eighth Term of Office
Set of functions that access information about deputies and votings in Polish diet from webpage < http://www.sejm.gov.pl>. The package was developed as a result of an internship in MI2 Group - < http://mi2.mini.pw.edu.pl>, Faculty of Mathematics and Information Science, Warsaw University of Technology.
Arena for the Exploration and Comparison of any ML Models
Generates data for challenging machine learning models in 'Arena' < https://arena.drwhy.ai> - an interactive web application. You can start the server with XAI (Explainable Artificial Intelligence) plots to be generated on-demand or precalculate and auto-upload data file beside shareable 'Arena' URL.
Explaining Correlated Features in Machine Learning Models
Tools for exploring effects of correlated features in predictive
models. The predict_triplot() function delivers instance-level explanations
that calculate the importance of the groups of explanatory variables. The
model_triplot() function delivers data-level explanations. The generic plot
function visualises in a concise way importance of hierarchical groups of
predictors. All of the the tools are model agnostic, therefore works for any
predictive machine learning models. Find more details in Biecek (2018)
Explain Interactions in 'XGBoost'
Structure mining from 'XGBoost' and 'LightGBM' models. Key functionalities of this package cover: visualisation of tree-based ensembles models, identification of interactions, measuring of variable importance, measuring of interaction importance, explanation of single prediction with break down plots (based on 'xgboostExplainer' and 'iBreakDown' packages). To download the 'LightGBM' use the following link: < https://github.com/Microsoft/LightGBM>. 'EIX' is a part of the 'DrWhy.AI' universe.
Make Static HTML Website for Predictive Models
Website generator with HTML summaries for predictive models. This package uses 'DALEX' explainers to describe global model behavior. We can see how well models behave (tabs: Model Performance, Auditor), how much each variable contributes to predictions (tabs: Variable Response) and which variables are the most important for a given model (tabs: Variable Importance). We can also compare Concept Drift for pairs of models (tabs: Drifter). Additionally, data available on the website can be easily recreated in current R session. Work on this package was financially supported by the NCN Opus grant 2017/27/B/ST6/01307 at Warsaw University of Technology, Faculty of Mathematics and Information Science.
The Merging Path Plot
The Merging Path Plot is a methodology for adaptive fusing of k-groups
with likelihood-based model selection. This package contains tools for
exploration and visualization of k-group dissimilarities.
Comparison of k-groups is one of the most important issues
in exploratory analyses and it has zillions of applications.
The traditional approach is to use pairwise post hoc tests
in order to verify which groups differ significantly. However,
this approach fails with a large number of groups in both interpretation
and visualization layer.
The Merging Path Plot solves this problem by using an easy-to-understand
description of dissimilarity among groups based on Likelihood Ratio Test (LRT) statistic (Sitko, Biecek 2017)
Cross-Entropy Clustering
Splits data into Gaussian type clusters using the Cross-Entropy
Clustering ('CEC') method. This method allows for the simultaneous use of
various types of Gaussian mixture models, for performing the reduction of
unnecessary clusters, and for discovering new clusters by splitting them.
'CEC' is based on the work of Spurek, P. and Tabor, J. (2014)
Explaining and Visualizing Random Forests in Terms of Variable Importance
A set of tools to help explain which variables are most important in a random forests. Various variable importance measures are calculated and visualized in different settings in order to get an idea on how their importance changes depending on our criteria (Hemant Ishwaran and Udaya B. Kogalur and Eiran Z. Gorodeski and Andy J. Minn and Michael S. Lauer (2010)