Found 57 packages in 0.02 seconds
An Information About Deputies and Votings in Polish Diet from Seventh to Eighth Term of Office
Set of functions that access information about deputies and votings in Polish diet from webpage < http://www.sejm.gov.pl>. The package was developed as a result of an internship in MI2 Group - < http://mi2.mini.pw.edu.pl>, Faculty of Mathematics and Information Science, Warsaw University of Technology.
Resampling Methods for Triangular and Trapezoidal Fuzzy Numbers
The classical (i.e. Efron's, see Efron and Tibshirani (1994, ISBN:978-0412042317) "An Introduction to the Bootstrap") bootstrap is widely used for both the real (i.e. "crisp") and fuzzy data.
The main aim of the algorithms implemented in this package is to overcome a problem with repetition of a few distinct values and to create fuzzy numbers, which are "similar" (but not the same) to values from the initial sample.
To do this, different characteristics of triangular/trapezoidal numbers are kept (like the value, the ambiguity, etc., see Grzegorzewski et al.
Flexible Tool for Bias Detection, Visualization, and Mitigation
Measure fairness metrics in one place for many models. Check how big is model's bias towards different races, sex, nationalities etc. Use measures such as Statistical Parity, Equal odds to detect the discrimination against unprivileged groups. Visualize the bias using heatmap, radar plot, biplot, bar chart (and more!). There are various pre-processing and post-processing bias mitigation algorithms implemented. Package also supports calculating fairness metrics for regression models. Find more details in (Wiśniewski, Biecek (2021))
Arena for the Exploration and Comparison of any ML Models
Generates data for challenging machine learning models in 'Arena' < https://arena.drwhy.ai> - an interactive web application. You can start the server with XAI (Explainable Artificial Intelligence) plots to be generated on-demand or precalculate and auto-upload data file beside shareable 'Arena' URL.
Explaining Correlated Features in Machine Learning Models
Tools for exploring effects of correlated features in predictive
models. The predict_triplot() function delivers instance-level explanations
that calculate the importance of the groups of explanatory variables. The
model_triplot() function delivers data-level explanations. The generic plot
function visualises in a concise way importance of hierarchical groups of
predictors. All of the the tools are model agnostic, therefore works for any
predictive machine learning models. Find more details in Biecek (2018)
Explain Interactions in 'XGBoost'
Structure mining from 'XGBoost' and 'LightGBM' models. Key functionalities of this package cover: visualisation of tree-based ensembles models, identification of interactions, measuring of variable importance, measuring of interaction importance, explanation of single prediction with break down plots (based on 'xgboostExplainer' and 'iBreakDown' packages). To download the 'LightGBM' use the following link: < https://github.com/Microsoft/LightGBM>. 'EIX' is a part of the 'DrWhy.AI' universe.
R Bindings to the 'Fstlib' Library
The 'fstlib' library provides multithreaded serialization of compressed data frames using the 'fst' format. The 'fst' format allows for random access of stored data and compression with the 'LZ4' and 'ZSTD' compressors.
Make Static HTML Website for Predictive Models
Website generator with HTML summaries for predictive models. This package uses 'DALEX' explainers to describe global model behavior. We can see how well models behave (tabs: Model Performance, Auditor), how much each variable contributes to predictions (tabs: Variable Response) and which variables are the most important for a given model (tabs: Variable Importance). We can also compare Concept Drift for pairs of models (tabs: Drifter). Additionally, data available on the website can be easily recreated in current R session. Work on this package was financially supported by the NCN Opus grant 2017/27/B/ST6/01307 at Warsaw University of Technology, Faculty of Mathematics and Information Science.
The Merging Path Plot
The Merging Path Plot is a methodology for adaptive fusing of k-groups
with likelihood-based model selection. This package contains tools for
exploration and visualization of k-group dissimilarities.
Comparison of k-groups is one of the most important issues
in exploratory analyses and it has zillions of applications.
The traditional approach is to use pairwise post hoc tests
in order to verify which groups differ significantly. However,
this approach fails with a large number of groups in both interpretation
and visualization layer.
The Merging Path Plot solves this problem by using an easy-to-understand
description of dissimilarity among groups based on Likelihood Ratio Test (LRT) statistic (Sitko, Biecek 2017)
Cross-Entropy Clustering
Splits data into Gaussian type clusters using the Cross-Entropy
Clustering ('CEC') method. This method allows for the simultaneous use of
various types of Gaussian mixture models, for performing the reduction of
unnecessary clusters, and for discovering new clusters by splitting them.
'CEC' is based on the work of Spurek, P. and Tabor, J. (2014)