Examples: visualization, C++, networks, data cleaning, html widgets, ropensci.

Found 57 packages in 0.02 seconds

sejmRP — by Piotr Smuda, 8 years ago

An Information About Deputies and Votings in Polish Diet from Seventh to Eighth Term of Office

Set of functions that access information about deputies and votings in Polish diet from webpage < http://www.sejm.gov.pl>. The package was developed as a result of an internship in MI2 Group - < http://mi2.mini.pw.edu.pl>, Faculty of Mathematics and Information Science, Warsaw University of Technology.

FuzzyResampling — by Maciej Romaniuk, 9 months ago

Resampling Methods for Triangular and Trapezoidal Fuzzy Numbers

The classical (i.e. Efron's, see Efron and Tibshirani (1994, ISBN:978-0412042317) "An Introduction to the Bootstrap") bootstrap is widely used for both the real (i.e. "crisp") and fuzzy data. The main aim of the algorithms implemented in this package is to overcome a problem with repetition of a few distinct values and to create fuzzy numbers, which are "similar" (but not the same) to values from the initial sample. To do this, different characteristics of triangular/trapezoidal numbers are kept (like the value, the ambiguity, etc., see Grzegorzewski et al. , Grzegorzewski et al. (2020) , Grzegorzewski et al. (2020) , Grzegorzewski and Romaniuk (2022) , Romaniuk and Hryniewicz (2019) ). Some additional procedures related to these resampling methods are also provided, like calculation of the Bertoluzza et al.'s distance (aka the mid/spread distance, see Bertoluzza et al. (1995) "On a new class of distances between fuzzy numbers") and estimation of the p-value of the one- and two- sample bootstrapped test for the mean (see Lubiano et al. (2016, )). Additionally, there are procedures which randomly generate trapezoidal fuzzy numbers using some well-known statistical distributions.

fairmodels — by Jakub Wiśniewski, 3 years ago

Flexible Tool for Bias Detection, Visualization, and Mitigation

Measure fairness metrics in one place for many models. Check how big is model's bias towards different races, sex, nationalities etc. Use measures such as Statistical Parity, Equal odds to detect the discrimination against unprivileged groups. Visualize the bias using heatmap, radar plot, biplot, bar chart (and more!). There are various pre-processing and post-processing bias mitigation algorithms implemented. Package also supports calculating fairness metrics for regression models. Find more details in (Wiśniewski, Biecek (2021)) .

arenar — by Piotr Piątyszek, 5 years ago

Arena for the Exploration and Comparison of any ML Models

Generates data for challenging machine learning models in 'Arena' < https://arena.drwhy.ai> - an interactive web application. You can start the server with XAI (Explainable Artificial Intelligence) plots to be generated on-demand or precalculate and auto-upload data file beside shareable 'Arena' URL.

triplot — by Katarzyna Pekala, 5 years ago

Explaining Correlated Features in Machine Learning Models

Tools for exploring effects of correlated features in predictive models. The predict_triplot() function delivers instance-level explanations that calculate the importance of the groups of explanatory variables. The model_triplot() function delivers data-level explanations. The generic plot function visualises in a concise way importance of hierarchical groups of predictors. All of the the tools are model agnostic, therefore works for any predictive machine learning models. Find more details in Biecek (2018) .

EIX — by Szymon Maksymiuk, 4 years ago

Explain Interactions in 'XGBoost'

Structure mining from 'XGBoost' and 'LightGBM' models. Key functionalities of this package cover: visualisation of tree-based ensembles models, identification of interactions, measuring of variable importance, measuring of interaction importance, explanation of single prediction with break down plots (based on 'xgboostExplainer' and 'iBreakDown' packages). To download the 'LightGBM' use the following link: < https://github.com/Microsoft/LightGBM>. 'EIX' is a part of the 'DrWhy.AI' universe.

fstcore — by Mark Klik, 5 months ago

R Bindings to the 'Fstlib' Library

The 'fstlib' library provides multithreaded serialization of compressed data frames using the 'fst' format. The 'fst' format allows for random access of stored data and compression with the 'LZ4' and 'ZSTD' compressors.

modelDown — by Kamil Romaszko, 5 years ago

Make Static HTML Website for Predictive Models

Website generator with HTML summaries for predictive models. This package uses 'DALEX' explainers to describe global model behavior. We can see how well models behave (tabs: Model Performance, Auditor), how much each variable contributes to predictions (tabs: Variable Response) and which variables are the most important for a given model (tabs: Variable Importance). We can also compare Concept Drift for pairs of models (tabs: Drifter). Additionally, data available on the website can be easily recreated in current R session. Work on this package was financially supported by the NCN Opus grant 2017/27/B/ST6/01307 at Warsaw University of Technology, Faculty of Mathematics and Information Science.

factorMerger — by Tomasz Mikołajczyk, 6 years ago

The Merging Path Plot

The Merging Path Plot is a methodology for adaptive fusing of k-groups with likelihood-based model selection. This package contains tools for exploration and visualization of k-group dissimilarities. Comparison of k-groups is one of the most important issues in exploratory analyses and it has zillions of applications. The traditional approach is to use pairwise post hoc tests in order to verify which groups differ significantly. However, this approach fails with a large number of groups in both interpretation and visualization layer. The Merging Path Plot solves this problem by using an easy-to-understand description of dissimilarity among groups based on Likelihood Ratio Test (LRT) statistic (Sitko, Biecek 2017) . 'factorMerger' is a part of the 'DrWhy.AI' universe (Biecek 2018) . Work on this package was financially supported by the 'NCN Opus grant 2016/21/B/ST6/02176'.

CEC — by Simon Garnier, 9 months ago

Cross-Entropy Clustering

Splits data into Gaussian type clusters using the Cross-Entropy Clustering ('CEC') method. This method allows for the simultaneous use of various types of Gaussian mixture models, for performing the reduction of unnecessary clusters, and for discovering new clusters by splitting them. 'CEC' is based on the work of Spurek, P. and Tabor, J. (2014) .