Examples: visualization, C++, networks, data cleaning, html widgets, ropensci.

Found 139 packages in 0.01 seconds

MKpower — by Matthias Kohl, a year ago

Power Analysis and Sample Size Calculation

Power analysis and sample size calculation for Welch and Hsu (Hedderich and Sachs (2018), ISBN:978-3-662-56657-2) t-tests including Monte-Carlo simulations of empirical power and type-I-error. Power and sample size calculation for Wilcoxon rank sum and signed rank tests via Monte-Carlo simulations. Power and sample size required for the evaluation of a diagnostic test(-system) (Flahault et al. (2005), ; Dobbin and Simon (2007), ) as well as for a single proportion (Fleiss et al. (2003), ISBN:978-0-471-52629-2; Piegorsch (2004), ; Thulin (2014), ), comparing two negative binomial rates (Zhu and Lakkis (2014), ), ANCOVA (Shieh (2020), ), reference ranges (Jennen-Steinmetz and Wellek (2005), ), multiple primary endpoints (Sozu et al. (2015), ISBN:978-3-319-22005-5), and AUC (Hanley and McNeil (1982), ).

MKomics — by Matthias Kohl, 4 years ago

Omics Data Analysis

Similarity plots based on correlation and median absolute deviation (MAD); adjusting colors for heatmaps; aggregate technical replicates; calculate pairwise fold-changes and log fold-changes; compute one- and two-way ANOVA; simplified interface to package 'limma' (Ritchie et al. (2015), ) for moderated t-test and one-way ANOVA; Hamming and Levenshtein (edit) distance of strings as well as optimal alignment scores for global (Needleman-Wunsch) and local (Smith-Waterman) alignments with constant gap penalties (Merkl and Waack (2009), ISBN:978-3-527-32594-8).

MKclass — by Matthias Kohl, 2 years ago

Statistical Classification

Performance measures and scores for statistical classification such as accuracy, sensitivity, specificity, recall, similarity coefficients, AUC, GINI index, Brier score and many more. Calculation of optimal cut-offs and decision stumps (Iba and Langley (1991), ) for all implemented performance measures. Hosmer-Lemeshow goodness of fit tests (Lemeshow and Hosmer (1982), ; Hosmer et al (1997), ). Statistical and epidemiological risk measures such as relative risk, odds ratio, number needed to treat (Porta (2014), ).

survAUC — by Frederic Bertrand, a year ago

Estimators of Prediction Accuracy for Time-to-Event Data

Provides a variety of functions to estimate time-dependent true/false positive rates and AUC curves from a set of censored survival data.

sdcLog — by Matthias Gomolka, 4 months ago

Tools for Statistical Disclosure Control in Research Data Centers

Tools for researchers to explicitly show that their results comply to rules for statistical disclosure control imposed by research data centers. These tools help in checking descriptive statistics and models and in calculating extreme values that are not individual data. Also included is a simple function to create log files. The methods used here are described in the "Guidelines for the checking of output based on microdata research" by Bond, Brandt, and de Wolf (2015) < https://cros.ec.europa.eu/system/files/2024-02/Output-checking-guidelines.pdf>.

permGS — by Matthias Brueckner, 8 years ago

Permutational Group Sequential Test for Time-to-Event Data

Permutational group-sequential tests for time-to-event data based on the log-rank test statistic. Supports exact permutation test when the censoring distributions are equal in the treatment and the control group and approximate imputation-permutation methods when the censoring distributions are different.

rticles — by Christophe Dervieux, a year ago

Article Formats for R Markdown

A suite of custom R Markdown formats and templates for authoring journal articles and conference submissions.

ggdiceplot — by Matthias Flotho, 10 days ago

Dice Plot Visualization for 'ggplot2'

Provides 'ggplot2' extensions for creating dice-based visualizations where each dot position represents a specific categorical variable. The package includes geom_dice() for displaying presence/absence of categorical variables using traditional dice patterns. Each dice position (1-6) represents a different category, with dots shown only when that category is present. This allows intuitive visualization of up to 6 categorical variables simultaneously.

diceplot — by Matthias Flotho, 13 days ago

High Dimensional Categorical Data Visualization

Easy visualization for datasets with more than two categorical variables and additional continuous variables. 'diceplot' is particularly useful for exploring complex categorical data in the context of pathway analysis across multiple conditions. For a detailed documentation please visit < https://dice-and-domino-plot.readthedocs.io/en/latest/>.

timeseriesdb — by Matthias Bannert, 3 years ago

A Time Series Database for Official Statistics with R and PostgreSQL

Archive and manage times series data from official statistics. The 'timeseriesdb' package was designed to manage a large catalog of time series from official statistics which are typically published on a monthly, quarterly or yearly basis. Thus timeseriesdb is optimized to handle updates caused by data revision as well as elaborate, multi-lingual meta information.