Found 448 packages in 0.02 seconds
Split-Apply-Combine with Dynamic Groups
Estimate group aggregates, where one can set user-defined conditions
that each group of records must satisfy to be suitable for aggregation. If
a group of records is not suitable, it is expanded using a collapsing scheme
defined by the user. A paper on this package was published in the Journal
of Statistical Software
Fast, Robust, and High-Quality Synthetic Data Generation with a Tuneable Privacy-Utility Trade-Off
Synthesize numeric, categorical, mixed and time series data. Data circumstances including mixed (or zero-inflated) distributions and missing data patterns are reproduced in the synthetic data. A single parameter allows balancing between high-quality synthetic data that represents correlations of the original data and lower quality but more privacy safe synthetic data without correlations. Tuning can be done per variable or for the whole dataset.
Distances on Directed Graphs
Distances on dual-weighted directed graphs using
priority-queue shortest paths (Padgham (2019)
Fit, Simulate and Diagnose Exponential-Family Models for Networks
An integrated set of tools to analyze and simulate networks based on exponential-family random graph models (ERGMs). 'ergm' is a part of the Statnet suite of packages for network analysis. See Hunter, Handcock, Butts, Goodreau, and Morris (2008)
Managing Larger Data on a GitHub Repository
Because larger (> 50 MB) data files cannot easily be committed to git, a different approach is required to manage data associated with an analysis in a GitHub repository. This package provides a simple work-around by allowing larger (up to 2 GB) data files to piggyback on a repository as assets attached to individual GitHub releases. These files are not handled by git in any way, but instead are uploaded, downloaded, or edited directly by calls through the GitHub API. These data files can be versioned manually by creating different releases. This approach works equally well with public or private repositories. Data can be uploaded and downloaded programmatically from scripts. No authentication is required to download data from public repositories.
Geological Time Scale Plotting
Functionality for adding the geological timescale to bivariate plots.
Dynamic Function-Oriented 'Make'-Like Declarative Pipelines
Pipeline tools coordinate the pieces of computationally
demanding analysis projects.
The 'targets' package is a 'Make'-like pipeline tool for statistics and
data science in R. The package skips costly runtime for tasks
that are already up to date,
orchestrates the necessary computation with implicit parallel computing,
and abstracts files as R objects. If all the current output matches
the current upstream code and data, then the whole pipeline is up
to date, and the results are more trustworthy than otherwise.
The methodology in this package
borrows from GNU 'Make' (2015, ISBN:978-9881443519)
and 'drake' (2018,
Core Methods and Classes Used by 'aroma.*' Packages Part of the Aroma Framework
Core methods and classes used by higher-level 'aroma.*' packages part of the Aroma Project, e.g. 'aroma.affymetrix' and 'aroma.cn'.
Conditional Visualization for Statistical Models
Exploring fitted models by interactively taking 2-D and 3-D sections in data space.
R Interface with Google Compute Engine
Interact with the 'Google Compute Engine' API in R. Lets you create, start and stop instances in the 'Google Cloud'. Support for preconfigured instances, with templates for common R needs.