Found 144 packages in 0.01 seconds

tip — by Charles W. Harrison, 3 years ago

Bayesian Clustering Using the Table Invitation Prior (TIP)

Cluster data without specifying the number of clusters using the Table Invitation Prior (TIP) introduced in the paper "Clustering Gene Expression Using the Table Invitation Prior" by Charles W. Harrison, Qing He, and Hsin-Hsiung Huang (2022). TIP is a Bayesian prior that uses pairwise distance and similarity information to cluster vectors, matrices, or tensors.

CUFF — by Charles-Édouard Giguère, 3 years ago

Charles's Utility Function using Formula

Utility functions that provide wrappers around descriptive base functions such as cor, mean, and table. They use the formula interface to pass variables to functions. The package also provides operators to concatenate (%+%), to repeat (%n%), and to manage character vectors for nice display.

robustDA — by Charles Bouveyron, 5 years ago

Robust Mixture Discriminant Analysis

Robust mixture discriminant analysis (RMDA), proposed in Bouveyron & Girard (2009), builds a robust supervised classifier from learning data with label noise. The idea of the method is to confront an unsupervised model of the data with the supervised information carried by the labels of the learning data in order to detect inconsistencies. The method can then build a robust classifier that takes the detected label inconsistencies into account.

DRaWR — by Charles Blatti, 4 years ago

Discriminative Random Walk with Restart

We present DRaWR, a network-based method for ranking genes or properties related to a given gene set. Such related genes or properties are identified from among the nodes of a large, heterogeneous network of biological information. Our method involves a random walk with restarts, performed on an initial network with multiple node and edge types, preserving more of the original, specific property information than current methods that operate on homogeneous networks. In this first stage of our algorithm, we find the properties that are the most relevant to the given gene set and extract a subnetwork of the original network, comprising only the relevant properties. We then rerank genes by their similarity to the given gene set, based on a second random walk with restarts, performed on the above subnetwork.
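The core operation named in this description, a random walk with restart, can be sketched in a few lines. The following is a minimal illustration of the general technique on a toy graph, not DRaWR's implementation: it runs a single walk rather than the two-stage procedure, ignores node and edge types, and every name in it (`random_walk_with_restart`, the `restart` parameter, the toy node labels) is hypothetical.

```python
def random_walk_with_restart(adj, seeds, restart=0.3, tol=1e-10, max_iter=1000):
    """Return stationary visit probabilities of a random walk with restart.

    adj   : dict mapping node -> dict of neighbour -> edge weight
            (every neighbour must also appear as a key of adj)
    seeds : set of nodes the walker jumps back to (the query set)
    """
    nodes = sorted(adj)
    # Restart distribution: uniform over the seed nodes.
    r = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    p = dict(r)
    out_weight = {n: sum(adj[n].values()) for n in nodes}

    for _ in range(max_iter):
        # With probability `restart` the walker jumps back to a seed.
        nxt = {n: restart * r[n] for n in nodes}
        for u in nodes:
            if out_weight[u] == 0:
                # Node without edges: return its mass to the seeds.
                for n in nodes:
                    nxt[n] += (1 - restart) * p[u] * r[n]
                continue
            # Otherwise the walker follows an edge, chosen by weight.
            for v, w in adj[u].items():
                nxt[v] += (1 - restart) * p[u] * w / out_weight[u]
        delta = sum(abs(nxt[n] - p[n]) for n in nodes)
        p = nxt
        if delta < tol:
            break
    return p


# Toy graph (hypothetical labels): a chain g1 - pA - g2 - g3 - pB,
# where g* stand for genes and p* for properties.
toy = {
    "g1": {"pA": 1.0},
    "pA": {"g1": 1.0, "g2": 1.0},
    "g2": {"pA": 1.0, "g3": 1.0},
    "g3": {"g2": 1.0, "pB": 1.0},
    "pB": {"g3": 1.0},
}
scores = random_walk_with_restart(toy, seeds={"g1"})
ranking = sorted(scores, key=scores.get, reverse=True)
```

Nodes close to the seed set receive higher scores, which is what makes the stationary distribution usable as a relevance ranking. A two-stage method like the one described would apply such a walk once to select relevant properties and again on the resulting subnetwork.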

FisherEM — by Charles Bouveyron, 5 years ago

The FisherEM Algorithm to Simultaneously Cluster and Visualize High-Dimensional Data

The FisherEM algorithm, proposed by Bouveyron & Brunet (2012), is an efficient method for the clustering of high-dimensional data. FisherEM models and clusters the data in a discriminative and low-dimensional latent subspace. It also provides a low-dimensional representation of the clustered data. A sparse version of the FisherEM algorithm is also provided.

funLBM — by Charles Bouveyron, 4 years ago

Model-Based Co-Clustering of Functional Data

The funLBM algorithm simultaneously clusters the rows and the columns of a data matrix in which each entry is a function or a time series.

PopPsiSeqR — by Charles Soeder, 3 months ago

Process and Visualize Evolve & Resequence Experiments

Handle data from evolve and resequence experiments. Measured allele frequencies (e.g., from variants called from high-throughput sequencing data) are compared using an update of the PsiSeq algorithm (Earley, Eric and Corbin Jones (2011)). Functions for saving and loading important files are also included, as well as functions for basic data visualization.

Linkage — by Charles Bouveyron, 4 years ago

Clustering Communication Networks Using the Stochastic Topic Block Model Through Linkage.fr

Clusters communication networks using the Stochastic Topic Block Model by posting jobs through the API of the linkage.fr server, which implements the clustering method. The package also visualizes the clustering results returned by the server.

HDclassif — by Laurent Berge, 8 months ago

High Dimensional Supervised Classification and Clustering

Discriminant analysis and data clustering methods for high-dimensional data, based on the assumption that high-dimensional data live in different subspaces with low dimensionality. The package proposes a new parametrization of the Gaussian mixture model that combines the ideas of dimension reduction and constraints on the model.

trade — by Charles Taragin, 2 months ago

Tools for Trade Practitioners

A collection of tools for trade practitioners, including the ability to calibrate different consumer demand systems and simulate the effects of tariffs and quotas under different competitive regimes. These tools are derived from Anderson et al. (2001) and Froeb et al. (2003).