Found 65 packages in 0.15 seconds
Importing and Analysing 'SNP' and 'Silicodart' Data Generated by Genome-Wide Restriction Fragment Analysis
Functions are provided that facilitate the import and analysis of 'SNP' (single nucleotide polymorphism) and 'silicodart' (presence/absence) data. The main focus is on data generated by 'DarT' (Diversity Arrays Technology), however, data from other sequencing platforms can be used once 'SNP' or related fragment presence/absence data from any source is imported. Genetic datasets are stored in a derived 'genlight' format (package 'adegenet'), that allows for a very compact storage of data and metadata. Functions are available for importing and exporting of 'SNP' and 'silicodart' data, for reporting on and filtering on various criteria (e.g. 'CallRate', heterozygosity, reproducibility, maximum allele frequency). Additional functions are available for visualization (e.g. Principle Coordinate Analysis) and creating a spatial representation using maps. 'dartR' supports also the analysis of 3rd party software package such as 'newhybrid', 'structure', 'NeEstimator' and 'blast'. Since version 2.0.3 we also implemented simulation functions, that allow to forward simulate 'SNP' dynamics under different population and evolutionary dynamics. Comprehensive tutorials and support can be found at our 'github' repository: github.com/green-striped-gecko/dartR/. If you want to cite 'dartR', you find the information by typing citation('dartR') in the console.
Datasets for 'spatstat' Family
Contains all the datasets for the 'spatstat' family of packages.
Efficient Implementation of K-Means++ Algorithm
Efficient implementation of K-Means++ algorithm. For more
information see (1) "kmeans++ the advantages of the k-means++
algorithm" by David Arthur and Sergei Vassilvitskii (2007),
Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete
algorithms, Society for Industrial and Applied Mathematics,
Philadelphia, PA, USA, pp. 1027-1035, and (2) "The Effectiveness of
Lloyd-Type Methods for the k-Means Problem" by Rafail Ostrovsky, Yuval
Rabani, Leonard J. Schulman and Chaitanya Swamy
'a la Carte' on Text (ConText) Embedding Regression
A fast, flexible and transparent framework to estimate context-specific word and short document embeddings using the 'a la carte'
embeddings approach developed by Khodak et al. (2018)
Geolocalização De Endereços Brasileiros (Geocoding Brazilian Addresses)
Método simples e eficiente de geolocalizar dados no Brasil. O pacote é baseado em conjuntos de dados espaciais abertos de endereços brasileiros, utilizando como fonte principal o Cadastro Nacional de Endereços para Fins Estatísticos (CNEFE). O CNEFE é publicado pelo Instituto Brasileiro de Geografia e Estatística (IBGE), órgão oficial de estatísticas e geografia do Brasil. (A simple and efficient method for geolocating data in Brazil. The package is based on open spatial datasets of Brazilian addresses, primarily using the Cadastro Nacional de Endereços para Fins Estatísticos (CNEFE), published by the Instituto Brasileiro de Geografia e Estatística (IBGE), Brazil's official statistics and geography agency.)
Discounting and Compounding Calculations for Various Scenarios
Functions for compounding and discounting calculations included here serve as a complete reference for various scenarios of time value of money. Raymond M. Brooks (“Financial Management,” 2018, ISBN: 9780134730417). Sheridan Titman, Arthur J. Keown, John D. Martin (“Financial Management: Principles and Applications,” 2017, ISBN: 9780134417219). Jonathan Berk, Peter DeMarzo, David Stangeland, Andras Marosi (“Fundamentals of Corporate Finance,” 2019, ISBN: 9780134735313). S. A. Hummelbrunner, Kelly Halliday, Ali R. Hassanlou (“Contemporary Business Mathematics with Canadian Applications,” 2020, ISBN: 9780135285015).
Download and Processing of Automatic Weather Stations (AWS) Data of INMET-Brazil
A collection of functions for downloading and processing automatic weather station (AWS) data from INMET (Brazil’s National Institute of Meteorology), designed to support the estimation of reference evapotranspiration (ETo). The package facilitates streamlined access to meteorological data and aims to simplify analyses in agricultural and environmental contexts.
Simplify Access to Data from the Amazon Region
Functions to download and treat data regarding the Brazilian Amazon region from a variety of official sources.
Segment Images in Text Lines and Words
Find text lines in scanned images and segment the lines into words.
Includes implementations of the paper 'Novel A* Path Planning Algorithm for Line Segmentation of Handwritten Documents' by Surinta O. et al (2014)
Building a Concordance of Terms in a Series of Texts
Compute the frequency distribution of a search term in a series of texts. For example, Arthur Conan Doyle wrote a total of 60 Sherlock Holmes stories, comprised of 54 short stories and 4 longer novels. I wanted to test my own subjective impression that, in many of the stories, Sherlock Holmes' popularity was used as bait to induce the reader to read a story that is essentially not primarily a Sherlock Holmes story. I used the term "Holmes" as a search pattern, since Watson would frequently address him by name, or use his name to describe something that he was doing. My hypothesis is that the frequency distribution of the search pattern "Holmes" is a good proxy for the degree to which a story is or is not truly a Sherlock Holmes story. The results are presented in a manuscript that is available as a vignette and online at < https://barryzee.github.io/Concordance/index.html>.