Found 208 packages in 0.01 seconds
Normalization of 'JSON' Strings
Provides a function allowing to normalize a 'JSON' string, for example by adding double quotes around the keys when they are missing. Also provides 'RStudio' addins for the same purpose.
Fuzzy String Matching
Fuzzy string matching implementation of the 'fuzzywuzzy' < https://github.com/seatgeek/fuzzywuzzy> 'python' package. It uses the Levenshtein Distance < https://en.wikipedia.org/wiki/Levenshtein_distance> to calculate the differences between sequences.
Handle Strings as Vectors of Characters
Creates a new chars class which looks like a string but is actually a vector of individual characters, making 'strings' iterable. This class enables vector operations on 'strings' such as reverse, sort, head, and set operations.
'Python' Style String Functions
A comprehensive set of string manipulation functions based on those found in 'Python' without relying on 'reticulate'. It provides functions that intend to (1) make it easier for users familiar with 'Python' to work with strings, (2) reduce the complexity often associated with string operations, (3) and enable users to write more readable and maintainable code that manipulates strings.
Algorithmic Complexity for Short Strings
Main functionality is to provide the algorithmic complexity for short strings, an approximation of the Kolmogorov Complexity of a short string using the coding theorem method (see ?acss). The database containing the complexity is provided in the data only package acss.data, this package provides functions accessing the data such as prob_random returning the posterior probability that a given string was produced by a random process. In addition, two traditional (but problematic) measures of complexity are also provided: entropy and change complexity.
Adaptive Builder for Formatted Strings
Provides a set of functions to facilitate building formatted strings under various replacement rules: C-style formatting, variable-based formatting, and number-based formatting. C-style formatting is basically identical to built-in function 'sprintf'. Variable-based formatting allows users to put variable names in a formatted string which will be replaced by variable values. Number-based formatting allows users to use index numbers to represent the corresponding argument value to appear in the string.
Cluster Strings by Edit-Distance
Returns an edit-distance based clusterization of an input vector of strings. Each cluster will contain a set of strings w/ small mutual edit-distance (e.g., Levenshtein, optimum-sequence-alignment, Damerau-Levenshtein), as computed by stringdist::stringdist(). The set of all mutual edit-distances is then used by graph algorithms (from package 'igraph') to single out subsets of high connectivity.
C++ Tools for Cleaning Strings
Provides function declarations and inline function definitions that facilitate cleaning strings in C++ code before passing them to R.
Assertions to Check Properties of Strings
A set of predicates and assertions for checking the properties of strings. This is mainly for use by other package developers who want to include run-time testing features in their own packages. End-users will usually want to use assertive directly.
Parse 'User-Agent' Strings
Despite there being a section in RFC 7231 < https://tools.ietf.org/html/rfc7231#section-5.5.3> defining a suggested structure for 'User-Agent' headers this data is notoriously difficult to parse consistently. Tools are provided that will take in user agent strings and return structured R objects. This is a 'V8'-backed package based on the 'ua-parser' project < https://github.com/ua-parser>.