Data exploration and modelling is a process in which a lot of data artifacts are produced. Artifacts like: subsets, data aggregates, plots, statistical models, different versions of data sets and different versions of results. The more projects we work with the more artifacts are produced and the harder it is to manage these artifacts. Archivist helps to store and manage artifacts created in R. Archivist allows you to store selected artifacts as a binary files together with their metadata and relations. Archivist allows to share artifacts with others, either through shared folder or github. Archivist allows to look for already created artifacts by using it's class, name, date of the creation or other properties. Makes it easy to restore such artifacts. Archivist allows to check if new artifact is the exact copy that was produced some time ago. That might be useful either for testing or caching.
asearchexamples due to new version of ggplot2 - 2.2.0 [#296]
asaveexamples due to new version of ggplot2 - 2.2.0 [#300]
loadFromLocalRepo()are now handling URL addresses as well. This may be useful to access artifacts generated by the shiny app.
%a%archives proper names of first object so does
ahistoryprints proper name of archived artifact instead of
latexformat as it has new
atrace()function is added. It call
trace()function to store a selected object in the repository after each call to specified FUN (for example 'lm').
restoreLibs()can now restore libraries in custom directory. [#251
maxTagsparameter so that gallery's summaries in the
README.mdfiles now has limited chunk's length. [#249]
restoreLibs()function is added. It recovers previous versions of R packages. Needed due to rapid changes in structure of
ggplot2objects. Now one can restore version of the
ggplot2package consistent with archived object.
RemoteRepoCheckis used to verify if parameters for remote repo are correct.
asessionreturns session info for given artifact (similar to aread).
aformatreturns vector of formats in which the artifact is saved (similar to aread).
saveToRepoby default saves session info.
repoDirGithas changed name to
subdirand the default value is now '/'.
alinkis now working with github and bitbucket repositories.
asearchreturns named list of artifacts. MD5hashes are used as names.
silent=TRUEby default in
saveToRepo. Less warnings.
saveToRepohas now two copies, consistent with other names
saveToLocalRepoan short one
pullGitHubRepohave been moved to separate
archivist.githubpackage to maintain Local/Remote consistency. [#198].
deleteRepowas deprecated. Use
createEmptyRepowere deprecated. Use
rmFromRepowas deprecated. Use
multiSearchInLocalRepoand it's remote version were deprecated. Now multiple patterns are available in
alinkfunction: Returns a Link To Download an Artifact Stored on GitHub Repository. Ideal combination with
pushRepofunction which add files, commits them and pushes from Local
Repositoryto synchronized GitHub one. [#146].
git pull) changes from remote GitHub
Repositoryto the correspoding Local one. [#146].
createGithubMDGallerythat give the markdown summary for each artifact in the repository. Ideal for README.md file. Example [#144]
asearchfunction enables a user to read artifacts from default GitHub repository. In the previous version it was possible only in default local repository.
apotions('repo/repoDir', NULL, unset = TRUE)[#176].
asearchcompletely new example section divided into 3 subsections: default local repository, default GitHub resository and Github repository.
htestobject's data is now saved to repository as a list.
devtoolss::session_info()with an artifact during the execution of
format:is now added to every artifact/miniature. Artifacts can be saved in different (and more than one) formats (rda/json/csv) what makes them easier to access from other languages.
New and renamed parameters:
createEmptyGithubRepowere changed into
createEmptyGithubReponow can use
repoDirto specify in which directory the synchronized Local Repository should be created [#142].
archiveno longer cats hook to the artifact during the execution. Hook cat can be set with new
alinkparameter that uses
alink()function, where parameters can be passed with
deleteRepohas now new
unsetparameter that allows to unset global
aoptions('repoDir', NULL, unset = TRUE)when deleted
repoDirwas a globally specified Repository [#157].
repoDirto maintain consistency within package documentation and name convention.
cloneGithubReponow reacts on new
defaultparameter which sets newly created/cloned repositories (GitHub and synchronized with it Local one) as default [#171 , #142].
ahistory()to maintain consistency with
alink. Now the
createEmptyGithubRepofunction. We also added
createEmptyLocalRepoto maintain consistency with other sister functions.
createEmptyRepois now a wrapper around
createEmptyGithubRepofunctions. 2. One can now clone GitHub-archivist repo with new
archivefunction. Example: https://github.com/MarcinKosinski/archive-test4/commits/master
archivist-github-integration``` (or shorter?agithub`).
splitTagsGithubenabling to split
tagcolumn in database into two separate columns:
checkDirectoryfunction is now immune to directories that don't exist. This made
showLocalRepofunction working properly when passed an argument to the directory that do not exist. 2. Changed
dbDisconnect( conn )call to the
on.exit(dbDisconnect( conn ))in
executeSingleQueryfunction to prevent a situation in which during an error inside a function (which might be produced), the connection stays open, when it shouldn
operator does react ondefault = TRUE
deleteRoot = TRUEargument of the
deleteRepofunction works properly and enables removing root directory of the Repository.
Repository). In case of wrong md5hash abbreviation a user will receive an error message.
many = TRUE. They were not removed before.
galleryfolder. They were not removed before.
Invisible(NULL)is the result of the function evaluation.
Invisible(NULL)is the result of the function evaluation
copyLocalRepois set to
copyFromGithubRepocopies only distinct records for table
backpack.dbfile, that can be seen with
show*Repoand copies all mentioned artifacts for local version.
createEmptyRepofunction gives a user-friendly error.
zipGithubRepounzipped file has the same name as zip file. Earlier it had a name of the temporary file that was difficult to notice.
setGithubRepoit is now possible to use repoDirGit parameter. Before there was wrong
paste0()was replaced by
file.path()in appropriate places of function's bodies in the following R scripts:
checkDirectory's function body were removed due to changes in point 11.
checkDirectory2was completely removed as it is unnecessary now.
test_base_functionalities.Rdue to changes in point 11 and 12.
repowill work properly with
summaryGithubRepowhen set. It might have not been noticed in version 1.7, it might have been a bug that occured in the development between 1.7 and 1.8 version.
print.ahistoryfunction can now print outputs of the artifact's history as the
knitr::kablewould. 2. Examples for
searchInGithubReponow works for
repo='archivistparameters as we added new backpack.db file. The previous one was almost empty (for 7 months). 3. Additional examples to better understand usage of archivist package functions: 1. in
loadFromRepofunction - Loading artifacts from the repository which is built in the archivist package and saving them on the example repository. 2. in
createEmptyRepofunction - creating a default local Repository in non existing directory. 3. in
rmFromRepofunction - removing artifacts with
many = TRUEargument. 4. in
deleteRepofunction - using
deleteRoot = TRUEargument. 5. in
copy*Repofunction - using graphGallery local repository in
copyLocalRepofunction. 6. in
get*Tagsfunction - additional example using
getTagsLocalfunction. 7. in
aoptionsfunction - added two new examples concerning usage of
repoDirparameters in this function. 4. Alterations in the text of:
?areaddocumentation pages. 5. Adding missing functions which are used in the archivist package now to
?Repositorydocumentation page. 6.
tempdir()was replaced by
tempfile()in examples sections of:
tempdiris existing directory in which R works so calling
deleteRepo( exampleRepoDir, deleteRoot=TRUE)removed important R files. 7. New tests for the following functions:
Tagsin all functions there has been stated such an order:
Tagsin the text of function's documentation, examples' comments, then
Tagsare considered as a proper name and they begin with capital letter.
tagsin function's body, as parameters, as R object's atrributes, then they begin with small letter.
The order of parameters in asearch has changed!
Added graphGallery for self-contained examples
aread allows for single MD5 hash (which will be read from the default repo)
asearch allows for only patterns (will be searched in local repo)
ahistory has now 'artifact' argument instead of 'obj'
Removed unnecessary dependencies - now archivist is free of dependencies.
shiny package is in Suggests so you should load that package before running shinySearchInLocalRepo function.
saveSetToRepo with a new function
loadSetFromRepo to the
...should be updated...
...should be updated...
setGithubRepofunctions. ...should be updated...