Provides functions to facilitate the use of the 'ff' package in interaction with big data in 'SQL' databases (e.g. in 'Oracle', 'MySQL', 'PostgreSQL', 'Hive') by allowing easy importing directly into 'ffdf' objects using 'DBI', 'RODBC' and 'RJDBC'. Also contains some basic utility functions to do fast left outer join merging based on 'match', factorisation of data and a basic function for re-coding vectors.
ETLUtils provides utility functions to execute standard ETL operations (using package ff) on large data. If you have a large dataset in an SQL database and want to import it directly into an ffdf object, the functions to look at are read.dbi.ffdf, read.odbc.ffdf and read.jdbc.ffdf, which fetch data through 'DBI', 'RODBC' and 'RJDBC' respectively.
An example can be found at http://www.bnosac.be/index.php/blog/5-get-your-large-sql-data-in-ff-swiftly and at http://www.bnosac.be/index.php/blog/6-readodbcffdf-a-readdbiffdf-for-fetching-large-corporate-sql-data
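As a rough illustration of importing SQL data into an ffdf, the sketch below reads from SQLite through DBI. It assumes that the RSQLite package is installed and that ETLUtils ships a small example database named `smalldb.sqlite3` containing a table `testdata`; adjust the driver, database name and query to your own setup.

```r
library(ff)
library(DBI)
library(ETLUtils)

# Example database assumed to ship with the ETLUtils package
dbfile <- system.file("smalldb.sqlite3", package = "ETLUtils")
drv <- RSQLite::SQLite()

# Fetch the query result in chunks and append each chunk to an ffdf,
# so the full result never has to fit in RAM at once
x <- read.dbi.ffdf(
  query = "select * from testdata limit 10000",
  dbConnect.args = list(drv = drv, dbname = dbfile),
  first.rows = 100,   # rows in the first chunk, used to set up the ffdf
  next.rows = 1000,   # rows in every later chunk
  VERBOSE = TRUE
)
class(x)
dim(x)
```

The chunk sizes here are arbitrary; larger values for `next.rows` mean fewer round trips at the cost of more memory per chunk.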
For users who want to store data from an ffdf back in a database, the package also provides write.dbi.ffdf, write.odbc.ffdf and write.jdbc.ffdf.
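Writing an ffdf back to a database could look like the sketch below. It assumes RSQLite is installed and writes to a temporary SQLite file; the table name `helloworld` and the toy data are made up for illustration, and the extra arguments `row.names` and `overwrite` are assumed to be passed on to `dbWriteTable`.

```r
library(ff)
library(DBI)
library(ETLUtils)

# Toy ffdf standing in for a large dataset (illustrative data only)
x <- as.ffdf(data.frame(id = 1:5000, value = rnorm(5000)))

# Temporary SQLite file; SQLite creates it on first connection
dbfile <- tempfile(fileext = ".sqlite3")
drv <- RSQLite::SQLite()

# Write the ffdf to the database in chunks of 1000 rows
write.dbi.ffdf(
  x = x, name = "helloworld",
  dbConnect.args = list(drv = drv, dbname = dbfile),
  row.names = FALSE, overwrite = TRUE,
  by = 1000, VERBOSE = TRUE
)

# Check how many rows arrived in the table
con <- dbConnect(drv, dbname = dbfile)
n <- dbGetQuery(con, "select count(*) as n from helloworld")$n
dbDisconnect(con)
n
```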
Other functions include factorise, matchmerge, recoder, naLOCFPlusone and renameColumns.
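Two of these helpers can be tried on ordinary in-memory data. The sketch below uses made-up toy data: `recoder` maps the values listed in `from` to the values in `to`, and `matchmerge` performs a left outer join based on `match`, so the result is expected to keep one row per row of the left-hand table.

```r
library(ETLUtils)

# Recode selected values of a vector
codes <- c("a", "b", "c", "a")
recoded <- recoder(codes, from = c("a", "b"), to = c("low", "high"))
recoded

# Left outer join of two data frames on differently named key columns
left <- data.frame(idlhs = c(1:4, 3:5), a = LETTERS[1:7],
                   stringsAsFactors = FALSE)
right <- data.frame(idrhs = 1:4, b = LETTERS[8:11],
                    stringsAsFactors = FALSE)
merged <- matchmerge(x = left, y = right, by.x = "idlhs", by.y = "idrhs")
merged
```

For comparison, `merge(left, right, by.x = "idlhs", by.y = "idrhs", all.x = TRUE)` from base R expresses the same kind of join.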
This is the development version of the package. The released version is available on CRAN.
To install the latest development version from GitHub:
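A possible way to do this uses the devtools package. The repository path `jwijffels/ETLUtils` is an assumption and should be replaced with the actual location of this repository if it differs.

```r
# devtools provides install_github()
install.packages("devtools")

# Repository path is assumed; adjust it if the package lives elsewhere
devtools::install_github("jwijffels/ETLUtils")
```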
To get the latest released version from CRAN:
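The CRAN release installs with the standard package installer:

```r
# Install the released version from CRAN and load it
install.packages("ETLUtils")
library(ETLUtils)
```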
Version: 1.4.1 [2018-01-25]
Version: 1.4 [2018-01-17]
Version: 1.3 [2015-05-12]
Version: 1.2 [2013-01-03]
Version: 1.1 [2012-05-18]
Version: 1.0 [2012-03-26]