This implements the data table back-end for 'dplyr' so that you can seamlessly use data table and 'dplyr' together.
dtplyr is the data.table backend for dplyr. It provides S3 methods for data.table objects so that dplyr works the way you expect.
dtplyr will always be a bit slower than data.table, because it creates copies of objects rather than mutating in place (that's the dplyr philosophy). Currently, dtplyr is quite a lot slower than bare data.table because the methods aren't quite smart enough. I hope interested dplyr & data.table users from the community will help me to improve the performance.
dtplyr was extracted out of dplyr so it could evolve independently (i.e. more rapidly!) than dplyr. It also makes dplyr a little simpler, and it's easier to keep track of issues by backend.
You can install from CRAN with:
Or try the development version from GitHub with:
.keep_all argument (#30, #31).
Slightly improve test coverage (#6).
devtools from GitHub on Travis (#32).
data.table. Right and full join are now implemented (#16, #19).
Remove warnings from tests (#4).
dplyr at revision e5f2952923028803.