Tableplot, a Visualization of Large Datasets

A tableplot is a visualization of a (large) dataset with about a dozen variables, both numeric and categorical. Each column represents a variable, and each row bin aggregates a certain number of records. Numeric variables are visualized as bar charts and categorical variables as stacked bar charts. Missing values are taken into account. The package also supports large 'ffdf' datasets from the 'ff' package.
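The row-bin aggregation described above can be illustrated with a small, language-neutral sketch. The Python below is not the package's implementation (tabplot is an R package); the function name tableplot_bins, the descending sort order, and the treatment of None as a missing value are assumptions made for illustration only. It sorts the records on one column, splits them into nearly equal row bins, and summarizes each bin: a mean plus a missing fraction for numeric columns (the bar length), and category fractions, including a "missing" category, for categorical columns (the stacked bar).

```python
from collections import Counter


def tableplot_bins(records, sort_col, n_bins):
    """Aggregate a list of dict records into row bins (illustrative sketch)."""
    if not records:
        return []

    # A column counts as numeric if all of its non-missing values are numbers.
    columns = list(records[0])
    numeric = set()
    for col in columns:
        known = [r[col] for r in records if r[col] is not None]
        if known and all(isinstance(v, (int, float)) for v in known):
            numeric.add(col)

    # Sort descending on sort_col; records with a missing sort value go last.
    present = [r for r in records if r[sort_col] is not None]
    missing = [r for r in records if r[sort_col] is None]
    ordered = sorted(present, key=lambda r: r[sort_col], reverse=True) + missing

    n = len(ordered)
    n_bins = min(n_bins, n)  # never use more bins than there are records
    bins = []
    for b in range(n_bins):
        # Integer arithmetic yields bin sizes that differ by at most one.
        chunk = ordered[b * n // n_bins:(b + 1) * n // n_bins]
        summary = {}
        for col in columns:
            values = [r[col] for r in chunk]
            if col in numeric:
                known = [v for v in values if v is not None]
                summary[col] = {
                    "mean": sum(known) / len(known) if known else None,
                    "missing": 1 - len(known) / len(values),
                }
            else:
                counts = Counter("missing" if v is None else v for v in values)
                summary[col] = {k: c / len(values) for k, c in counts.items()}
        bins.append(summary)
    return bins


# Hypothetical example data: four records, two variables.
records = [
    {"age": 30, "sex": "f"},
    {"age": 50, "sex": "m"},
    {"age": None, "sex": "f"},
    {"age": 40, "sex": None},
]
for row_bin in tableplot_bins(records, "age", 2):
    print(row_bin)
```

Each returned dictionary corresponds to one row of the plot. A real implementation would work on sampled or preprocessed data rather than on an in-memory list, which this sketch does not attempt.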


version 1.3

  • added sd plotting
  • column name rotation added (argument rotateNames in plot.tabplot)

version 1.2

  • comparison columns added
  • diverging palettes used for numeric variables

version 1.1-1

  • fixed bug in comparison tableplot
  • fixed bug with sortCol. sortCol can now be a character string, an index number, or an expression

version 1.1

  • improved sampling (now very fast for large datasets!). Arguments sample and sampleBinSize are added.
  • added comparison tableplots (see ?"-.tabplot")

version 1.0-1

  • fixed bug when nBins > number of rows

version 1.0

  • fast preprocessing with ffbase, which uses C code
  • tick marks applied on x-axes for numeric variables
  • flexibility to plot high-cardinality categorical variables
  • fixed a bug: if nBins was an integer, the tableplot function reported that it was not numeric (thanks to Gabor Grothendieck).
  • updated vignette
  • added shiny interface

version 0.12-1

  • fixed bug with one-level-factors

version 0.12

  • added more flexibility to select columns and to subset the data. The new argument select allows both column indices and expressions while the new argument select_string allows character strings. The new argument subset allows logical expressions to indicate which rows to select, while subset_string allows character strings. The subset(_string) can also be used to select a categorical variable to split the data.
  • vignette rewritten (with knitr)
  • updated preprocessing (using data.table), which makes it possible to create tableplots of larger datasets in memory. (The ffdf format is still needed for datasets that don't fit into memory.)
  • added argument nCol, which sets the maximum number of columns. It is now possible to automatically generate easy-to-read tableplots for all columns in the dataset.
  • fixed small ffdf bug
  • fixed bug with large fontsizes
  • added argument fontsize.title

version 0.11-2

  • tableSave: added svg output
  • fixed ffdf bug introduced since R 2.15



1.3-1 by Martijn Tennekes, 8 months ago

Authors: Martijn Tennekes and Edwin de Jonge

Documentation:   PDF Manual  

Task views: Official Statistics & Survey Methodology

GPL-3 license

Imports grid

Depends on bit, ff, ffbase

Suggests shiny, knitr, classInt, ggplot2

Depended on by tabplotd3.

See at CRAN