Tag Archives: Data visualization


ggplot2 book cover
gggplot2 Elegant Graphics for Data Analysis

Latest addition of Hadley Wickam’s book ggplot2

Springer International Publishing 2016

This is a major update. I spent a lot of time going over the last chapters in the book.

Part 3 Data Analysis covers a different way of using ggplot2. Instead of doing analysis then plotting. Do both parts at the same time using ggplot2 plot and other new useful packages.

Chapter 9 covers tidy data. Tidy data has variables in columns  and observations  in rows. Straight forward but the data doesn’t always come that way.  Packages tidyr and dplyr  help with tidying up data.

One of things covered in Chapter 10 is pipes and the package magrittr. Using pipes makes for cleaner code.

Chapter 11 Modelling for Visualization. Introduces the new package called broom. broom package takes messy data out put of model functions such as lm, glm, anova and makes them tidy.

The beginning of the book covers aes() and that you need it for your plot and geom() you keep adding them as layers.

This a good book for learning how to use ggplot2 and new techniques for analyzing data.

Graphical Models with R


Graphical models with R by Soren Hojsgaard, David Edwards and Steffen Lauritzen published by Springer

I have been to a lot of talks lately on graphing social networks. Most of the code has been in python. I was happy to find this book written in R.
gRbase is the package for the book.
Most of the packages used in the book are on cran. The missing few dependencies are on bioconductor.org. Links to the site are on the gRbase part of cran. With some fiddling I got everything to work correctly.

Chapter 2.3.5 covers Hypothesis testing with graphical models.

I have been busy working thru all the examples in the book. Making lots of big spidery graphs that make sense. I am pleased with the mix a theory and code in this book.

I am thinking about what data to use with the code for an upcoming talk that I am giving.