It always takes some time to get a grip on a new dataset, especially a large one. The codebooks are often as indispensable as they are massive, and not always as clear as one would want. Routings, and the strange patterns of missing values that result from them, are at times difficult to find.

I found a nice way to plot missing values using R. The idea is to calculate the percentage of missing values on each variable, separately for each year represented in the data. These percentages can then be visualized with levelplot(), which resulted in the graph below.
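The original code is not shown here, so what follows is only a minimal sketch of the idea, not necessarily how the graph was produced. It assumes the data sit in a data frame with a `year` column; the toy data, the variable names (`income`, `educ`, `child`) and the object names `dat` and `pct_missing` are all made up for illustration. levelplot() comes from the lattice package.

```r
library(lattice)

# Hypothetical toy data: a 'year' column plus three survey variables.
set.seed(1)
n <- 300
dat <- data.frame(
  year   = rep(2001:2003, each = n / 3),
  income = rnorm(n),
  educ   = sample(1:5, n, replace = TRUE),
  child  = rbinom(n, 1, 0.4)
)

# Introduce missing values in different patterns.
dat$income[dat$year == 2001] <- NA                 # variable absent in one year
dat$educ[sample(n, 30)] <- NA                      # scattered non-response
dat$child[dat$year == 2003 & dat$income > 0] <- NA # routing-like pattern

# Percentage of missing values for each variable (columns) within each year (rows).
vars <- setdiff(names(dat), "year")
pct_missing <- sapply(vars, function(v) {
  tapply(is.na(dat[[v]]), dat$year, mean) * 100
})

# For a matrix, levelplot() puts rows on the x-axis and columns on the y-axis.
p <- levelplot(pct_missing,
               xlab = "Year", ylab = "Variable",
               at = seq(0, 100, by = 10),
               aspect = "fill")
print(p)
```

Each cell of the plot is one variable in one year, shaded by the share of missing values. A variable that was not collected in a given year shows up as a fully shaded cell, and routing-related gaps appear as partly shaded ones.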

Curving Normality is an academic website and blog maintained by Rense Nieuwenhuis.

Rense is a Ph.D. candidate at the Institute for Innovation and Governance Studies (IGS) of the University of Twente.

His work is forthcoming in the Journal of Marriage and Family and the European Sociological Review.
