R-Sessions

Applied R: Manual for the quantitative social scientist

Applied R for the quantitative social scientist is a manual on R written specifically as an introduction for the quantitative social scientist. To my opinion, R-Project is a magnificent statistical program, ready to be accepted and implemented in the social sciences. The flexibility of this program and the way data are handled gives the user a sense of closeness to and control over the data. I think this inspires users to analyze their data more creatively and sometimes in a more advanced way.

Index of the R-Sessions

The R-Sessions are a series of blog entries on using R. A large part consists of an R-manual I once wrote. Other posts include some tricks I found out, as well as entries detailing functions and packages I wrote for …

R Sessions 33: Select (nested) observations with equal number of occurences

Recently, I was contacted with an question about R code. A befriended researcher was working with nested data, which was unbalanced. He was working with data in a ‘long’ format: all observations nested within the same group had the same …

Intended to be a customized solution, it may have grown to be a little more. forward.lmer is an early installment of a full stepwise function for mixed effects regression models in R-Project. I may put in some work to extend …

There are various ways of getting your output from R to your publication draft. Most of them are highly efficient, but unfortunately I couldn’t find a function that combines the output from several (lmer) models and presents it in a …

It always takes some time to get a grip on a new dataset, especially large ones. The code-books are often as indispensable as they are massive, and not always as clear as one would want. Routings, and resulting and strange patterns of missing values are at times difficult to find.

I found a nice way to plot missing values, using R. Basically, I thought it would be nice to calculate the percentage of missings on each variable, and do so for each year represented in the data. These numbers could be visualized using a levelplot(), which resulted in the graph below.

Working with statistics can be quite time consuming. As anyone working with relatively advanced models and large amounts of data knows, especially the waiting can be excruciating. Your statistical software is locked up while crunching those numbers, while you’d actually prefer to run some minor procedures, such as post-estimations, testing some loops, or simply displaying the output of a previously estimated model. With Apple’s Mac OS X you now can run R-Project twice, making the most of your dual core processor.

Yesterday, I received my new Apple MacBook. It’s running a Core 2 Duo at 2.4 Ghz and it’s fast. Really fast! I tested it with using R-Project, doing some timings on matrix transformations.

Apparently, it’s very cool to show of the speed of R-Project on your system. Optimized .DLL files help to speed up your R on Windows systems (and possibly other systems as well) with respect to matrix transformations, which has led to enormous speed increases. So, let’s perform a speed-test of our own.

Today, I introduce the new R-Sessions Forum: a new forum on R-Project and statistics in general. This new forum is closely integrated with , but has its own dynamic. The R-Sessions Forum aims at providing a flexible stage for visitors interested in discussing both R-Project and general statistics.

Topics & Features

The new R-Sessions Forum has several interesting features and covers many topics. Amongst the new topics covered are integration with the R-Sessions on Curving Normality, a section for general question on R, a section for general statistical questions, package specific questions, R development, and of course we have a pub for your general chatter. The integration with the R-Sessions is intended to give more dynamic ways of interacting with the Curving Normality web-site. The other topics are more general in nature and provide ample opportunity to pose your own questions and discuss those with other participants.

R-Project works best with a good text editor that is well integrated with R-Project. This edition of the R-Sessions will focus on TextMate, a paid application marketed as ‘The Missing Editor for Mac OS X’.

Designed explicitly for use by programmers on Mac OS X, TextMate makes a promising first impression. The interface looks very clean, text is rendered perfectly, and syntax colouring is provided for quite a large number of programming languages. Also, the colouring of the syntax looks very nice, by the use of light colours that don’t interfere with reading the text.

Curving Normality

Curving Normality is an academic website and blog maintained by Rense Nieuwenhuis.

Rense is a Ph.D. Candidate at the Institue for Innovation and Governance Studies (IGS) of the University of Twente.

His work is forthcoming in the Journal of Marriage and Family and the European Sociological Review.

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Recent Activities

Conference: Day of Sociology