Normally I use R for my statistical analyses. However R has the well know problem of having problems to deal with very large data sets. Because I work with such datasets, I was looking for a free or open source application which can deal with such data sets. Gretl seemed to be perfect for this job. In a little test I tried to run a simple OLS and a Logit-Modell with simulated data with 10 million rows. Gretl did the job in a few seconds. However I discovered that the calculation of a simple crosstable (xtab-command) took very very long even with much smaller datasets. For example with a dataset of 500’000 rows, a crosstable made out of two dummy variables took more then 5 minutes time. I’m working with a MacBook Pro with OS X 10.8.4 running on it.
Log in to post a comment.