Hack Harris 12 - OpenRefine and Regex

This week at Hack Harris (Wednesday at 6pm in 140B) we’ll be starting with an intro to OpenRefine, formerly Google Refine. If you could download and install it before Wednesday, that’d be swell, but it’s not mandatory. OpenRefine is part database, part spreadsheet, borrowing from each to make the tedious job of cleaning big data sets still tedious but slightly less annoying.

We’ll also be exploring the marvelous world of regular expressions, a widespread tool for searching text that can be used in practically any programming language, as well as in OpenRefine. Inspired by xkcd, we’ll take a look at a great tutorial and then play a few rounds of Regex Golf.
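
If you want a taste beforehand, here is a minimal sketch of the Regex Golf idea in Python: find a pattern that matches every word in one list and none in another. The word lists and the `golf_score` helper are made up for illustration and are not taken from the actual game or the tutorial.

```python
import re

def golf_score(pattern, wanted, unwanted):
    """Count how many wanted words match and how many unwanted words match."""
    regex = re.compile(pattern)
    hits = sum(1 for word in wanted if regex.search(word))
    false_hits = sum(1 for word in unwanted if regex.search(word))
    return hits, false_hits

# Hypothetical word lists, invented for this example.
wanted = ["refine", "regex", "remix"]
unwanted = ["harris", "golf", "data"]

# "^re" only matches words that start with "re".
print(golf_score(r"^re", wanted, unwanted))

# "r" is shorter, but it also matches "harris", so it is not a clean solution.
print(golf_score(r"r", wanted, unwanted))
```

A perfect pattern hits everything in the first list and nothing in the second. The golf part is doing that in as few characters as possible.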