This is a linkpost for the original article hosted on my old blog: Analysing the movies I’ve watched, Part II, Data cleaning.