Selecting and cleansing the dataset