Accelerating data preparation fo Big Data analytics