This is an applied statistics course focusing on data analysis. The course begins with an overview of how to organise, perform, and write-up data analyses. The course starts with a theoretical part on the how to mine very large datasets to get valuable data to analyse. Then it covers some of the most popular and widely used statistical methods to analyse the data, like linear regression, principal components analysis, cross-validation, and p-values. Instead of focusing on mathematical details, the lectures are designed to help you apply these techniques to real data using the R statistical programming language, interpret and visualise the results, and diagnose potential problems in your analysis.