The aim of the course is to help experienced R users to tackle the problems they face when analysing big data sets. The course will consist of a mixture of lectures and computer labs (in a ratio of approximately 60/40), so there is plenty of time for hands-on exercises. The main purposes of this course can be summarised as:
- Introduction to Big Data and the similarities and differences between regular modelling approaches and big data modelling
- Understanding of possibilities and limitations of R in big data research
- Introduction to high performance computing (HPC)
- Reproducible research
For this course a good understanding of R is required: this is not a course to learn R. This means that if you are using R regularly (e.g., several times a week), write your own scripts and perhaps even packages, you will benefit from this course.
Former occurrences of this course
5-6 October 2017