It is often mentioned that 80% of a data analysis pipeline is involved with the tedious process of cleaning and preparing data in a correct way so they can be consumed for analysis and visualization (Dasu & Johnson, 2003). Tidy data facilitates easier data transformation and visualization. Tidy data works hand in hand with the tools provided by the tidyverse collection of R packages, in a way that promotes reproducibility and efficiency. ggplot2 (Wickham, 2009) is one of the core members of the tidyverse. It is one of the best and most used R packages for data visualization. In this workshop, participants will learn the principle of tidy data, how to transform and combine datasets using the tools from the tidyverse and how to generate advanced visualization with the ggplot2 package.
Dasu, T., & Johnson, T. (2003). Exploratory Data Mining and Data Cleaning. https://doi.org/10.1002/0471448354
Wickham, H. (2009). ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York. Retrieved from http://ggplot2.org
Former occurrences of this course
8, 11, 16 & 18 June 2021 | 16, 19, 23 and 26 March 2021 | 24-26 February 2020