This two-day workshop is aimed at providing a basic understanding of creating and evaluating de novo assembly using long read technologies. The workshop requires knowledge of working with the Linux operating system and of biological topics such as genetics and DNA. Although the data is from a plant genome, the learned principles can be applied to many other organisms, including bacteria, fungi and mammals. The idea behind this workshop is that you learn the most by doing and discussing the results.
- Read Quality assessment. Using bioinformatics tools to check the quality of the reads.
- De novo assembly. Producing genome assemblies based on PacBio, Nanopore and Illumina.
- Assembly comparison. The genome assemblies based on the different types of data might have different results. Here we will compare the results and investigate the found differences between them.
– Assembly base Quality check using mRNA sequence. We will use our NGS data and mRNA sequences to identify SNPs and other discrepancies between the assemblies. We will study the effects these assembly errors have on downstream analyzes.
Day 1: Morning: Quality control and evaluation of long read data Afternoon: Assembly of long reads
Day 2: Morning: Comparison and visualization of long read assemblies Afternoon: Validation of assemblies