assembling_long_read_data
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| assembling_long_read_data [2017/11/09 11:34] – 129.173.88.84 | assembling_long_read_data [2018/01/08 14:51] (current) – 129.173.88.84 | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| ====== ASSEMBLING LONG READ DATA ====== | ====== ASSEMBLING LONG READ DATA ====== | ||
| + | |||
| + | Documentation by Sarah Shah | ||
| When you have your porechopped reads in fastq and fasta formats, try out the following assemblers: | When you have your porechopped reads in fastq and fasta formats, try out the following assemblers: | ||
| - | Programs: ABruijn ([[https:// | + | Programs: ABruijn ([[https:// |
| **ABruijn** | **ABruijn** | ||
| Line 26: | Line 28: | ||
| / | / | ||
| + | </ | ||
| + | |||
| + | Abruijn has been replaced by **Flye** as of January 2018! Example usage: | ||
| + | < | ||
| + | #!/bin/bash | ||
| + | #$ -S /bin/bash | ||
| + | . / | ||
| + | #$ -cwd | ||
| + | #$ -pe threaded 16 | ||
| + | #$ -o leg | ||
| + | |||
| + | source / | ||
| + | |||
| + | unset PYTHONPATH | ||
| + | |||
| + | flye --nano-raw Acas_merged_pc_fl.fastq --genome-size 45m --out-dir Acas_filtlongFlye --threads 16 --iterations 3 --min-overlap 3000 | ||
| </ | </ | ||
| **Canu** | **Canu** | ||
| Line 58: | Line 76: | ||
| **smartdenovo** | **smartdenovo** | ||
| + | Download smartdenovo to your account on Perun. | ||
| + | < | ||
| + | / | ||
| + | make -f reads.mak | ||
| + | </ | ||
| + | The **.utg** file is the important output. | ||
| **miniasm** | **miniasm** | ||
| - | The simplest and the fastest of all the assemblers here. First, | + | The simplest and the fastest of all the assemblers here. First, |
| + | < | ||
| + | minimap2 -x ava-ont reads.fq reads.fq | gzip -1 > reads.paf.gz | ||
| + | </ | ||
| + | |||
| + | Then, use miniasm: | ||
| + | < | ||
| + | miniasm -f reads.fq reads.paf.gz > reads.gfa | ||
| + | </ | ||
| + | |||
| + | View the .gfa file using **Bandage**. You can convert the .gfa file to a fasta file by: | ||
| + | < | ||
| + | awk '/ | ||
| + | </ | ||
| + | |||
| + | ---- | ||
| + | |||
| + | The Unicycler Github page ([[https:// | ||
| + | |||
| + | Do a quick BLAST search of your contigs and separate out the eukaryotic and bacterial contigs. Compare your assemblies using QUAST ([[http:// | ||
assembling_long_read_data.1510241695.txt.gz · Last modified: by 129.173.88.84
