from_nanopore_to_gene_prediction
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| from_nanopore_to_gene_prediction [2019/07/24 11:10] – [Canu] 134.190.235.39 | from_nanopore_to_gene_prediction [2019/07/24 11:47] (current) – 134.190.235.39 | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| ======From Nanopore to Gene Prediction: a pathway====== | ======From Nanopore to Gene Prediction: a pathway====== | ||
| + | By Greg and Jon | ||
| Line 19: | Line 19: | ||
| There are a number of additional flags that can help refine the program’s behavior to your needs. | There are a number of additional flags that can help refine the program’s behavior to your needs. | ||
| - | Importantly, | + | Importantly, |
| **Model presets:** | **Model presets:** | ||
| Line 63: | Line 63: | ||
| The first flag will allow the program to classify a read based on the barcode call of either the start or end of the read, so long as they do not disagree. | The first flag will allow the program to classify a read based on the barcode call of either the start or end of the read, so long as they do not disagree. | ||
| - | The second flag will classify a read based on a start barcode, and having an end barcode is opitional. **This is the default behaviour** | + | The second flag will classify a read based on a start barcode, and having an end barcode is optional. **This is the default behaviour** |
| The third flag requires the same barcode on both ends of the read in order for it to be classified. | The third flag requires the same barcode on both ends of the read in order for it to be classified. | ||
| Line 112: | Line 112: | ||
| - | Nanopore technology generates a truly absurd amount of data files, which can be unwieldy to use, both in the sense of day to day work, as well as for programs like Terminal to handle. Therefore, we will use another script to combine fast5 files into multi fast5 files. This is the default method currently, however programs like Deepbinner work with the individual files, and most of our older datasets | + | Nanopore technology generates a truly absurd amount of data files, which can be unwieldy to use, both in the sense of day to day work, as well as for programs like Terminal to handle. Therefore, we will use another script to combine fast5 files into multi fast5 files. This is the default method currently, however programs like Deepbinner work with the individual files, and most of our older data sets are still in single file format, which is important to keep in mind. |
| < | < | ||
| Line 134: | Line 134: | ||
| < | < | ||
| - | Additionally, | + | Additionally, |
| I have also created a script in / | I have also created a script in / | ||
| Line 181: | Line 181: | ||
| Before this step, however, you should merge all the files together into one single .fastq file for each barcode directory created by guppy, using the ' | Before this step, however, you should merge all the files together into one single .fastq file for each barcode directory created by guppy, using the ' | ||
| - | On the commandline, write: | + | On the command line, write: |
| < | < | ||
| This will take all files ending in the fastq suffix and merge them into a single file. You can name the output however you like. This is the file you will use for porechop. | This will take all files ending in the fastq suffix and merge them into a single file. You can name the output however you like. This is the file you will use for porechop. | ||
| Line 206: | Line 206: | ||
| ===== Filtlong ===== | ===== Filtlong ===== | ||
| - | This step is important if you have lots and lots of data. Here, filtlong attempts to take the ' | + | This step is important if you have lots and lots of data. Here, filtlong attempts to take the ' |
| In our lab, we typically use filtlong to obtain the longest reads. | In our lab, we typically use filtlong to obtain the longest reads. | ||
| Line 249: | Line 249: | ||
| < | < | ||
| | | ||
| - | | + | |
| conda deactivate</ | conda deactivate</ | ||
from_nanopore_to_gene_prediction.1563977458.txt.gz · Last modified: by 134.190.235.39
