| Next revision | Previous revision | ||
| gene_prediction_just_augustus [2024/10/25 13:57] – created 134.190.144.194 | gene_prediction_just_augustus [2024/10/28 10:17] (current) – 134.190.221.230 | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| + | ====== Gene prediction with just Augustus ====== | ||
| + | By Jason Shao & Joran Martijn (Last Edited: October 28th 2024) | ||
| + | |||
| + | ====Intro==== | ||
| + | **Augustus** is an //ab initio// gene predictor that employs **H**idden **M**arkov **M**odels (HMMs) pre-trained on existing datasets. Training a custom HMM on your own data is possible, but this basic tutorial only covers the pre-trained models. | ||
| + | |||
| + | ====Example Usage==== | ||
| + | < | ||
| + | source activate augustus-3.5.0 | ||
| + | |||
| + | # only required for species=generic | ||
| + | export AUGUSTUS_CONFIG_PATH="/ | ||
| + | |||
| + | augustus \ | ||
| + | --species=generic \ | ||
| + | <your genome> \ | ||
| + | --gff3=on \ | ||
| + | --outfile=< | ||
| + | |||
| + | conda deactivate | ||
| + | </ | ||
| + | Note that the species here is set to generic to minimize bias when annotating a divergent organism. If your organism is closely related to one of the pre-trained species below, specify that species instead to get a better prediction. | ||
| + | |||
| + | If you use a pre-trained species, you can leave out the export line. The export line is only needed because Augustus stopped shipping the probability files for the generic species around version 3.3.3. Why did they do that, you might ask? Unfortunately, that is another great mystery that science cannot explain. | ||
| + | |||
| + | ====Pre-trained Species==== | ||
| ^ Identifier | ^ Identifier | ||
| | human | //Homo sapiens// | | human | //Homo sapiens// | ||
| Line 7: | Line 33: | ||
| | tribolium | | tribolium | ||
| | schistosoma | | schistosoma | ||
| - | | tetrahymena | + | | **tetrahymena** |
| | galdieria | | galdieria | ||
| | maize | //Zea mays// | | maize | //Zea mays// | ||
| - | | toxoplasma | + | | **toxoplasma** | **// |
| | caenorhabditis | | caenorhabditis | ||
| | aspergillus_fumigatus | | aspergillus_fumigatus | ||
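For an organism closely related to one of the pre-trained species listed above, the call can be sketched as follows. This is a minimal sketch, not the authors' exact command: the helper name ''run_augustus'' and the file names ''genome.fasta'' and ''predictions.gff3'' are hypothetical placeholders, and it assumes the augustus-3.5.0 environment is already active. No ''AUGUSTUS_CONFIG_PATH'' export is included, since that is only needed for species=generic. If augustus or the genome file cannot be found, the helper prints the command instead of running it.

```shell
# Hypothetical helper: run Augustus with a pre-trained species model.
#   $1 = species identifier from the table (e.g. tetrahymena)
#   $2 = genome FASTA file (placeholder name)
#   $3 = output GFF3 file (placeholder name)
run_augustus() {
  if command -v augustus >/dev/null 2>&1 && [ -f "$2" ]; then
    # Same options as the generic example, minus the export line
    augustus --species="$1" "$2" --gff3=on --outfile="$3"
  else
    # Fall back to a dry run so the command can be inspected first
    echo "dry run: augustus --species=$1 $2 --gff3=on --outfile=$3"
  fi
}

run_augustus tetrahymena genome.fasta predictions.gff3
```

Swap ''tetrahymena'' for whichever identifier in the table is closest to your organism.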
