gene_prediction_find_supported_orfs
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| gene_prediction_find_supported_orfs [2025/08/18 14:59] – [find_supported_orfs.py in a nutshell] 134.190.145.228 | gene_prediction_find_supported_orfs [2025/08/18 15:53] (current) – 134.190.145.228 | ||
|---|---|---|---|
| Line 17: | Line 17: | ||
| 2. Load FASTA file representing the genome sequence | 2. Load FASTA file representing the genome sequence | ||
| - | 3. Interrogate the RNAseq BAM file and extract **the locations of all introns** | + | 3. Interrogate the RNAseq BAM file and extract **the locations of all supported |
| - | To be added, an intron | + | An intron |
| * Have at least 5 reads supporting the existence of this intron | * Have at least 5 reads supporting the existence of this intron | ||
| Line 40: | Line 40: | ||
| Currently, a high coverage region must be at least 150 bp in length to move on to the next stage | Currently, a high coverage region must be at least 150 bp in length to move on to the next stage | ||
| + | |||
| + | |||
| + | {{ : | ||
| 5. High coverage regions are matched with supported introns and **spliced** accordingly | 5. High coverage regions are matched with supported introns and **spliced** accordingly | ||
| Line 53: | Line 56: | ||
| 8. The genes are then printed in GFF3 format | 8. The genes are then printed in GFF3 format | ||
| + | ===== Blastocystis ===== | ||
| + | |||
| + | Many // | ||
| + | |||
| + | Run of the mill gene predictors are unaware of this peculiarity, | ||
| + | |||
| + | find_supported_orfs.py integrates a nifty work-around to recognize such genes. The terminal ' | ||
| + | |||
| + | Currently, only the last 50 bp of a high coverage region are search for this motif, and only the last instance of this motif is edited. This strictness was found to eliminate many false positive hits. | ||
gene_prediction_find_supported_orfs.1755539954.txt.gz · Last modified: by 134.190.145.228
