benchmarking_universal_single-copy_orthologs_busco
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| benchmarking_universal_single-copy_orthologs_busco [2020/04/06 17:01] – 24.138.68.92 | benchmarking_universal_single-copy_orthologs_busco [2024/12/10 14:49] (current) – 129.173.94.151 | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| **BUSCO: Benchmarking Universal Single-Copy Orthologs** | **BUSCO: Benchmarking Universal Single-Copy Orthologs** | ||
| - | Documentation by Sarah Shah and Dayana Salas-Leiva | + | Documentation by Dayana Salas-Leiva |
| - | Web: | + | Web: http:// |
| - | You can check the completeness of your genome by picking out single-copy orthologs. | + | You can check the completeness of your genome by identifying |
| + | |||
| + | tBLASTn for eukaryotic genome and prokaryotic transcriptome modes | ||
| + | |||
| + | Augustus for eukaryotic genome mode | ||
| + | |||
| + | Metaeuk for eukaryotic genome and eukaryotic transcriptome modes | ||
| + | |||
| + | Prodigal for prokaryotic genome mode | ||
| + | |||
| + | HMMER3 for all modes | ||
| + | |||
| + | There are two main dataset versions of busco: odb9 (contains 303 orthologs only compatible with Busco3) and odb10 (contains 255 orthologous only compatible with Busco4) | ||
| The following are examples of a shell script for genomic and a proteomic search: | The following are examples of a shell script for genomic and a proteomic search: | ||
| - | **** BUSCO 4.0.5 **** | + | *** BUSCO 3 *** |
| + | |||
| + | ** Genomic: ** | ||
| + | |||
| + | # | ||
| + | #$ -S /bin/bash | ||
| + | | ||
| + | #$ -pe threaded 1 | ||
| + | #$ -cwd | ||
| + | cd $PWD | ||
| + | # | ||
| + | | ||
| + | | ||
| + | | ||
| + | conda deactivate | ||
| + | |||
| + | ** Proteomic: ** | ||
| + | |||
| + | # | ||
| + | #$ -S /bin/bash | ||
| + | | ||
| + | #$ -pe threaded 1 | ||
| + | #$ -cwd | ||
| + | cd $PWD | ||
| + | # | ||
| + | | ||
| + | | ||
| + | | ||
| + | conda deactivate | ||
| + | |||
| + | |||
| + | ** * BUSCO 4.0.5 * ** | ||
| ** Genomic: ** | ** Genomic: ** | ||
| Line 38: | Line 81: | ||
| - | *** BUSCO 3 *** | + | ** * BUSCO 5.2.2 * ** |
| - | #!/bin/bash | + | ** Genomic: default metaeuk** |
| - | | + | |
| - | ./etc/profile | + | source activate busco-5 |
| - | | + | INPUT=' |
| - | | + | OUTDIR=' |
| - | cd $PWD | + | MODE=' |
| - | | + | # |
| - | source activate | + | ## the latest busco db for eukaryota is odb10 |
| - | export AUGUSTUS_CONFIG_PATH="/ | + | LINEAGEDB=' |
| - | run_BUSCO.py | + | ## busco v5 only works with odb10 |
| - | | + | ## it will not work with odb9 |
| + | # run busco | ||
| + | ## do not specify output dir with a trailing slash, it will lead to a fatal error | ||
| + | ## modes are genome, proteins, transcriptome | ||
| + | ## the below command will use Metaeuk as gene predictor | ||
| + | | ||
| + | | ||
| + | --out $OUTDIR \ | ||
| + | --mode $MODE \ | ||
| + | | ||
| + | | ||
| + | conda deactivate | ||
| + | |||
| + | ** Proteomic: | ||
| + | source activate busco-5 | ||
| + | # in the busco-5 environment, | ||
| + | # / | ||
| + | # but we don't have writing permissions there | ||
| + | # not sure why we need writing permissions but it doesnt work anyway | ||
| + | # but we copied that dir to a place where we do have writing permissions: | ||
| + | # you may want to copy it to your own home | ||
| + | export AUGUSTUS_CONFIG_PATH=" | ||
| + | INPUT=' | ||
| + | OUTDIR=' | ||
| + | MODE=' | ||
| + | AUGUSTUS_SPECIES=' | ||
| + | # setting the lineage db | ||
| + | ## the latest version (as of writing) is odb10 | ||
| + | LINEAGEDB='/ | ||
| + | ## busco v5 only works with odb10 | ||
| + | ## it will not work with odb9 | ||
| + | busco \ | ||
| + | --in $INPUT \ | ||
| + | --out $OUTDIR \ | ||
| + | --mode $MODE \ | ||
| + | --lineage_dataset $LINEAGEDB \ | ||
| + | --cpu 8 \ | ||
| + | --augustus \ | ||
| + | --augustus_species $AUGUSTUS_SPECIES \ | ||
| + | conda deactivate | ||
| + | | ||
| **Note**: Take out the mitochondrial genome before running this analysis. | **Note**: Take out the mitochondrial genome before running this analysis. | ||
benchmarking_universal_single-copy_orthologs_busco.1586203272.txt.gz · Last modified: by 24.138.68.92
