ploidy_analysis_using_ploidyngs
Differences
This shows you the differences between two versions of the page.
| Next revision | Previous revision | ||
| ploidy_analysis_using_ploidyngs [2019/09/27 10:36] – created 129.173.90.41 | ploidy_analysis_using_ploidyngs [2022/06/24 16:01] (current) – [UPDATE (June 2022)] 134.190.232.29 | ||
|---|---|---|---|
| Line 5: | Line 5: | ||
| Script for running a ploidyNGS analysis on perun | Script for running a ploidyNGS analysis on perun | ||
| + | < | ||
| # | # | ||
| #$ -S /bin/bash\\ | #$ -S /bin/bash\\ | ||
| . / | . / | ||
| #$ -cwd | #$ -cwd | ||
| - | |||
| source activate ploidyNGS-preq | source activate ploidyNGS-preq | ||
| - | / | + | / |
| + | | ||
| + | | ||
| conda deactivate | conda deactivate | ||
| + | </ | ||
| Perun will immediately generate error messages that look dire if not fatal | Perun will immediately generate error messages that look dire if not fatal | ||
| + | |||
| + | '' | ||
| " | " | ||
| Stopping at filesystem boundary (GIT_DISCOVERY_ACROSS_FILESYSTEM not set)." | Stopping at filesystem boundary (GIT_DISCOVERY_ACROSS_FILESYSTEM not set)." | ||
| - | but you can ignore these and the program continues to run | + | '' |
| + | This is the result of a python function defined by ploidyNGS to try to identify which version of the script you are running, but it typically doesn' | ||
| - | One of the options you will be interested in is | ||
| - | -d 100 (default), --max_depth 100 (default) | ||
| - | Max number of reads kepth at each position in the | ||
| - | reference genome (integer, default: 100) | ||
| - | If you have low coverage you will need to change this otherwise you won't have | ||
| - | many sites that qualify for examination. | ||
| + | One of the options you will be interested in is '' | ||
| + | |||
| + | < | ||
| + | Max number of reads kepth at each position in the reference genome (integer, default: 100) | ||
| + | </ | ||
| + | |||
| + | This sets the maximum read depth '' | ||
| Examine the tab output file. In particular, if the lines for Third and Fourth contain | Examine the tab output file. In particular, if the lines for Third and Fourth contain | ||
| Line 38: | Line 45: | ||
| To see the sorted content of the Third and Fourth lines | To see the sorted content of the Third and Fourth lines | ||
| - | grep Fourth myoutput.tab |cut -f4 |sort -n\\ | + | < |
| - | grep Third myoutput.tab |cut -f4 |sort -n | + | $ grep Fourth myoutput.tab | cut -f4 | sort -n |
| + | $ grep Third myoutput.tab | cut -f4 | sort -n | ||
| + | </ | ||
| + | To remove lines containing " | ||
| + | < | ||
| + | $ grep -vP " | ||
| + | </ | ||
| - | To remove Third and Fourth lines from the tab | + | Once satisfied with the tab you need to make a PDF (it is suppose to generate it automatically |
| + | but it doesn' | ||
| - | grep -v Fourth | + | On the command line execute |
| + | < | ||
| + | $ Rscript --vanilla / | ||
| + | </ | ||
| - | Once satisfied with the tab you need to make a pdf (it is suppose | + | This will generate a PDF called '' |
| - | but it doesn' | + | |
| - | On the commandline | + | ==== UPDATE (June 2022) ==== |
| + | |||
| + | I (Joran) have updated | ||
| + | |||
| + | The main new feature is the '' | ||
| + | |||
| + | Other updates: | ||
| + | * Positions reported in the .tab file are now 1-indexed (i.e. starting at 1) instead of 0-indexed (i.e. starting at 0) | ||
| + | * If you have ''/ | ||
| + | * The axis titles in the histogram will be a bit easier to understand | ||
| + | * The script will now report the total number of heteromorphic positions per contig and over all contigs | ||
| + | |||
| + | To call it, use a Perun submission script like this one | ||
| + | |||
| + | < | ||
| + | # | ||
| + | #$ -S /bin/bash | ||
| + | . / | ||
| + | #$ -cwd | ||
| + | #$ -q 256G-batch | ||
| + | |||
| + | source activate ploidyNGS-dependencies | ||
| + | export PATH="/ | ||
| + | |||
| + | BAMFILE=' | ||
| + | MINCOV=10 | ||
| + | MAXDEPTH=2000 | ||
| + | OUTBASE=' | ||
| + | |||
| + | ploidyNGS_minCov.py \ | ||
| + | --out $OUTBASE \ | ||
| + | --bam $BAMFILE \ | ||
| + | --min_cov $MINCOV \ | ||
| + | --max_depth $MAXDEPTH | ||
| + | |||
| + | conda deactivate | ||
| + | </ | ||
| - | Rscript --vanilla | + | A more elaborate script is available on the GitHub repository of the Roger lab: https://github.com/RogerLab/gospel_of_andrew. |
| - | This will generate a pdf called NA which you can transfer to your home computer to look at. | + | The updated scripts are also available on the original GitHub repository https:// |
ploidy_analysis_using_ploidyngs.1569591419.txt.gz · Last modified: by 129.173.90.41
