trinity_for_assembling_transcript_reads
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| trinity_for_assembling_transcript_reads [2017/07/17 09:07] – 129.173.90.165 | trinity_for_assembling_transcript_reads [2025/11/07 11:27] (current) – 134.190.191.186 | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| ====== Trinity - Assembly of transcriptome reads ====== | ====== Trinity - Assembly of transcriptome reads ====== | ||
| - | Documentation by Shelby Williams | + | Documentation by Shelby Williams |
| Trinity is an assembler of RNA-seq reads, after they have been trimmed. Trinity uses three programs (Inchworm, Chrysalis, and Butterfly) to assemble large volumes of transcriptome reads. The output of Trinity is the Trinity.fasta file found in the / | Trinity is an assembler of RNA-seq reads, after they have been trimmed. Trinity uses three programs (Inchworm, Chrysalis, and Butterfly) to assemble large volumes of transcriptome reads. The output of Trinity is the Trinity.fasta file found in the / | ||
| Line 6: | Line 6: | ||
| ---- | ---- | ||
| + | A simple Trinity shell script, using the new conda-environments: | ||
| - | A simple Trinity shell script: | + | **For strand-specific data (No genome guided or //de novo//): |
| - | #!/bin/bash | + | It could be **RF** or **FR**. We usually get RF (typical of the dUTP/UDG sequencing method). If unsure, you need to draw violin plots to determine the specificity. |
| + | < | ||
| + | #!/bin/bash | ||
| #$ -S /bin/bash | #$ -S /bin/bash | ||
| + | . / | ||
| + | #$ -cwd | ||
| + | #$ -pe threaded 10 | ||
| + | ##source activate trinity-2.11-with-workaround no longer need to use, use newest version instead | ||
| + | ## this special built is version is 2.11 - use new version now instead see below | ||
| + | |||
| + | source activate trinity | ||
| + | ## this is version 2.15.2 | ||
| + | |||
| + | Trinity \ | ||
| + | --seqType fq \ | ||
| + | --SS_lib_type RF \ | ||
| + | --left Reads_R1_PairNtrim.fastq \ | ||
| + | --right Reads_2_PairNtrim.fastq \ | ||
| + | --CPU 10 \ | ||
| + | --max_memory 20G | ||
| + | |||
| + | conda deactivate | ||
| + | </ | ||
| + | |||
| + | If your genome is compact, that is, genes are very close together with minimum intergenic space, it may be beneficial to run Trinity with the '' | ||
| + | |||
| + | From the '' | ||
| + | < | ||
| + | # --jaccard_clip | ||
| + | # you expect high gene density with UTR | ||
| + | # | ||
| + | # for reads). | ||
| + | # | ||
| + | # | ||
| + | # | ||
| + | # | ||
| + | </ | ||
| + | |||
| + | **For strand-specific data (Genome guided): | ||
| + | |||
| + | It could be **RF** or **FR**. We usually get RF (typical of the dUTP/UDG sequencing method). If unsure, you need to draw violin plots to determine the specificity. | ||
| + | |||
| + | < | ||
| + | #!/bin/bash | ||
| + | #$ -S /bin/bash | ||
| . / | . / | ||
| + | #$ -cwd | ||
| + | #$ -pe threaded 10 | ||
| + | source activate trinity-2.11-with-workaround | ||
| + | |||
| + | # this special built is version is 2.11 | ||
| + | Trinity \ | ||
| + | --CPU 10 \ | ||
| + | --max_memory 100G \ | ||
| + | --genome_guided_bam yourgenome.fasta.sambamsorted.bam \ | ||
| + | --genome_guided_max_intron 1000 \ | ||
| + | --SS_lib_type RF | ||
| + | - | ||
| + | conda deactivate | ||
| + | </ | ||
| + | |||
| + | |||
| + | |||
| + | |||
| + | **For NO strand specific data:** | ||
| + | < | ||
| + | #!/bin/bash | ||
| + | #$ -S /bin/bash | ||
| + | . / | ||
| #$ -cwd | #$ -cwd | ||
| + | #$ -pe threaded 10 | ||
| + | |||
| + | source activate trinity-2.11-with-workaround | ||
| + | |||
| + | # this special built is version is 2.11 | ||
| + | Trinity \ | ||
| + | --seqType fq \ | ||
| + | --left Reads_R1_PairNtrim.fastq \ | ||
| + | --right Reads_2_PairNtrim.fastq \ | ||
| + | --CPU 10 \ | ||
| + | --max_memory 20G | ||
| + | |||
| + | source deactivate | ||
| + | </ | ||
| - | #$ -pe threaded 2 | ||
| - | export PATH=/ | + | **ATTENTION!** |
| - | / | + | |
| - | #Note: this is for Trinity | + | Some Trinity |
trinity_for_assembling_transcript_reads.1500293277.txt.gz · Last modified: by 129.173.90.165
