trinity_for_assembling_transcript_reads

Differences

This shows you the differences between two versions of the page.

trinity_for_assembling_transcript_reads [2017/07/17 09:07] 129.173.90.165trinity_for_assembling_transcript_reads [2025/11/07 11:27] (current) 134.190.191.186
Line 1: Line 1:
 ====== Trinity - Assembly of transcriptome reads ======
 +Documentation by Shelby Williams (last updated by D. Salas-Leiva, 01-07-2020; J. Martijn, 24-04-2023; and K. Dunn, 07-11-2025)
  
 Trinity assembles RNA-seq reads after they have been trimmed. It uses three programs (Inchworm, Chrysalis, and Butterfly) to assemble large volumes of transcriptome reads. The output of Trinity is the Trinity.fasta file found in the trinity_out_dir/ folder.
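
As a quick sanity check once a run has finished, the number of assembled transcripts can be counted from the FASTA headers. This is a minimal sketch: the helper name ''count_transcripts'' and the ''ASSEMBLY'' variable are made up for illustration, and the default path simply assumes the output location described above. Adjust it if your Trinity version writes the file elsewhere.

```shell
#!/bin/bash
# Count FASTA records: every assembled transcript starts with a '>' header line.
# count_transcripts is a hypothetical helper, not part of Trinity.
count_transcripts() {
    grep -c '^>' "$1"
}

# Assumed output location; override with ASSEMBLY=/path/to/Trinity.fasta
assembly="${ASSEMBLY:-trinity_out_dir/Trinity.fasta}"

if [ -s "$assembly" ]; then
    echo "$assembly: $(count_transcripts "$assembly") transcripts"
else
    echo "No assembly found at $assembly (has Trinity finished?)" >&2
fi
```

A count of zero, or a missing file, usually means the job did not run to completion, so check the job's log before going further.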
Line 5: Line 6:
 ---- ----
  
 +A simple Trinity shell script, using the new conda environments:
  
-A simple Trinity shell script:
 +**For strand-specific data (not genome-guided, i.e. //de novo//):**
  
-#!/bin/bash
 +The library type can be **RF** or **FR**. We usually get RF (typical of the dUTP/UDG library preparation method). If unsure, draw violin plots to determine the strand specificity.
  
 +<code>
 +#!/bin/bash
 #$ -S /bin/bash
 +. /etc/profile
 +#$ -cwd
 +#$ -pe threaded 10
  
 +## The special build (version 2.11) is no longer needed; use the newest version instead (see below).
 +## source activate trinity-2.11-with-workaround
 +
 +source activate trinity 
 +## this is version 2.15.2 
 +
 +Trinity \
 +    --seqType fq \
 +    --SS_lib_type RF \
 +    --left Reads_R1_PairNtrim.fastq \
 +    --right Reads_R2_PairNtrim.fastq \
 +    --CPU 10 \
 +    --max_memory 20G
 +
 +conda deactivate
 +</code>
 +
 +If your genome is compact, that is, genes lie very close together with minimal intergenic space, it may be beneficial to run Trinity with the ''--jaccard_clip'' option. This can prevent Trinity from wrongly merging reads from several consecutive genes into a single fused transcript.
 +
 +From the ''--help'' output:
 +<code>
 +#  --jaccard_clip                  :option, set if you have paired reads and
 +#                                   you expect high gene density with UTR
 +#                                   overlap (use FASTQ input file format
 +#                                   for reads).
 +#                                   (note: jaccard_clip is an expensive
 +#                                   operation, so avoid using it unless
 +#                                   necessary due to finding excessive fusion
 +#                                   transcripts w/o it.)
 +</code>
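
One way to keep ''--jaccard_clip'' optional is to build the Trinity command as a bash array and append the flag only when needed. The sketch below is a dry run that prints the command instead of executing it. The function name ''build_trinity_cmd'' and the file names ''left.fastq'' and ''right.fastq'' are placeholders invented for illustration; the Trinity options themselves are the ones used in the script above.

```shell
#!/bin/bash
# Build the Trinity command as an array so an optional flag can be appended safely.
# build_trinity_cmd is a hypothetical helper; arguments: left reads, right reads, yes|no.
build_trinity_cmd() {
    local left="$1" right="$2" use_jaccard="$3"
    local cmd=(Trinity
        --seqType fq
        --SS_lib_type RF
        --left "$left"
        --right "$right"
        --CPU 10
        --max_memory 20G)
    # Only pay the extra cost when fusion transcripts are a real concern.
    if [ "$use_jaccard" = "yes" ]; then
        cmd+=(--jaccard_clip)
    fi
    printf '%s ' "${cmd[@]}"
    printf '\n'
}

# Dry run for a compact genome: print the command rather than running it.
build_trinity_cmd left.fastq right.fastq yes
```

Once the printed command looks right, paste it into the job script in place of the plain Trinity call.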
 +
 +**For strand-specific data (genome-guided):**
 +
 +The library type can be **RF** or **FR**. We usually get RF (typical of the dUTP/UDG library preparation method). If unsure, draw violin plots to determine the strand specificity.
 +
 +<code>
 +#!/bin/bash
 +#$ -S /bin/bash
 . /etc/profile
 +#$ -cwd
 +#$ -pe threaded 10
  
 +source activate trinity-2.11-with-workaround
 +
 +# this special build is version 2.11
 +Trinity \
 +    --CPU 10 \
 +    --max_memory 100G \
 +    --genome_guided_bam yourgenome.fasta.sambamsorted.bam \
 +    --genome_guided_max_intron 1000 \
 +    --SS_lib_type RF
 +
 +conda deactivate
 +</code>
 +
 +
 +
 +
 +**For non-strand-specific data:**
 +<code>
 +#!/bin/bash
 +#$ -S /bin/bash
 +. /etc/profile
 #$ -cwd
 +#$ -pe threaded 10
 +
 +source activate trinity-2.11-with-workaround
 +
 +# this special build is version 2.11
 +Trinity \
 +    --seqType fq \
 +    --left Reads_R1_PairNtrim.fastq \
 +    --right Reads_R2_PairNtrim.fastq \
 +    --CPU 10 \
 +    --max_memory 20G
 +
 +conda deactivate
 +</code>
  
-#$ -pe threaded 2 
  
-export PATH=/opt/perun/jre1.8.0_121/bin:$PATH
 +**ATTENTION!**
-/opt/perun/trinityrnaseq-Trinity-v2.4.0/Trinity --seqType fq --left RetortaCarp_1_PairNtrim.fastq --right RetortaCarp_2_PairNtrim.fastq --CPU 6 --max_memory 20G
 +
-#Note: this is for Trinity version 2.4.0. You may need to update the path to other versions of Trinity as they become installed on perun.
 +Some Trinity versions, such as 2.4.0, are not compatible with Bowtie2. Skip this step by adding the flag **--no_bowtie**.
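
If several Trinity versions are in use, the decision to add ''--no_bowtie'' can be kept in one place. The sketch below is illustrative only: ''bowtie_flag'', ''INCOMPATIBLE_VERSIONS'' and ''TRINITY_VERSION'' are made-up names, the version string is set by hand rather than detected, and 2.4.0 is the only version listed because it is the only one named above.

```shell
#!/bin/bash
# Versions assumed to be incompatible with Bowtie2. Only 2.4.0 is documented;
# extend this list yourself if you run into others.
INCOMPATIBLE_VERSIONS=("2.4.0")

# Print --no_bowtie if the given version is in the list, otherwise print nothing.
# bowtie_flag is a hypothetical helper, not part of Trinity.
bowtie_flag() {
    local version="$1" v
    for v in "${INCOMPATIBLE_VERSIONS[@]}"; do
        if [ "$v" = "$version" ]; then
            echo "--no_bowtie"
            return 0
        fi
    done
    return 0
}

# Set by hand to match the Trinity you are running (assumption: not auto-detected).
TRINITY_VERSION="${TRINITY_VERSION:-2.4.0}"
extra_flag="$(bowtie_flag "$TRINITY_VERSION")"
echo "Extra flag for Trinity ${TRINITY_VERSION}: ${extra_flag:-none}"
```

The printed flag, if any, is then added to the Trinity command line in the job script.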
trinity_for_assembling_transcript_reads.1500293249.txt.gz · Last modified: by 129.173.90.165