This is an old revision of the document!
Trinity - Assembly of transcriptome reads
Documentation by Shelby Williams (last updated by D. Salas-Leiva, 04-13-2020)
Trinity is an assembler of RNA-seq reads, after they have been trimmed. Trinity uses three programs (Inchworm, Chrysalis, and Butterfly) to assemble large volumes of transcriptome reads. The output of Trinity is the Trinity.fasta file found in the /trinity_out_dir/ folder/
A simple Trinity shell script, using the new conda-environments:
For strand-specific data:
It could be RF or FR. We usually get RF (typical of the dUTP/UDG sequencing method). If unsure, you need to draw violin plots to determine the specificity.
#!/bin/bash #$ -S /bin/bash . /etc/profile #$ -cwd #$ -pe threaded 10 source activate trinity-fixed # this special built is version is 2.6.6 Trinity --seqType fq --SS_lib_type RF --left Reads_R1_PairNtrim.fastq --right Reads_2_PairNtrim.fastq --CPU 10 --max_memory 20G source deactivate
For NO strand specific data:
#!/bin/bash #$ -S /bin/bash . /etc/profile #$ -cwd #$ -pe threaded 10 source activate trinity-fixed # this special built is version is 2.6.6 Trinity --seqType fq --left Reads_R1_PairNtrim.fastq --right Reads_2_PairNtrim.fastq --CPU 10 --max_memory 20G source deactivate
ATTENTION!
Some Trinity versions, such as 2.4.0, are not compatible with Bowtie2. Skip this step by adding the flag –no_bowtie
