This is an old revision of the document!
Table of Contents
Practical Protocols for Probing Protists
Unix, Perun and where to find softwares and databases
RAM and CPU memory - Perun Queues
GPU nodes and how to work with them
GitHub repositories of the lab
Bioconda
Tips and Tricks
The idea of this section is to share command line tips and tricks that you have found made it a lot easier to do bioinformatics work. This can range from basic UNIX things like setting up your terminal environment, configuring your SSH connections or managing your filesystem to explaining new bioinformatics tools and how to work with large scale computer clusters.
Recordings of the Tips and Tricks meetings are available for downloading or viewing online here
Or directly on Perun, on /scratch3/downloads/icgvideo/TAT
The Terminal
Command line utilities
SSH
Common Bioinformatics Operations
DNA Sequence Data Processing
[Short reads] Cleaning of Illumina Paired-end reads
[Short reads] Short read assembly
[Short reads] depth and breadth of coverage
[Long reads] MINION Sequencing from START to FINISH
[Long reads] Assembling Long Read Data
[Long reads] Nanopore tools for polishing
Decontamination using a metagenomics approach (Anvi’o)
Decontamination using read classifier (Eukfinder)
Decontaminating reads using DECONSEQ
[Genome] Evaluating genome completeness using Benchmarking Universal Single-Copy Orthologs (BUSCO)
[Genome] Evaluating genome quality using CGAL
[Mito genome] Visualizing mitochondrial genomes
Changing contig or scaffold names in a genome assembly
Extracting a single fasta entry (or multiple) from a multifasta file
RNA sequence data, gene expression
de novo transcriptome assembly with Trinity
de novo transcriptome assembly with rnaSPAdes
[RNA-seq reads] Mapping RNAseq data to your genome
[Transcriptome] Evaluating and comparing transcriptome assemblies with rnaQUAST
Gene Prediction
[Ab initio] Gene prediction with just GeneMark
[Ab initio] Gene prediction with just Augustus
[RNAseq-informed] Gene prediction with find_supported_orfs.py
[Pipeline] Gene prediction with Braker2
[Pipeline] Gene prediction with the Funannotate
[Fix] Improving gene models with fix_genes_with_false_introns.py
Functional annotation
Functional annotation with the Funannotate pipeline
Finding genes of interest with PANTHER HMMs
Phylogenetic analyses
Phylogenomic analyses
Misc. analyses
Programming languages
Bioinformatics Q&A
Bioinformatics and Trivia
Dokuwiki maintenance
Project Documentation
Blastocystis orf160 This could possibly be an interesting project for a future undergraduate student
