This is a work-in-progress pipeline for detecting viral sequences in RNA-seq data. I’ve based it on the methods used in this paper: https://www.nature.com/articles/s41564-024-01796-6

Software:

Input data:

Workflow:

Steps:

  1. Trim adapters and low-quality bases from the paired-end reads with Trimmomatic (already downloaded on the HPC). For example:
java -jar /programs/trimmomatic/trimmomatic-0.39.jar PE -phred33 \
  input_forward.fastq input_reverse.fastq \
  output_forward_paired.fastq output_forward_unpaired.fastq \
  output_reverse_paired.fastq output_reverse_unpaired.fastq \
  ILLUMINACLIP:/programs/trimmomatic/adapters/TruSeq3-PE-2.fa:2:30:10 \
  LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:35
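
To run the same trimming over several samples, something like the sketch below might work. It is only an illustration, not a tested part of the pipeline: the `<sample>_R1.fastq` / `<sample>_R2.fastq` naming, the `trimmed` output directory, the sample names, and the `DRY_RUN` switch are all assumptions I made up for the example. With `DRY_RUN=1` (the default here) it only prints each command so it can be checked before anything runs.

```shell
#!/bin/sh
# Sketch: apply the Trimmomatic settings from step 1 to several samples.
# ASSUMPTIONS (hypothetical, not from the notes above): inputs are named
# <sample>_R1.fastq and <sample>_R2.fastq, outputs go to ./trimmed,
# and the sample names in the loop are placeholders.
set -eu

TRIMMOMATIC_JAR=/programs/trimmomatic/trimmomatic-0.39.jar
ADAPTERS=/programs/trimmomatic/adapters/TruSeq3-PE-2.fa
OUT_DIR=trimmed           # hypothetical output directory
DRY_RUN=${DRY_RUN:-1}     # 1 = print the command only, 0 = execute it

# Print the command on one line (dry run) or execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Trim one paired-end sample with the same parameters as the example above.
trim_sample() {
    name=$1
    # Only create the output directory when actually running.
    [ "$DRY_RUN" = "1" ] || mkdir -p "$OUT_DIR"
    run java -jar "$TRIMMOMATIC_JAR" PE -phred33 \
        "${name}_R1.fastq" "${name}_R2.fastq" \
        "$OUT_DIR/${name}_R1_paired.fastq" "$OUT_DIR/${name}_R1_unpaired.fastq" \
        "$OUT_DIR/${name}_R2_paired.fastq" "$OUT_DIR/${name}_R2_unpaired.fastq" \
        "ILLUMINACLIP:${ADAPTERS}:2:30:10" \
        LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:35
}

for sample in sampleA sampleB; do   # placeholder sample names
    trim_sample "$sample"
done
```

Once the printed commands look right, running the script with `DRY_RUN=0` set in the environment would execute them instead of printing them.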