-
Notifications
You must be signed in to change notification settings - Fork 140
Description
Hello,
I am using cutadapt 5.0 and Python 3.12.10. Cutadapt was installed using miniforge3.
I have a paired-end TruSeq Illumina multiplexed library (96 samples total). I am trying to remove adapters as well as demultiplex the reads based on inline barcode sequence.
This is the command that I used:
cutadapt -e 0 --no-indels --pair-filter=both \
-g file:adapterA_fwd.fasta -G file:adapterA_fwd.fasta \
-a GATCGGAAGAGCACACGTCTGAACTCCAGTCAC -A GATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT \
-o {name}.1.fastq -p {name}.2.fastq \
R1_001.fastq R2_001.fastq
The analysis runs and generates a R1 and R2 fastq file for each of the 96 barcodes as well as unknown R1 and R2 fastq files. I also get two unexpected files: 1.1.fastq, 1.2.fastq, which I have figured out are related to sequences that also carry the -a/-A adapter sequence. I thought that the analysis would first remove sequencing adapters and then demultiplex the adapter trimmed reads into R1/R2 fastq files corresponding to the barcodes in the -g/-G provided files.
I have gotten around this issue by doing the following:
- Remove sequencing primer binding site read through from 3’ end:
cutadapt \
-a GATCGGAAGAGCACACGTCTGAACTCCAGTCAC \
-A GATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT \
-o {name}.1.fastq -p {name}.2.fastq \
R1.fastq R2.fastq
- Concatenate sequencing adapter trimmed reads with untrimmed reads to prepare for demultiplexing:
cat 1.1.fastq unknown.1.fastq > R1_adapter_removed.fastq
cat 1.2.fastq unknown.2.fastq > R2_adapter_removed.fastq
- Demultiplex adapter trimmed reads/remove adapter from 5’ end:
cutadapt -e 0 --no-indels --pair-filter=both -g file:adapter_fwd.fasta -G file:adapter_fwd.fasta \
-o {name}.1.fastq -p {name}.2.fastq \
R1_adapter_removed.fastq \
R2_adapter_removed.fastq
Is this the best approach?
Thank you in advance~
Gina