Illumina FASTQ files – Read Segment Quality Control Indicator

The Read Segment Quality Control Indicator

from Bio.SeqIO.QualityIO import FastqGeneralIterator

# Reads shorter than this after trimming are discarded
min_length = 10

with open("untrimmed.fastq") as in_handle, open("B_trimmed.fastq", "w") as out_handle:
    for title, seq, qual in FastqGeneralIterator(in_handle):
        # Find the location of the first "B" (PHRED quality 2)
        trim = qual.find("B")
        if trim == -1:
            # No "B" in the quality string, so no need to trim
            out_handle.write("@%s\n%s\n+\n%s\n" % (title, seq, qual))
        elif trim >= min_length:
            # Take everything up to the first "B"
            out_handle.write("@%s\n%s\n+\n%s\n" % (title, seq[:trim], qual[:trim]))
        # Otherwise the trimmed read would be too short, so it is dropped
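
The trimming rule used in the script above can also be tried on single records without Biopython or any input files. The following is only a minimal sketch: the helper name trim_at_b and the example reads are made up for illustration, and it assumes the same convention as the script, namely that the first "B" in the quality string marks the start of the unreliable segment.

```python
def trim_at_b(seq, qual, min_length=10):
    """Trim one read at the first "B" in its quality string.

    Hypothetical helper, not part of Biopython. Returns the
    (sequence, quality) pair after trimming, or None if the
    trimmed read would be shorter than min_length.
    """
    if len(seq) != len(qual):
        raise ValueError("sequence and quality strings differ in length")
    trim = qual.find("B")
    if trim == -1:
        # No "B" present, keep the read unchanged
        return seq, qual
    if trim >= min_length:
        # Keep everything before the first "B"
        return seq[:trim], qual[:trim]
    # Too short after trimming, so the read is dropped
    return None


# Made-up 16-base reads, for illustration only
read = "ACGT" * 4
print(trim_at_b(read, "h" * 16))             # kept unchanged
print(trim_at_b(read, "h" * 12 + "B" * 4))   # trimmed to 12 bases
print(trim_at_b(read, "h" * 4 + "B" * 12))   # None (only 4 bases would remain)
```

Returning None for a dropped read keeps the filtering decision separate from the file writing, so the same function could be used inside a loop over FastqGeneralIterator.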
