Category Archives: HOWTO

Cross-links in GenomeDiagram

I’ve just finished writing up an example for the Biopython Tutorial of the new GenomeDiagram functionality added in Biopython 1.59. You can now control the start and end points of individual tracks, and you can add cross-links between regions of … Continue reading

Posted in Biopython, Blogroll, Code, Community, Development, HOWTO, OBF, OBF Projects | Tagged | 1 Comment

Chromosome Diagrams in Biopython

One of the new things coming in Biopython 1.59 is improved chromosome diagrams, something you may have seen via Twitter. I’ve just been updating the Biopython Tutorial (current version here, PDF) to include an example drawing this: Here’s a PDF … Continue reading

Posted in Biopython, Blogroll, Code, Development, Documentation, HOWTO, OBF, OBF Projects | Tagged | 1 Comment

Illumina FASTQ files – Read Segment Quality Control Indicator

In another quirk to the FASTQ story, recent Illumina FASTQ files don’t actually use the full range of PHRED scores – and a score of 2 has a special meaning, The Read Segment Quality Control Indicator (RSQCI, encoded as ‘B’). … Continue reading

Posted in Biopython, Blogroll, Code, Community, Development, Documentation, HOWTO, OBF, OBF Projects | Tagged , | Leave a comment

Partial sequence files with Biopython

This is another blog post to highlight one of the neat tricks you’ll be able to do with Biopython 1.54 (which you can help test with the Biopython 1.54 beta release). It is often useful to be able to extract … Continue reading

Posted in Biopython, Blogroll, Code, Development, HOWTO, OBF, OBF Projects | Tagged | Comments Off

Interleaving paired FASTQ files with Biopython

This post is about paired end data (FASTA or FASTQ) and manipulating it with Biopython’s Bio.SeqIO module (see also FASTQ conversions & speeding up FASTQ).

Posted in Biopython, Blogroll, Code, Community, Development, Documentation, HOWTO, OBF Projects | Tagged , | Leave a comment

Working with FASTQ files in Biopython when speed matters

Biopython’s SeqIO interface revolves around SeqRecord objects which can impose a speed penalty. For FASTQ files the quality string gets turned into a list of integers on parsing, and then re-encoded back to ASCII on writing. Working directly with the raw strings is less flexible, but much faster. Continue reading

Posted in Biopython, Blogroll, Code, Community, Development, Documentation, HOWTO, OBF Projects | Tagged , | Comments Off

Simpler, optimized format conversion with Biopython

In this post we’ll look at the new convert() function that both Bio.SeqIO and Bio.AlignIO will get in Biopython 1.52. This allows easier file conversion, and internally provides a mechanism for specific optimisations, such as for FASTQ conversions. Continue reading

Posted in Biopython, Blogroll, Code, Development, Documentation, HOWTO, OBF, OBF Projects | Tagged , | Comments Off