May 2013 M T W T F S S « Feb 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31
Tag Archives: Biopython
We’ve just released a beta of Biopython 1.55 for user testing. Since Biopython 1.54 was released three months ago, we’ve made a good start on work for Python 3 support (via the 2to3 script), but as a side effect of … Continue reading
The Biopython team is proud to announce Biopython 1.54, a new stable release of the Biopython library. Biopython 1.54 comes five months after our last release and brings new features, tweaks to some established functions and the usual collection of … Continue reading
In another quirk to the FASTQ story, recent Illumina FASTQ files don’t actually use the full range of PHRED scores – and a score of 2 has a special meaning, The Read Segment Quality Control Indicator (RSQCI, encoded as ‘B’). … Continue reading
This is another blog post to highlight one of the neat tricks you’ll be able to do with Biopython 1.54 (which you can help test with the Biopython 1.54 beta release). It is often useful to be able to extract … Continue reading
One of the small changes coming in Biopython 1.54 (which you can try out already using the Biopython 1.54 beta) is to Bio.SeqIO and Bio.AlignIO. Previously the input and output functions had required file handles, but they will now also … Continue reading
A beta release for Biopython 1.54 is now available for download and testing.
I’m delighted to announce an open access publication in Nucleic Acids Research describing the FASTQ file format based on the conventions agreed by the OBF projects: The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ … Continue reading
We are pleased to announce the availability of Biopython 1.53, a new stable release of the Biopython library, three months after the release of Biopython 1.52. This is our first release since migrating from CVS to git for source code … Continue reading
This post is about paired end data (FASTA or FASTQ) and manipulating it with Biopython’s Bio.SeqIO module (see also FASTQ conversions & speeding up FASTQ).
Biopython’s SeqIO interface revolves around SeqRecord objects which can impose a speed penalty. For FASTQ files the quality string gets turned into a list of integers on parsing, and then re-encoded back to ASCII on writing. Working directly with the raw strings is less flexible, but much faster. Continue reading