This blog post is syndicated from a post on Jonathan Sobel’s blog, originally published July 27, 2017. Jonathan was supported by the ongoing OBF travel fellowship program to attend the 2017 Bioinformatics Open Source Conference (BOSC), held as part of the 2017 ISMB/ECCB meeting in Prague, Czech Republic, in July 2017.
The OBF’s Travel Fellowship program continues to help open source bioinformatics software developers with funding to attend conferences, workshops, or training events. The next call closes 15 December 2017.
During the event, I discovered an amazing community of scientists and developers involved in the field of bioinformatics, with a strong open source mindset. “Sharing is caring” and I can tell these guys care a lot! The first day was quite technical with several talks about projects developed during the code fest that happen just prior to the conference. I had the opportunity to discover the FAIR principles (Findability, Accessibility, Interoperability, and Reproducibility) of open data, and several tools aimed at simplifying bioinformatics workflow sharing, visualizing and production. These teams triggered my curiosity towards the Common Workflow Language (CWL) and data standards, notably RABIX, GA4GH and nextflow. Several talks presented very useful tools such as YAMP (Yet Another Metagenomic Pipline!) or MiltiQC for next generation sequencing quality control, and Open MS 2.0 for mass-spectrometry data analysis. One important topic of the day was the reproducibility of bioinformatics piplines and several talks were addressing this question with various approaches, such as containers (Dockers, BioContainers), GNU Guix or package repository such as BioConda.
On the second day, I had the chance to present our BeerDeCoded project in the Citizen Science session of the BOSC. I had the first slot in the morning with an audience of nearly 250 attendees. The beer topic is kind of holy in a geek environment. I had the pleasure to share several important message regarding science conducted outside of academia or industry, in a community laboratory space like Hackuarium. I put some emphasis about science communication between fields and outside of our institutional scientific community. As experts, we have the responsibility to make our knowledge and our researches accessible to a wide audience, and this is exactly our goal with BeerDeCoded and Hackuarium. In addition, I could announce the official release of our first results based on the metagenomic analysis of 39 beer samples. I was able to show our preliminary analysis. The BOSC community was really enthusiastic about the project and attendees tweeted quite a lot about it.
In addition, I had some very interesting questions about the interest of breweries in BeerDeCoded, the potential fear of companies that citizen scientist decode their proprietary yeast strain or about the data integration of sequencing with GC/MS in order to study small molecules present in our beer data-set. Moreover, this talk will potentially trigger new collaborations with with Bérénice Batut from the Galaxy training network. Galaxy is an open source, web-based platform for data intensive biomedical research. The Galaxy platform regroups a collection of bioinformatics tools and workflow that can be run without coding knowledge. This program is widely used by biologists to analyze their next generation sequencing data. BeerDeCoded will benefit from this collaboration with a specific instance of Galaxy to make the beer metagenomics accessible to anyone.
Later during this second day, I was impressed by several other talks. One of the greatest initiative is the work of the H3ABioNet, which aim at training bioinformaticians in Africa. I discovered the Journal of open source software (JOSS) that facilitate the publication of bioinformatics software. Then, BioThings SDK and Wikidata presented their API and their knowledge base that allows retrieving efficiently some annotations of biological data. This day was as well the occasion to discuss about data sharing of human data (wearable, clinical, etc.) in order to improve precision medicine and the ethical implication. In addition, second-hand data usage and the problem of re-digitalization of published data in a non-machine readable format was evoked. Finally, Nick Loman gave the closing keynote presentation at BOSC. He gave a great talk about virus outburst surveillance using the Oxford nanopore minION sequencing technology, using two examples, namely Ebola outburst in Africa in 2015 and the Zika virus in Brazil.
In summary my first BOSC experience was very intense and highly interesting. I met with great scientists and developers and I learned about the newest open source software/library/API and practices in this field. I would like to thank once again the BOF committee for allowing me to join this great event and to give me the opportunity to present Hackuarium and the BeerDeCoded project.