the bioinformatics lab

Ep 32: Best Practices - Pipeline Development, Part Two



PHA4GE Ten Best Practices for Public Health Bioinformatics Pipelines:
https://github.com/pha4ge/public-health-pipeline-best-practices/blob/main/docs/pipeline-best-practices.md
Summary
In this episode, Kevin Libuit and Andrew Page discuss the 10 best practices for public health pipeline development. They begin with common file formats: using standard formats avoids reinventing the wheel, and parsers for them are already available for different languages. They then cover software testing, including automated testing and integrating tests with Docker containers. They emphasize the need for accessible benchmark or validation data sets and the importance of reference data requirements. They also touch on the significance of hiring bioinformaticians and on the documentation practices that should be followed.
Takeaways
Use common file formats to avoid reinventing the wheel and enable compatibility with other programs.
Implement software testing, including automated testing, to ensure functionality and identify bugs.
Provide benchmark or validation data sets to allow users to compare and evaluate the performance of the pipeline.
Consider the reference data requirements and ensure accessibility to curated databases.
Hire bioinformaticians with domain expertise to navigate the complexities of pipeline development.
Follow documentation practices, including communication of authorship, pipeline maintenance statements, and community guidelines for contribution and support.
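
As an illustration of the first two takeaways, here is a minimal sketch in Python. It is not taken from the episode or from the PHA4GE document. It assumes a pipeline that reports per-sample results as a tab-separated file, and the column names (`sample_id`, `species`, `coverage`) and function names are hypothetical. The standard library's `csv` module does the parsing, and a small automated test checks that results survive a write and read round trip.

```python
import csv
import io

# Hypothetical column names for a per-sample results table.
FIELDS = ["sample_id", "species", "coverage"]


def write_results(rows, handle):
    """Write result rows as tab-separated values with a header line."""
    writer = csv.DictWriter(
        handle, fieldnames=FIELDS, delimiter="\t", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)


def read_results(handle):
    """Read tab-separated results back into a list of dictionaries."""
    return list(csv.DictReader(handle, delimiter="\t"))


def test_round_trip():
    """Automated check: what is written can be read back unchanged."""
    rows = [
        {"sample_id": "S1", "species": "Escherichia coli", "coverage": "42.5"}
    ]
    buffer = io.StringIO()
    write_results(rows, buffer)
    buffer.seek(0)
    assert read_results(buffer) == rows


if __name__ == "__main__":
    test_round_trip()
    print("round-trip test passed")
```

A test like `test_round_trip` can be collected by a test runner such as pytest and run on every change, including inside the container image that ships the pipeline.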

By The Bioinformatics Lab

