Chris Mungall
Capturing knowledge in the life sciences is a huge undertaking. The scope of the field extends from the atomic level up to planetary-scale ecosystems, and a wide variety of disciplines collaborate on the research.
Chris Mungall and his colleagues at the Berkeley Lab tackle this knowledge-management challenge with well-honed collaborative methods and AI-augmented computational tooling that streamlines the organization of these precious scientific discoveries.
We talked about:
his biosciences and genetics work at the Berkeley Lab
how the complexity and the volume of biological data he works with led to his use of knowledge graphs
his early background in AI
his contributions to the gene ontology
the unique role of bio-curators, non-semantic-tech biologists, in the biological ontology community
the diverse range of collaborators involved in building knowledge graphs in the life sciences
the variety of collaborative working styles that groups of bio-creators and ontologists have created
some key lessons learned in his long history of working on large-scale, collaborative ontologies, key among them, meeting people where they are
some of the facilitation methods used in his work, tools like GitHub, for example
his group's decision early on to commit to version tracking, making change-tracking an entity in their technical infrastructure
how he surfaces and manages the tacit assumptions that diverse collaborators bring to ontology projects
how he's using AI and agentic technology in his ontology practice
how their decision to adopt versioning early on has enabled them to more easily develop benchmarks and evaluations
some of the successes he's had using AI in his knowledge graph work, for example, code refactoring, provenance tracking, and repairing broken links
Chris's bio
Chris Mungall is Department Head of Biosystems Data Science at Lawrence Berkeley National Laboratory. His research interests center around the capture, computational integration, and dissemination of biological research data, and the development of methods for using this data to elucidate biological mechanisms underpinning the health of humans and of the planet. He is particularly interested in developing and applying knowledge-based AI methods, particularly Knowledge Graphs (KGs) as an approach for integrating and reasoning over multiple types of data. Dr. Mungall and his team have led the creation of key biological ontologies for the integration of resources covering gene function, anatomy, phenotypes and the environment. He is a principal investigator on major projects such as the Gene Ontology (GO) Consortium, the Monarch Initiative, the NCATS Biomedical Data Translator, and the National Microbiome Data Collaborative project.
Connect with Chris online
LinkedIn
Berkeley Lab
Video
Here’s the video version of our conversation:
https://youtu.be/HMXKFQgjo5E
Podcast intro transcript
This is the Knowledge Graph Insights podcast, episode number 37. The span of the life sciences extends from the atomic level up to planetary ecosystems. Combine this scale and complexity with the variety of collaborators who manage information about the field, and you end up with a huge knowledge-management challenge. Chris Mungall and his colleagues have developed collaborative methods and computational tooling that enable the construction of ontologies and knowledge graphs that capture this crucial scientific knowledge.
Interview transcript
Larry:
Hi everyone. Welcome to episode number 37 of the Knowledge Graph Insights podcast. I am really delighted today to welcome to the show Chris Mungall. Chris is a computational scientist working in the biosciences at the Lawrence Berkeley National Laboratory. Many people just call it the Berkeley Lab. He's the principal investigator in a group there, has his own lab working on a bunch of interesting stuff, which we're going to talk about today.