PaperPlayer biorxiv bioinformatics

BRAKER2: Automatic Eukaryotic Genome Annotation with GeneMark-EP+ and AUGUSTUS Supported by a Protein Database


Listen Later

Link to bioRxiv paper:
http://biorxiv.org/cgi/content/short/2020.08.10.245134v1?rss=1
Authors: Bruna, T., Hoff, K., Stanke, M., Lomsadze, A., Borodovsky, M.
Abstract:
Full automation of gene prediction has become an important bioinformatics task since the advent of next generation sequencing. The eukaryotic genome annotation pipeline BRAKER1 had combined self-training GeneMark ET with AUGUSTUS to generate genes coordinates with support of transcriptomic data. Here, we introduce BRAKER2, a pipeline with GeneMark EP+ and AUGUSTUS externally supported by cross-species protein sequences aligned to the genome. Among the challenges addressed in the development of the new pipeline was generation of reliable hints to the locations of protein-coding exon boundaries from likely homologous but evolutionarily distant proteins. Under equal conditions, the gene prediction accuracy of BRAKER2 was shown to be higher than the one of MAKER2, yet another genome annotation pipeline. Also, in comparison with BRAKER1 supported by a large volume of transcript data, BRAKER2 could produce a better gene prediction accuracy if the evolutionary distances to the reference species in the protein database were rather small. All over, our tests demonstrated that fully automatic BRAKER2 is a fast and accurate method for structural annotation of novel eukaryotic genomes.
Copy rights belong to original authors. Visit the link for more info
...more
View all episodesView all episodes
Download on the App Store

PaperPlayer biorxiv bioinformaticsBy Multimodal LLC