The availability of reliable genomic data is one of the major drivers in advancing genome engineering. The different databases and annotations contain information from nucleotide sequences up to proteins and their functions. However, not all genomic resources are equally helpful or complete. This chapter aims to provide an overview of available genome references and annotations for the industrially most interesting strains of Pichia pastoris (now reclassified to Komagataella phaffii, Komagataella pastoris, and Komagataella kurtzmanii), including a short guideline on which genomic reference to use for which task. Additionally, we will cover the most important databases and data types and show methods on how to access them, based on the strain K. phaffii CBS 7435.
Keywords: Data retrieval; Data visualization; Genome assembly; Komagataella phaffii; Nucleic acid databases; Pichia pastoris.
© 2026. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.