Naming the unnamed: over 65,000 Candidatus names for unnamed Archaea and Bacteria in the Genome Taxonomy Database


Citation
Pallen et al. (2022). International Journal of Systematic and Evolutionary Microbiology 72 (9)
Names (7)
Subjects
Ecology, Evolution, Behavior and Systematics; General Medicine; Microbiology; Modeling and Simulation
Abstract
Thousands of new bacterial and archaeal species and higher-level taxa are discovered each year through the analysis of genomes and metagenomes. The Genome Taxonomy Database (GTDB) provides hierarchical sequence-based descriptions and classifications for new and as-yet-unnamed taxa. However, bacterial nomenclature, as currently configured, cannot keep up with the need for new well-formed names. Instead, microbiologists have been forced to use hard-to-remember alphanumeric placeholder labels. Here, we exploit an approach to the generation of well-formed arbitrary Latinate names at a scale sufficient to name tens of thousands of unnamed taxa within GTDB. These newly created names represent an important resource for the microbiology community, facilitating communication between bioinformaticians, microbiologists and taxonomists, while populating the emerging landscape of microbial taxonomic and functional discovery with accessible and memorable linguistic labels.
Authors
Publication date
2022-09-20
DOI
10.1099/ijsem.0.005482

© 2022-2024 The SeqCode Initiative
All information contributed to the SeqCode Registry is released under the terms of the Creative Commons Attribution (CC BY) 4.0 license.