ChromoDB data update: a new working release and ongoing quality control

ChromoDB has been updated with a new working dataset comprising more than 130,000 chromosome-count records. This new upload represents a significant step in the progressive consolidation of the database and incorporates a large amount of recently curated information from the main ChromoDB working tables.

The update also improves the integration of taxonomic and external reference information. Where available, records now include links to relevant external resources such as POWO and GBIF, together with bibliographic information, DOI links and source URLs. These links are intended to improve traceability and make each record easier to verify against its original taxonomic and bibliographic context.

This release should be understood as a working update, not as a final curated version of the database. During the upload and subsequent checks, several issues were identified that will guide the next phase of data cleaning. One of the immediate priorities is the standardization of country names, as some records still contain non-standard, translated or inconsistent geographic entries. Bibliographic matching and normalization also remain central tasks, particularly for older chromosome-count literature.

Making this update available is part of ChromoDB’s broader strategy: to publish data progressively, while documenting the curation process openly. Rather than waiting for a fully closed and definitive dataset, ChromoDB will continue to grow through successive updates, each one accompanied by additional checks, corrections and improvements.

The next steps will focus on three main areas: geographic standardization, bibliographic revision and further validation of links to taxonomic resources. These tasks are essential for improving the long-term reliability of ChromoDB and for preparing future data publication workflows.