AllTheBacteria documentation
All WGS isolate bacterial INSDC data to August 2024 uniformly assembled, QC-ed, annotated, searchable.
Follow up to Grace Blackwell’s 661k dataset (which covered everything to Nov 2018).
Preprint: https://doi.org/10.1101/2024.03.08.584059
Please use the github repository to raise any issues: https://github.com/AllTheBacteria/AllTheBacteria/issues
Contents:
- Overview
- Metadata and QC
- SQLite metadata
- Assemblies
- Species identification
- Annotation
- Antimicrobial Resistance
- Biosynthetic Gene Clusters
- Species specific typing
- Archaea
- Sequence alignment with LexicMap
- Contributing to AllTheBacteria
- Batch downloading from OSF
- FAQ
- Release History
- Migration from EBI FTP to OSF