RabbitTClust: enabling fast clustering analysis of millions of bacteria genomes with MinHash sketches
{{output}}
We present RabbitTClust, a fast and memory-efficient genome clustering tool based on sketch-based distance estimation. Our approach enables efficient processing of large-scale datasets by combining dimensionality reduction techniques with streaming and paralle... ...