ViroWatch is a bioinformatics software pipeline designed to facilitate real-time molecular surveillance of viral genomes. This lightweight, graph-based system efficiently handles and aggregates genomic sequencing data, particularly focusing on HIV-1 genomes. It is optimized for low-resource settings and can process data from multiple sequencing platforms, providing rapid and accessible insights into viral transmission dynamics and potential drug resistance.
Important
This project is currently under active development, no release have been made.
In the meantime you can use a command line implementation of the pipeline in HIV-64148 assembly pipeline.
ViroWatch aims to enhance the management of genomic sequencing data using graph-based approaches, improving the scalability and flexibility of viral surveillance systems. It is designed to analyze HIV-1 genomes from clinical samples and public databases, integrating molecular and clinical data for comprehensive viral monitoring.
- Integrating molecular surveillance with clinical data and existing virological databases.
- Developing a scalable and portable system for rapid deployment, even in low-resource settings.
- ViroWatch enables public health authorities and researchers to:
- Track the emergence of drug-resistant HIV-1 strains.
- Monitor potential viral transmission clusters.
- Graph-Based Data Management: Utilizes graph databases for flexible and scalable storage of genomic data.
- Compatibility with Multiple Sequencing Platforms: Supports long-read sequencing technologies (e.g., Oxford Nanopore, PacBio).
- Real-Time Surveillance: Enables rapid analysis and visualization of transmission clusters and genomic diversity.
- Portable and Lightweight: Runs on standard computer systems, making it accessible to labs in low-resource settings.
- Data Integration: Combines clinical data with existing virological and molecular epidemiology datasets.
- QC Tools (selectable)
- Taxonomic sequence classification
- Kraken2 (v2.1.3)
- Assemblers (selectable)
- Canu (v2.2)
- Flye/MetaFlye (v2.9.5)
- HaploDMF (latest)
- Variant caller
- NanoCaller (v3.6.0) - Variant calling from reads.
- Snippy (latest) - Variant calling from assembled contigs.
- HIV-1 sequences from Los Alamos HIV databases.
- Stanford HIV Drug Resistance Database.
- Publicly available sequences from NCBI.