PanWSGA enables researchers to project a single bacterial genome onto a curated, species-level pangenome reference derived from more than 100,000 high-quality assemblies.
The platform integrates standardized reannotation, a custom classification engine, and a dedicated visualization module to deliver reproducible gene-category profiles and functional summaries within minutes.
PanWSGA classifies genes from a single genome as core, accessory, cloud, or unique by mapping them to a curated species-level pangenome. This provides pangenome-level context even when no multi-genome dataset is available.
A purpose-built Python engine implementing strict gene-to-cluster matching criteria (≥90% identity and ≥90% coverage). It performs deterministic gene assignment, optimized for large species-level cluster sets, and outputs category-specific FASTA files, tables, and logs.
A dedicated visualization subsystem that integrates classification and EggNOG functional annotations to produce structured, publication-ready biological summaries. GeneEye supports both static high-resolution outputs and interactive browser-based charts.
PanWSGA includes 1,975 species-level pangenomes, each constructed from filtered, uniformly reannotated genomes and manually reviewed to remove cluster inconsistencies, paralog inflation, and contamination.
All analyses follow a fixed, standardized workflow—Prokka → Unique Gene Finder → EggNOG-mapper → GeneEye—executed in a containerized environment for consistency and reproducibility across submissions.
All computational steps run on the server. No installation, local dependencies, or computational infrastructure are required.
a genome assembly in FASTA format
The genome is reannotated using Prokka
Unique Gene Finder maps predicted proteins to pangenome clusters
EggNOG-mapper assigns COG/KEGG/EC annotations
GeneEye generates static and dynamic summaries
Users receive full output files, annotation tables, and publication-ready figures
Typical runtime: 2–20 minutes depending on genome size
of new bacterial isolates
of rare, unusual, or unique gene content
of core vs. accessory gene sets
of research figures and summaries
Use in clinical, surveillance, academic, and instructional settings