IntroductionLeptospirosis is a globally-distributed, potentially-fatal, emerging infectious disease 1. Case fatality rates of up to 20% are due to inadequate diagnostic tools, a limited understanding of mechanisms of disease pathogenesis, and poorly understood leptospiral virulence mechanisms. Symptoms vary from a self-resolving, undifferentiated febrile illness to multiorgan failure and fulminant death. Leptospira are genetically- and antigenically-diverse Gram-negative spirochetes phylogenetically-resolved into three sub-clades of six or more species of similar pathogenicity, and 24 well-studied serogroups [>300 distinct serotypes]. Until recently, the varied clinical presentation of leptospirosis was a presumed to be related to leptospiral diversity with certain species and serovars being inherently more virulent, but evidence to support this assumption is not supported by comparative whole genome analysis (CWGA)2. Leptospira CWGA is based on long-term culture-adapted strains [non-pathogens, attenuated clinical isolates, and avirulent reference strains]; most are based on draft genomes (https://www.ncbi.nlm.nih.gov/genome/genomes/). Consequently, genome data from low-passage strains are critical. Here, we describe the annotation of recently-completed genomes of L. interrogans strains [FMAS_KW1, FMAS_KW2 and FMAS_AW1] recently isolated from three leptospirosis patients.
All strains were isolated in the context of a prospective clinical study of undifferentiated febrile illness in humans that began in 2015 from epidemiologically-contrasting sites around Sri Lanka. Full details of the study and methods have been published elsewhere 3. Samples for cultures were obtained from febrile patients who were clinically classified as 'probable' leptospirosis cases who presented at hospital, two from Karawanalla (FMAS_KW1 and FMAS_KW2), and one from Awissawella (FMAS_AW1). At the bedside, four drops of fresh whole blood were inoculated into 9 mL of semisolid EMJH 4 medium supplemented with 5-flourouracil and neomycin, incubated at 300C, and checked biweekly under darkfield microscopy for growth. All three isolates grew slowly, requiring 10, 15 and 17 weeks post-inoculation, respectively, before cells were visible. Serotyping of newly isolated Leptospira strains was done in the Pasteur Institute, France, using a standard panel of rabbit antisera against reference serovars representing 24 main serogroups 5. Semisolid cultures were sub-cultured into liquid EMJH medium, and sub-cultured no more than twice more in liquid EMJH prior to genomic DNA extraction, which was done from log phase growth6; DNA purification was done using a commercially available genomic DNA purification kit (Gene Jet, ThermoFisher). SMRTbell libraries were generated and sequenced on a PacBio RS II system (Pacific Biosciences, Menlo Park, CA, USA). A minimum of 30X read coverage was obtained for all three isolates. Raw read data were preprocessed using an in-house developed quality control pipeline. Genomes were assembled de novo using Canu 1.8 7 then circularized using Circlator 8 (http://sanger-pathogens.github.io/circlator), and polished using Quiver 9. The workflow resulted in two, three and four overlapping contigs each for FMAS_KW2, FMAS_KW1 and FMAS_AW1, respectively. The fully closed genomes were then annotated by the NCBI Prokaryotic Genome Annotation Pipeline. Software was run with default settings.
Genome sizes ranged from 4.65Mbp (FMAS_KW2) to 5.07 Mbp (FMAS_AW1), Table 1. All contained the typical two chromosomes, and all encoded 37tRNA genes covering all 20 amino acids (Table 1). Two strains, FMAS_AW1 and FMAS_KW1, contained additional large replicons presumptively classified as plasmids by curators of the NCBI Prokaryotic Genome Annotation Pipeline resource (Table 1). Notably, FMAS_AW1 is amongst the largest Leptospira genomes reported to date, with its presumed plasmid, pLiSL1 (approx. size, 130 Kbp), being the largest known extra-chromosomal Leptospira replicon.
The dendrogram based on genomic blast against the NCBI whole genome database (https://www.ncbi.nlm.nih.gov/genome/179?) including 311 L. interrogans genomes shows that FMAS_KW1, FMAS_KW2 and FMAS_AW1 form a discrete cluster with two previously sequenced strains: L. interrogans serogroup Autumnalis serovar Weerasinghe strain 6L-int, and L. interrogans serogroup Pyrogenes serovar Pyrogenes strain Sri Lanka 14 (Figure 1), both of which have been isolated previously in Sri Lanka10. Serotyping data from FMAS_KW2 and FMAS_AW1 showed that are within serogroup Autumnalis. FMAS_KW1 was slow growing and was unable to be serotyped in the reference laboratory.
Five leptospiral species--L. interrogans, L. kirschneri, L. borgpetersenii, L. weilli and L. santarosai--have been reported as causing human leptospirosis in Sri Lanka 10. The strains from which this information derived is from Sri Lanka dating to the 1960s and 1970s, except the two strains reported in 2018 11, showing current scarcity of knowledge on circulating strains of Leptospira in Sri Lanka. Of these, several annotated, genomes originating from Sri Lanka of strains of imprecise provenance and very large passage number are publicly available. Of leptospiral species/strains isolated in Sri Lanka to date, only the genome of L. borgpetersenii serogroup Sejroe serovar Ceylonica strain Piyasena isolated in 1964 (from a male patient in Colombo) has been completed and closed. The present work adds three more complete genomes to this database, widening the knowledge on Leptospira genome. In-depth analysis of new genomes published in the present paper will enhance the knowledge on pathogenesis and evolution of Leptospira.