Research Article |
Corresponding author: Uwe Fritz ( uwe.fritz@senckenberg.de ) Academic editor: Ralf Britz
© 2021 Marika Asztalos, Dinçer Ayaz, Yusuf Bayrakcı, Murat Afsar, Cemal Varol Tok, Carolin Kindler, Daniel Jablonski, Uwe Fritz.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Asztalos M, Ayaz D, Bayrakcı Y, Afsar M, Tok CV, Kindler C, Jablonski D, Fritz U (2021) It takes two to tango – Phylogeography, taxonomy and hybridization in grass snakes and dice snakes (Serpentes: Natricidae: Natrix natrix, N. tessellata). Vertebrate Zoology 71: 813-834. https://doi.org/10.3897/vz.71.e76453
|
Using two mitochondrial DNA fragments and 13 microsatellite loci, we examined the phylogeographic structure and taxonomy of two codistributed snake species (Natrix natrix, N. tessellata) in their eastern distribution area, with a focus on Turkey. We found evidence for frequent interspecific hybridization, previously thought to be extremely rare, and for backcrosses. This underscores that closely related sympatric species should be studied together because otherwise the signal of hybridization will be missed. Furthermore, the phylogeographic patterns of the two species show many parallels, suggestive of a shared biogeographic history. In general, the phylogeographies follow the paradigm of southern richness to northern purity, but the dice snake has some additional lineages in the south and east in regions where grass snakes do not occur. For both species, the Balkan Peninsula and the Caucasus region served as glacial refugia, with several mitochondrial lineages occurring in close proximity. Our results show that the mitochondrial divergences in both species match nuclear genomic differentiation. Yet, in the former glacial refugia of grass snakes there are fewer nuclear clusters than mitochondrial lineages, suggesting that Holocene range expansions transformed the glacial hotspots in melting pots where only the mitochondrial lineages persisted, bearing witness of former diversity. On the other hand, the deep mitochondrial divergences in N. tessellata across its entire range indicate that more than one species could be involved, even though lacking microsatellite data outside of Turkey prevent firm conclusions. On the contrary, our microsatellite and mitochondrial data corroborate that N. megalocephala is invalid and not differentiated from sympatric populations of N. natrix. For Cypriot grass snakes, our analyses yielded conflicting results. A critical assessment of the available evidence suggests that N. natrix is a genetically impoverished recent invader on Cyprus and taxonomically not distinct from a subspecies also occurring in western Anatolia and the southern Balkans. Based on combined mitochondrial and nuclear genomic evidence we propose that for grass snakes the following subspecies should be recognized in our study region: (1) Natrix natrix vulgaris Laurenti, 1768, southeastern Central Europe and northern Balkans; (2) Natrix natrix moreotica (Bedriaga, 1882), southern Balkans, western Anatolia, and Cyprus; and (3) Natrix natrix scutata (Pallas, 1771), eastern Anatolia, Caucasus region, Iran, northeastern distribution range (from eastern Poland and Finland to Kazakhstan and the Lake Baikal region). Thus, Natrix natrix cypriaca (Hecht, 1930) becomes a junior synonym of N. n. moreotica and Natrix natrix persa (Pallas, 1814) becomes a junior synonym of N. n. scutata. Due to insufficient material, we could not resolve the status of Natrix natrix syriaca (Hecht, 1930) from the Gulf of İskenderun, southeastern Turkey.
Biogeography, hybridization, Natrix natrix, Natrix megalocephala, Natrix tessellata, phylogeography, subspecies, taxonomy, Turkey
The genus Natrix Laurenti, 1768 comprises five widely distributed Palearctic snake species (
Natrix maura is a Western Mediterranean and Western European species, ranging from northwestern Africa and the Iberian Peninsula to central and southern France and northwestern Italy. Introduced populations also occur on the Balearic Islands and Sardinia (
Until a few years ago, the three grass snake species were lumped together in one polytypic species, N. natrix sensu lato (
Each of the three grass snake species is polytypic. Recently,
In the present study we aim to resolve the taxonomic and genetic differentiation of grass snakes (N. natrix sensu stricto) in the Balkan Peninsula, Anatolia, the Caucasus region and Iran. In addition, we scrutinize whether the phylogeographic and taxonomic differentiation of the dice snake (N. tessellata) parallels that of N. natrix in our study region, and we examine, for the first time, whether and, if so, to which extent, natural hybridization occurs between these two sympatric species. For doing so, we apply an established toolbox consisting of combined evidence from mitochondrial DNA (mtDNA) sequences and 13 nuclear genomic microsatellite loci (
Data for 683 grass snakes (Natrix natrix) and 332 dice snakes (N. tessellata) were analyzed for the present study. A total of 181 tissue samples (115 N. natrix, 66 N. tessellata) were newly processed and microsatellite data were generated for 15 additional samples of N. natrix with previously published mtDNA sequences (
Two marker systems (mtDNA, microsatellite loci) were applied, which were successfully used in our previous studies on the phylogeography and hybridization of grass snakes (
Laboratory procedures followed
Two mtDNA fragments were sequenced, the cytochrome b (cyt b) gene (1,117 bp) and the partial ND4 gene plus adjacent DNA coding for tRNAs (tRNA-His, tRNA-Ser, tRNA-Leu; 866 bp). For some samples the obtained sequences were shorter and for a few samples, only one mtDNA fragment could be sequenced (Table S1).
In addition to mtDNA sequencing, the samples were also genotyped at the same 13 polymorphic microsatellite loci as in our previous studies. For N. tessellata, the presence of the microsatellite motifs was verified by sequencing the PCR products in both directions using unlabeled primers. For detailed information on microsatellite loci, primers and PCR conditions for microsatellites and mtDNA, see Tables S2 and S3,
Cyt b sequences could be generated for 110 samples (55 Natrix natrix, 55 N. tessellata), ND4+tRNAs for 112 samples (73 N. natrix, 39 N. tessellata). These sequences were aligned with those from previous studies (
For each of the two species and for individual lineages within each species, uncorrected p distances (averages) were computed based on the haplotypes of each mtDNA fragment using mega 10.2.3 (
Our microsatellite data were subjected to unsupervised Bayesian cluster analyses using structure 2.3.4 (
To infer whether individual genotypes in structure analyses represent hybrids or pure individuals, thresholds for hybrid identity were determined using hybridlab 1.0 (
structure analyses are based on population-genetic presumptions (Hardy-Weinberg equilibrium, linkage equilibrium;
For selected populations, allelic richness was calculated using fstat 2.9.4 (
In parsimony network analyses (cyt b, ND4+tRNAs), each mitochondrial lineage of Natrix natrix and N. tessellata revealed by phylogenetic analyses (
The network for cyt b of Natrix natrix contained several loops (Fig.
Parsimony network for 705 cyt b sequences of Natrix natrix. Symbol sizes reflect haplotype frequencies. Small black circles are missing node haplotypes; each line connecting two haplotypes corresponds to one mutation step, if not otherwise indicated by red numbers along lines. Haplotype colors correspond to lineages.
The following cyt b haplotypes were newly identified in the present study: gn22-gn27, gy15-gy31, and l23 (European Nucleotide Archive accession numbers OU862648–OU862671).
The network for ND4+tRNAs sequences of N. natrix (Fig. S1) resembled the cyt b network, but with fewer mutations, and the clusters of lineages 4 and 5 were not clearly distinct. Again, the haplotype cluster of lineage 7 was located in the center of the network, with direct connections to haplotypes of lineages 1, 3, 5, 6, and 8. The 30 individual haplotypes of lineage 7 differed by up to 14 mutations. The haplotypes of lineage 7 were connected to haplotypes of lineage 1 by a minimum of 35 mutations. Lineage 1 comprised three haplotypes that differed by a maximum of four mutations. Haplotypes of lineage 2 were connected to haplotypes of lineage 1 by a minimum of four mutations. The four haplotypes of lineage 2 differed by a maximum of three mutations. In addition, haplotypes of lineage 7 were connected over at least 25 mutations to haplotypes of lineage 5. The latter lineage comprised 20 individual haplotypes that differed by a maximum of eight mutations. Lineage 5 was connected with haplotypes of lineage 4 by a minimum of only 2 mutations. Lineage 4 comprised 39 haplotypes that differed by a maximum of ten mutations. The only haplotype of lineage 6 was connected to lineage 7 by a minimum of ten mutations. Furthermore, haplotypes of lineage 7 were also connected to those of lineages 3 (minimum 29 mutations) and 8 (minimum 16 mutations). Haplotypes of lineages 3 and 8 were separated by a minimum of 37 mutations. The 48 haplotypes of lineage 3 and the 21 haplotypes of lineage 8 differed both by a maximum of nine mutations.
The following haplotypes for ND4+tRNAs were newly identified in the present study: gn17–gn21, gy13–gy31, and r39 (European Nucleotide Archive accession numbers OU862605–OU862629). All currently known haplotypes for N. natrix and their accession numbers are listed in the Supplementary Information (Table S6).
For cyt b of Natrix tessellata (Fig.
Parsimony network for 435 cyt b sequences of Natrix tessellata. Symbol sizes reflect haplotype frequencies. Small black circles are missing node haplotypes; each line connecting two haplotypes corresponds to one mutation step, if not otherwise indicated by red numbers along lines. Haplotype colors correspond to lineages.
The following cyt b haplotypes were newly identified in the present study: E52–E61 and T16–T24 (European Nucleotide Archive accession numbers OU862672–OU862690).
The second network (ND4+tRNAs) for N. tessellata was restricted to two lineages, E and T (Fig. S2). The haplotypes of the two lineages were connected over two pathways by a minimum of 44 mutations each. Lineage E corresponded to eight haplotypes, which differed by a maximum of six mutations. The 10 haplotypes of lineage T differed by up to 28 mutations.
All haplotypes for ND4+tRNAs were newly identified in the present study: E1–E8 and T1–T10 (European Nucleotide Archive accession numbers OU862630–OU862647). These haplotypes and all currently known cyt b haplotypes for N. tessellata and their accession numbers are listed in the Supplementary Information (Table S6).
Using distinct haplotypes, the average uncorrected p distance between dice snakes and grass snakes was for the cyt b gene 11.50%, and for the DNA coding for the partial ND4 gene plus tRNAs 12.10% (Table S7). For the eight mitochondrial lineages of Natrix natrix, divergences of 0.96–4.21% were observed for the cyt b gene, and for ND4+tRNAs, 0.60–5.17%. The nine mitochondrial lineages of N. tessellata were more divergent. In the cyt b gene, they differed on average by 1.34–6.15%, and for ND4+tRNAs haplotypes, a divergence of 6.93% was observed for lineages E and T (Table S8). The highest divergence values within N. tessellata resemble those between the three grass snake species N. astreptophora, N. helvetica, and N. natrix (
When the geographic distribution of the individual mitochondrial lineages of the two Natrix species is compared (Fig.
Sampling sites and geographic distribution of mtDNA lineages for 653 grass snakes (Natrix natrix; top) and 325 dice snakes (N. tessellata; bottom) used in this study (eastern parts of the distribution ranges). For N. tessellata, data outside Turkey are from
For each of the two species, the 13 studied microsatellite loci were highly polymorphic, with 9–32 alleles per locus for Natrix natrix and 3–19 alleles per locus for N. tessellata. Total allele numbers for N. natrix were 229 and for N. tessellata, 158 (for allele size range and individual allele numbers per locus, see Table S3).
After initial analysis of our total dataset (n = 705) using structure 2.3.4, hidden structuring was examined using hierarchically nested subsets corresponding to each of the two species and genetic clusters within either species, with admixed individuals in each step removed. The respective results were then compared to the mitochondrial identity of each sample. In addition to the structure analyses, PCAs were calculated for the same datasets to scrutinize the structure results using an approach free of population-genetic assumptions.
For the structure run using all 705 genotypes, the ΔK method identified K = 2 as the optimal number of clusters. However, a second pronounced peak occurred for K = 5 (Fig. S3A). Under K = 2, one cluster matched Natrix natrix and the other N. tessellata (Fig.
Genotypic structuring of grass snakes and dice snakes in the study region (n = 705). In the map, the four clusters representing Natrix natrix under K = 5 are merged and shown in orange. Admixed ancestries are depicted as pie chart sectors according to Q values as explained in the text. In the bar plots below the map, samples are represented by three bars, indicating their mitochondrial lineage (top) and their inferred structure cluster membership under K = 2 and K = 5 (center and bottom). Mitochondrial lineages of N. natrix are color-coded as in Figure
Most of the interspecific hybrids were from Turkey, but hybrid signatures were also found in the Balkans, Transcaucasia and Russia. Sixty-six hybrids were morphologically identified as grass snakes (64 as N. natrix, 2 as ‘N. megalocephala’) and the remaining seven hybrids were morphologically identified as N. tessellata. The majority of hybrids (49) held mitochondrial haplotypes of N. natrix, three had haplotypes of N. tessellata, and for the remaining snakes the mitochondrial identity is unknown (Table S1). Under K = 2, structure assigned most samples from Cyprus to the cluster for pure N. tessellata, even though the snakes were morphologically unambiguous N. natrix. A few other Cypriot samples were genotypically assigned to N. natrix or identified as interspecific hybrids. Yet, under K = 5, the genotypes of the controversial Cypriot snakes were not lumped together with N. tessellata but constituted an extra cluster (Fig.
In the face of these conflicting results and the morphological identity of the Cypriot snakes, we used the K = 5 scenario as the basis for further calculations. For calculating Q values in hybridlab to determine interspecific hybrids, the individual Q values of the three clusters for N. natrix and of the Cypriot cluster were merged and opposed to the Q values of the cluster for N. tessellata. This resulted in thresholds of 93% for pure N. natrix and 94% for pure N. tessellata.
Principal Component Analyses for microsatellite data of all 705 samples (Natrix natrix, N. tessellata). The same dataset was used as for structure analyses. Samples are colored according to their mitochondrial lineage or structure cluster membership; admixed individuals in structure analyses are shown in grey. Thresholds for admixed samples were derived from hybridlab simulations and the same as used for structure. Oval outlines correspond to 95% confidence intervals. PC1 explains 3.96% of variance, PC2 2.13%, and PC3 1.70%. Some snakes morphologically matching N. tessellata are highlighted. The pink arrows denote the only grass snake corresponding to mitochondrial lineage 6 of N. natrix.
Based on these thresholds for pure Natrix natrix and N. tessellata, a second structure run was conducted (Fig.
In the third structure analysis, the cluster of N. natrix from the south and east, matching several distinct mtDNA lineages, was examined. Samples representing pure N. natrix of the western cluster and admixed snakes were removed. For the 159 processed samples, the ΔK method revealed K = 3 as the optimal solution (Fig. S3C). A mainly western cluster (purple in Fig.
The results of the second and third structure analyses largely mirrored the outcome of the first structure calculation for K = 5 (Fig.
Genotypic structuring of Natrix natrix in the study region. Symbol colors indicate structure cluster membership; admixed ancestries are depicted as pie chart sectors according to Q values. In the bar plots below the maps, samples are represented by two bars, indicating their mitochondrial lineage (top) and their inferred cluster membership (bottom). Mitochondrial lineages are color-coded as in Figure
Principal Component Analyses for microsatellite data of (A) pure Natrix natrix (n = 626) and (B) N. natrix of the southern and eastern cluster (n = 159). The same datasets were used as for structure analyses. Samples are colored according to their mitochondrial lineage or structure cluster membership. Thresholds for admixed samples were the same as used for structure. Oval outlines correspond to 95% confidence intervals. For (A) PC1 explains 2.82% of variance, PC2 2.18%, and PC3 1.66%; for (B) PC1 explains 4.78% of variance, PC2 3.52%, and PC3 2.57%. The pink arrows highlight a snake corresponding to mitochondrial lineage 6 of N. natrix.
In a fourth structure run, only our 52 samples of pure Natrix tessellata from Turkey were processed (Fig.
Genotypic structuring of Natrix tessellata in the study region (n = 52). Symbol colors indicate structure cluster membership; admixed ancestries are depicted as pie chart sectors according to Q values. Below the map, the upper bar shows the mitochondrial identity and the lower bar the microsatellite genotype of each individual sample. Mitochondrial lineages color-coded as in Figure
The most prominent results of our study are (1) the evidence for hybridization between grass snakes (Natrix natrix) and dice snakes (N. tessellata), (2) the pronounced divergence of Cypriot grass snakes, (3) the similarity of phylogeographic differentiation patterns of N. natrix and N. tessellata, and (4) that the eight mitochondrial lineages of N. natrix correspond to less microsatellite clusters. These results have implications both for phylogeography in general and for the taxonomy of the two Natrix species. In addition, our results confirm that N. megalocephala is not a valid species.
Previous investigations focused either on the phylogeography of N. natrix (e.g.,
When the putative hybrids from Turkey are compared to the parental species (Table S10), it is obvious that the hybrids share with each parental species many otherwise species-specific alleles. This corroborates their hybrid status, as do specimens that are assigned in the PCAs to the N. natrix cluster although they were morphologically identified as N. tessellata (Fig.
Yet, our study also showed the limitations of microsatellites. Using these markers, only recent hybrid and backcross generations can be detected (
Regarding Cypriot grass snakes, our study revealed a known weakness of the ∆K method and the structure software: The misidentification of K = 2 as top level of the hierarchical structure, even when more clusters should be present (
Cypriot grass snakes are showing an excess of homozygotes, indicating inbreeding (Table S9). This could have caused their distinctiveness in structure analyses under K = 5 (Fig.
Grass snakes are extremely rare on Cyprus and seriously endangered (
The herpetofauna of Cyprus is characterized both by old endemics and recent invaders (
The local grass snakes have been treated as another candidate for an old Cypriot endemic (
In general, the mitochondrial phylogeographies of N. natrix and N. tessellata show some similarities (Fig.
In contrast to N. natrix, N. tessellata shows additional lineages in regions where grass snakes do not occur, i.e., on the island of Crete, in the southern Levant and Egypt or in southern Central Asia. It remains a challenge to study the disjunct populations of N. natrix from Siberia and the Lake Baikal region to find out how they match the general pattern.
As in many other taxa (
In dice snakes, we are lacking nuclear genomic evidence for most lineages. Our sample is restricted to Turkey. Yet, it shows that the two mitochondrial lineages occurring there (Fig.
Principal Component Analyses for microsatellite data of pure Natrix tessellata (n = 52). The same dataset was used as for structure analyses. Samples are colored according to their mitochondrial lineage or structure cluster membership. Thresholds for admixed samples were the same as used for structure. Oval outlines correspond to 95% confidence intervals. PC1 explains 6.65% of variance, PC2 5.23%, and PC3 5.08%.
For grass snakes, range-wide microsatellite data are available (Figs
How can this be explained? A similar situation is known for the closely related barred grass snake (N. helvetica). Grass snakes are highly mobile animals, with males having larger home ranges than females (
For the Balkan Peninsula and southern Central Europe, this also suggests that a ‘rolling expansion and hybridization wave’ established when grass snakes with mtDNA lineage 4 (N. n. vulgaris) spread northwestward. While N. n. vulgaris successively invaded more northerly regions and ‘genetically swamped’ the resident grass snakes characterized by mtDNA lineage 3 (N. n. natrix; see Asztalos et al. 2021) there, the populations of N. n. vulgaris in their former southern refuge became increasingly admixed when other lineages, having independent glacial refuges in the southern Balkan Peninsula, expanded their ranges with Holocene warming. This lead to the current, at first glance counterintuitive, pattern that no pure populations of N. n. vulgaris exist anymore in their former glacial refuge, in contrast to their Holocene expansion area, where relatively pure populations occur in southern Central Europe and the adjacent northern Balkans.
Among the mtDNA lineages that had glacial refugia in the southern Balkan Peninsula is also mtDNA lineage 3, otherwise characteristic for the nominotypical subspecies N. n. natrix (see
Our results corroborate that the species Natrix megalocephala is invalid (
For dice snakes additional genetic and morphological investigations are needed for a comprehensive taxonomic revision. For N. natrix the present evidence and the nomenclatural foundation laid by
Within this framework (Fig.
Approximate distribution of the subspecies of Natrix natrix as outlined in the present study,
Grass snakes of the eastern microsatellite cluster, characterized by mtDNA lineages 1, 2, and 8, are currently assigned to two subspecies (
Furthermore, our results are inconclusive with respect to another subspecies that was recognized as valid by
Our study showcases that phylogeographic investigations should include samples of codistributed congeners because otherwise evidence for interspecific hybridization will be missed. Genomic approaches are expected to be more informative than investigations based on microsatellite loci alone. In our study system, we identified hybridization between Natrix natrix and N. tessellata, two species that were previously thought to hybridize only rarely (
The authors thank the Scientific and Technological Research Council of Turkey TÜBİTAK (project number 116Z359) for financial support. Marika Asztalos’ work was funded by Senckenberg. Daniel Jablonski was supported by the Slovak Research and Development Agency under the contract APVV-19-0076 and by the grant VEGA 1/0242/21 of the Scientific Grant Agency of the Slovak Republic. All animal handling within the scope of the TÜBİTAK project was in accordance with wildlife husbandry standards approved by Decision 2016–18 of the Laboratory Animals Ethical Committee at Ege University, Turkey. Süleyman İlhan and Melodi Yenmiş assisted with laboratory work. Frank Glaw and Ben Wielstra donated some samples. Henrik Bringsøe allowed the use of two photos for a figure. Frank Glaw and an anonymous reviewer made helpful suggestions that improved an earlier version of this study. Flora Ihlow and Markward Herbert Fischer produced one map for us.
Table S1
Data type: .xlsx
Explanation note: Table S1.
Supplementary Figures S1–S3
Data type: .pdf
Explanation note: Supplementary Figures S1–S3, Supplementary Tables S2–S11.