Review Article |
|
Corresponding author: Wolfgang Wüster ( w.wuster@bangor.ac.uk ) Academic editor: Uwe Fritz
© 2025 Wolfgang Wüster.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Wüster W (2025) Shedding the mitochondrial blinkers: A long-overdue challenge for species delimitation in herpetology. Vertebrate Zoology 75: 259-275. https://doi.org/10.3897/vz.75.e161536
|
The advent of molecular methods has revolutionised the field of species delimitation and description, one of the key tasks of systematic biology. In animal taxonomy, one marker, the mitochondrial DNA (mtDNA) molecule, has acquired and retained disproportionate influence. This is despite its uniparental, clonal mode of inheritance, as a result of which the entire molecule acts as a single locus, and that precludes its use as a test for admixture between putative lineages, a key consideration in species delimitation. To establish the extent to which the limitations of mtDNA affect present-day taxonomy in non-avian reptiles, I surveyed species descriptions and delimitations published during the years 2023–2024, determined the markers used, and whether analyses of different markers were set up to critically test or just to confirm mtDNA-inspired candidate species. Mitochondrial DNA remains the dominant molecular marker in reptile taxonomy, being used in 84% of species descriptions and delimitations, and as the sole molecular marker in 44%. Despite the immense progress in next generation sequencing (NGS) technologies and their increasing affordability, only 3.4% of descriptions used NGS approaches. In 61% of descriptions, taxa were identified primarily through mtDNA divergence, and additional data (morphology, single-copy nuclear gene sequences) were used as confirmatory evidence rather than as rigorous tests of mitochondrially inferred species limits. I reiterate the importance of truly integrative species delimitation that critically tests species limits first hypothesised from mtDNA, and suggest ways of improving the robustness of species delimitations by optimising the allocation of resources to more appropriate markers and through analytical approaches that critically test the evolutionary independence of putative species.
Herpetology, integrative taxonomy, mitochondrial DNA, morphometrics, multilocus analyses, Reptilia, species delimitation, taxonomy
The advent of the molecular age in systematics has resulted in a rapid increase in the rate of species descriptions in many taxa, including reptiles (
The enumeration of the biodiversity of our planet is one of the key tasks of systematic biology. The fundamental unit of this biodiversity is the species. Species not only constitute the taxonomic rank that is closest to being objectively definable, but also form the basis of many conservation metrics and efforts, such as IUCN Red Lists, as well as of regulatory instruments, including both international agreements, such as CITES, and national legislation. Despite the need for accurate species lists for these purposes (
However, despite the pressure of time, species descriptions and delimitations are only useful if they are rigorous and provide the evidence required to support their case, thereby reducing the likelihood of instability from later corrective changes (
While there is a long history of controversy over what species are and how they should be delimited, the General Lineage Concept of species (
Until the second half of the 20th century, morphology and other phenotypic traits (e.g., behaviour, physiology), as well as crossbreeding experiments, were the only sources of evidence available to systematists to support their inferences. The molecular revolution in systematics gained momentum with increasing ease of access to DNA sequence information. In particular, mitochondrial DNA (mtDNA) provided a relatively accessible route to genetic information, which moreover evolves at a rate that makes it suitable for systematics at low taxonomic levels (
The properties that first made mitochondrial DNA popular in systematics include its high rate of evolution, making it useful at low taxonomic ranks, including for intraspecific phylogeography (
The role of mitochondrial DNA in species delimitation was further emphasised by the advent of the barcoding initiative (
While its properties, mode of inheritance and rapid rate of sequence evolution make mtDNA logistically and economically attractive, these very features are responsible for the many limitations that make it unsuitable as a sole source of evidence in species delimitation (
Schematic illustration of the impact of the different modes of inheritance of nuclear and mitochondrial DNA on the ability to detect admixture between a hypothetical red and a blue species. A Nuclear DNA is inherited biparentally and reflects admixture in hybridization. In this case, the nuclear markers of the F2 offspring will reflect a genome originating to 75% from the red species. B Mitochondrial DNA is inherited matrilineally only. Therefore, the information on lines of descent involving male ancestors (indicated by dashed outlines) is lost from the mitogenome of the offspring. The mtDNA of the F2 offspring will reflect the matrilineal ancestry of the blue species only, without any indication of the 75% admixture from the red species.
Do phylogeographic breaks necessarily reflect population genomic breaks? A common assumption in phylogeographic and mtDNA-led systematic studies is that phylogeographic breaks correspond to past or present barriers to gene flow and reflect discontinuities in the genomic make-up of the phylogroups. However, this need not be the case.
Inability to assess introgression and hybridisation. Since the entire mtDNA molecule is (usually) inherited matrilineally and clonally, it follows that it cannot show admixture. Even in the presence of rampant hybridisation, each hybrid will carry the mtDNA haplotype inherited from its mother and her mother, etc. (Fig.
Since detecting admixture is key to distinguishing between tokogenetic and phylogenetic processes, the central task of species delimitation, it follows that on this basis alone, mtDNA cannot be used as the sole source of evidence for species delimitation. The North American ratsnake (Pantherophis obsoletus) complex is a prominent example where parapatric contact zones inferred from mtDNA led to the delimitation of four separate species (
Selection pressure versus population history. Especially in the presence of male-biased gene flow, changes in selection pressures, such as across ecotones, may drive population genomic breaks reflecting major barriers to gene flow that are entirely independent of mitochondrial phylogroups that reflect past lineage fragmentation. In some cases, patterns of genomic differentiation can cut orthogonally across the distribution of mtDNA haplotype clades (
“Lost” mitochondrial species identity. Mitochondrial introgression can lead to the loss of species-specific mitochondrial haplotype clades. For instance, the Carpathian newt (Lissotriton montandoni) is a well-supported and universally recognised species of newt. However, due to mitochondrial introgression from L. vulgaris, it “disappears” into the latter species in a mtDNA-only phylogeny (
Mitochondrial “ghost” species. Matrilineally inherited haplotype clades can persist in gene pools despite extensive gene flow with other populations, even when most of the genome has been replaced by that of the introgressing species. Vipera walser was described from northwestern Italy by
The problematic role of mtDNA in species delimitation. While single-locus species delimitation algorithms claim increased precision in inferring species limits from mitochondrial sequence data (e.g.,
In summary, while mitochondrial DNA sequences remain invaluable as tools to generate species hypotheses, and may well reflect aspects of population history such as past range fragmentation, numerous potential confounding factors preclude their use as the sole or dominant source of evidence for species delimitation. Critical testing of mtDNA–derived species hypotheses using independent marker sets, which can be either nuclear genetic markers or phenotypic characters, and analytical approaches capable of detecting admixture between putative taxa must be considered a fundamental and critical part of any species delimitation attempt (
The formulation of the General Lineage Concept of species (
Although
Mitochondrial DNA sequences will unquestionably continue to be an important source of data in animal systematics in the foreseeable future. However, the main role of mtDNA should be seen as the crucial and often underrated function of hypothesis generation rather than that of hypothesis testing, or, worse, as a sole source of evidence in species delimitation (
The key requirement for any delimitation analysis is that it must be designed to critically test the status of mitochondrial candidate species as independent evolutionary lineages. A critical test must be able to explicitly reject such a primary species hypothesis, not just fail to support it. For example, failure to find statistically significant differences in a phenotypic trait would fail to support a primary species hypothesis, but not reject it; in contrast, if a trait displays a pattern of clinal variation that crosses the putative boundary between the candidate species, this would constitute explicit evidence favouring rejection of the species hypothesis.
I therefore distinguish between two types of approaches: (i) critical approaches that can test and actively reject species hypotheses; and (ii) confirmatory approaches that can at best fail to actively support a species hypothesis, but cannot refute it. Broadly, methods that simply treat each of the candidate species as pre-defined units of comparison cannot critically test them, and are therefore confirmatory. To illustrate the importance of the distinction between critical tests and confirmatory approaches, Figure
Hypothetical scenario of two mitochondrial candidate species distributed along a geographic and ecological gradient. A observed patterns: 1) parapatrically distributed mitochondrial candidate species CS1 and CS2; 2) clinal phenotypic variation along geographical gradient; 3) genomic signature of isolation by distance along geographical gradient as revealed by software such as STRUCTURE (
This scenario is unlikely to be of relevance in situations where mitochondrial candidate species are allopatric and restricted to small, isolated ranges, such as oceanic islands or isolated mountains, where continued genetic exchange is highly improbable. However, structure within candidate species and continuing genetic exchanges between them must be critically evaluated in any situation where the candidate species are contiguously distributed, or likely to have been recently, prior to Pleistocene climatic and sea level fluctuations.
Population genetic approaches to the analysis of nuclear loci. The critical analysis of nuclear loci is a key component of species delimitation, especially where candidate species were first formulated from mtDNA sequence data. A multitude of approaches that do not require a priori grouping of samples into candidate species are available. Algorithms that can detect the number of distinct populations in a set of individual specimen data, such as STRUCTURE or STRUCTURAMA (
Testing candidate species with phenotypic characters. Most phenotypic traits are likely to be under polygenic control, and the analysis of multiple traits can be used to provide a proxy for overall genomic variation, either in conjunction with or instead of nuclear genetic markers. Candidate species would be confirmed by fulfilling the prediction that they constitute phenotypically discrete entities distinct from other such entities. Similar considerations apply here as to nuclear markers: Only methods that consider variation both within and between candidate species can positively reject these primary species hypotheses. Suitable approaches include PCA and derivatives such as MFA and NMDS, as well as the use of Mantel tests to determine whether a given pattern of geographic variation is better explained by IBD or assignment to candidate species (
Analyses of concatenated multilocus sequence data. The key reason for generating multilocus datasets should be to test for cytonuclear discordance, thereby challenging mitochondrial candidate species. However, phylogenetic analyses of concatenated multilocus datasets shoehorn all loci into a single tree topology, and thereby entirely negate their ability to reveal cytonuclear discordance. Concatenating multiple highly variable mitochondrial sequences with a few relatively conserved nuclear loci generates a mitochondrial phylogeny with added noise, not a multilocus phylogeny in any meaningful sense (
Some multispecies coalescent (MSC) approaches. Bayesian species delimitation analyses such as those implemented in the popular programs BPP and BPP3 (
Analyses of phenotypic variation treating candidate species as units of comparison. As shown in Figure
It is also important to be aware of approaches that rely on a priori grouping of specimens as a preparatory step prior to further analyses, such as corrections for size or the estimation of missing data. These can be problematic even if the final analysis itself does not require a priori grouping. For instance, correcting for overall size to identify differences in shape requires grouping into OTUs (
Ecological niche modelling. The notion of ecological divergence as an indicator of speciation has a long history in systematics (
This paper was prompted by multiple recent cases where species limits in high-profile taxa were primarily inferred using an mtDNA-led approach, without critical independent testing of the putative taxa, and the results were later found to be misleading once a multilocus dataset was brought to bear on them. Further recent controversial species delimitation studies and the ensuing criticism (
To understand current practices in species delimitation in reptiles, I conducted a survey of the herpetological literature in the years 2023 and 2024. I searched the Reptile Database (
Only taxonomic decisions resulting from new data and a genuine attempt at species delimitation were included. Taxonomic changes resulting from procedural issues, such as Code-mandated changes, rediscovered senior synonyms, replacement names for nomina nuda, etc., were not considered further. In accordance with widely agreed practice in herpetology (
For each taxonomic decision, I recorded the markers used and whether the data from different markers were used to critically test or merely to confirm the validity of the hypothesised new species. Genetic markers were divided into mtDNA, single-copy nuclear DNA genes (scnDNA), for both of which I recorded the number of genes used, and next-generation sequencing data, such as ddRAD-seq, ultraconserved elements (UCEs), etc. I then noted whether any nuclear markers were used in a manner allowing the critical testing of candidate species, or in a confirmatory manner that could at best fail to confirm species status.
For morphology, I again noted whether this was used in an approach allowing critical testing of mitochondrial candidate species, or in a confirmatory or solely descriptive manner. I also noted whether any newly defined or redefined species is likely to be in geographic contact with closely related congeneric taxa.
I collected data on 440 species descriptions and revalidations published in 253 individual papers. The compilation of the descriptions and the list of references are provided in Files S1 and S2. All the taxonomic decisions analysed here represent bona fide attempts at species delimitation: None could be described as “en passant” taxonomy, i.e., casual taxonomic decisions taken on the basis of superficial exploration of mitochondrial gene trees or distances (
Both morphological and nuclear sequence data were predominantly used in a confirmatory manner rather than as critical tests, precluding rigorous independent testing of mitochondrially defined candidate species. Overall, only 71 out of 179 delimitations (39.7%) including nuclear sequences used these as critical tests of mitochondrially-derived species hypotheses, and only 112 out of 440 delimitations (25.5%) using morphological data used them as critical tests of species hypotheses rather than as confirmatory evidence. These proportions did not differ significantly between taxa with likely contact zones with close relatives and taxa without such contact zones (Tables
Proportion of species descriptions using nuclear data as critical tests versus in a confirmatory approach. The proportions of each do not differ significantly between species with and without contact zones with close relatives (Χ2 = 1.8684, d.f. = 1, p = 0.171661; contact unknown category excluded).
| N | Nuclear data used as test | Nuclear data used as confirmation | |
| Contact | 104 | 37 (35.6%) | 67 (64.4%) |
| No contact | 72 | 33 (45.8%) | 39 (54.2%) |
| Contact unknown | 3 | 1 | 2 |
Proportion of species descriptions using morphological data as critical tests versus in a confirmatory approach. The proportions of each do not differ significantly between species with and without contact zones with close relatives (Χ2 = 2.1536, d.f. = 2, p = 0.340684; contact unknown category included).
| N | Morphology used as test | Morphology used as confirmation | |
| Contact | 194 | 43 (22.2%) | 151 (77.8%) |
| No contact | 214 | 61 (28.5%) | 153 (71.5%) |
| Contact unknown | 31 | 8 (25.8%) | 23 (74.2%) |
Where both mtDNA and single copy nuclear gene sequences were used, most studies used as many or more mitochondrial loci than nuclear loci (Fig.
Relationship between the number of mitochondrial and single-copy nuclear genes used in multilocus reptile species delimitations. Symbol size indicates frequency of the combination of mitochondrial and nuclear gene numbers. Note that the numbers of specimens sequenced for mitochondrial and nuclear genes are not considered! Many studies only sequence a subset of specimens for the nuclear genes.
The results of our survey document the extent to which mitochondrial DNA still dominates reptile systematics, and that, despite numerous conceptual and technological advances (
Startlingly, most (61.9%) species descriptions and revalidations using mtDNA still do not critically test whether mitochondrially-defined candidate species are indeed independently evolving lineages. This matters less in taxa restricted to small, isolated ranges without contact with closely related species (making ongoing gene flow very unlikely). However, the proportion of studies using confirmatory-only approaches in their analysis of scnDNA or morphological data is as high in descriptions of more widespread taxa with contact zones as in isolated taxa. This trend contradicts the broader conclusions of
Sequencing strategies in multilocus studies also betray a lack of awareness of the need for independent data: Most multilocus studies sequence as many or fewer nuclear than mitochondrial loci (Fig.
I stress that nothing in this paper is intended as criticism of any specific study: The approaches used by different author teams may have been dictated by a wide range of technical, economic, logistic or historical reasons or constraints, which may not be apparent to the reader. Instead, the focus should be on the broad pattern of the data, which document continued high levels of reliance on mitochondrial DNA and insufficient testing of its conclusions, and the consequent lack of stringency of the taxonomic conclusions. Below, I offer suggestions for improving the rigour and reliability of species delimitations in herpetology, particularly where mtDNA is used as a primary tool for hypothesis generation.
Many or most of the problems highlighted above could be prevented through the use of next-generation sequencing methods, potentially coupled with increasing standardisation of the markers used (
The need for more genuinely multilocus analyses inevitably raises questions about the cost of additional sequencing, especially in resource-poor settings. However, the data collected here offer only very partial support for this concern: In many studies, researchers sequence multiple mitochondrial genes for numerous specimens, presumably to enhance the resolution of the resulting mitochondrial gene tree, but only a small subset of specimens is sequenced for fewer nuclear genes (Fig.
Allocation of sequencing effort between mitochondrial and single copy nuclear genes in reptile species descriptions. While the use of few loci overall (blue box) may be an inevitable consequence of resource limitation, the sequencing of numerous mitochondrial gene fragments but few or no nuclear markers (orange box) constitutes a suboptimal allocation of resources for the purposes of species delimitation.
As an example, one recent study later criticised for excessive reliance on mtDNA (
Where researchers use single copy nuclear (scn) genes to test mitochondrial candidate species, the choice of loci is of key importance. Most scn loci are slower-evolving and more conserved than most mitochondrial genes. As a result, many contain few or no positions that vary at or around the species level. However, evolutionary rates vary widely between loci (
While molecular work inevitably involves significant costs, morphology remains an accessible and appropriate source of data for testing species boundaries that requires only very limited financial investment. As a proxy for overall genomic variation, detailed studies of morphology can be used to test whether mitochondrial candidate species represent discrete evolutionary lineages. Unfortunately, the analyses above show that this potential of morphological data is rarely realised. Only around one quarter of all descriptions analysed used morphology to critically test species boundaries. Often, morphological data are only used in comparative tables showing the range of values in the suggested candidate species, without further analysis. As noted earlier, approaches based on the a priori grouping of morphological data into OTUs by mitochondrial candidate species (e.g., comparative tables, ANOVA, DFA) assume what should be tested, that each candidate represents a homogeneous taxon and thus a suitable unit for comparison. Instead, methods not involving a priori grouping should be used, such as Principal Components Analysis or conceptually similar approaches. In the case of widespread taxa, dividing each candidate species into several OTUs based on collecting gaps, and then using DFA or similar to test the prediction that the OTUs of each candidate species should be homogeneous but distinct from the OTUs of others, constitutes a suitable alternative solution (e.g.,
For candidate species with broader geographic ranges, thorough coverage of the entire range of the candidate species and its closely related neighbours is essential. In particular, sampling likely contact zones between candidate species is crucial, since these are likely to contain evidence on the degree of genetic isolation between them. Inadequate sampling of contact zones is likely to lead to an overestimate of the number of species present (
The aim of any scientific publication is to provide the reader with the information required to independently assess the validity of the conclusions. In the case of species delimitation studies, a number of additional factors can affect the assessment of the validity of a paper’s conclusions. In particular, understanding the likely nature of contact zones is key to interpreting the evidence for evolutionarily independent lineages. Ideally, papers dealing with species delimitation should provide maps with all known records of the focal species and its near relatives. Maps showing just type localities are of little use in assessing the likelihood of contact between taxa. Similarly, a list of sympatric species, particularly any congeneric species, is valuable to assess the status of the species.
As in any scientific endeavour, where multiple lines of evidence are available, great strength in one can compensate for weakness in another. In species delimitation, the most complex cases are those of widespread candidate species in geographic contact with each other and low or intermediate levels of mtDNA sequence divergence: Here, a genuinely integrative approach with critical testing of mtDNA-defined species limits is essential. Equally, allopatric populations separated by low levels of mtDNA divergence require strong additional evidence to justify recognition as species. In contrast, where entirely allopatric candidate species occupying isolated ranges differ by very high levels of mtDNA divergence or are clearly non-monophyletic, a confirmatory approach to morphological data is likely to be adequate.
This study documents the continued disproportionate influence of mitochondrial DNA in reptile systematics. Many researchers still use mitochondrial DNA as the main source of evidence for species delimitation and fail to critically test whether their mitochondrial candidate species really denote independently evolving organismal lineages. The result of this excessive reliance on mtDNA and lack of rigorous testing of mitochondrial species hypotheses is that the rapid increase in the number of recognised reptile species (
I gratefully acknowledge the numerous authors who shared their research through the use of Open Access publishing, academic repositories, or by message or email. I apologise and take full responsibility for any misrepresentations in File S1, and stress again that the intention of this paper is to reveal broad patterns, not to criticise individual publications or their author teams. Harry Searle-Webb and James Hicks assisted with insightful discussions of morphometric methods and software. Comments on a previous version of the manuscript by Dirk Ahrens, Aurélien Miralles, Uwe Fritz and an anonymous reviewer led to significant improvements.
Files S1, S2
Data type: .zip
Explanation notes: File S1. Species descriptions and delimitations analysed in this paper, with bibliographic source, number of mitochondrial and single copy nuclear genes used, whether next-generation sequencing methods were used, whether nuclear genetic data and morphological data were used as critical tests of mitochondrial candidate species or in a confirmatory approach, and whether the species is likely to have a contact zone with a closely related species. — File S2. List of references of all species descriptions and delimitations analysed in this paper, from File S1.