Research Article |
|
Corresponding author: Irina V. Kulikova ( i-kulikova@mail.ru ) Academic editor: Martin Päckert
© 2025 Irina V. Kulikova, Philip Lavretsky, Kevin G. McCracken, Yury N. Zhuravlev, Irina L. Miroshnichenko, Andrew B. Correll, Jeffrey L. Peters.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Kulikova IV, Lavretsky P, McCracken KG, Zhuravlev YN, Miroshnichenko IL, Correll AB, Peters JL (2025) Genetic differentiation and population structure of “northern” wigeons (Anseriformes: Anatidae: Mareca americana, M. penelope). Vertebrate Zoology 75: 1-1. https://doi.org/10.3897/vz.75.e167908
|
Eurasian wigeon (Mareca penelope) and American wigeon (Mareca americana) are sister species with diagnosable differences mostly in male plumage. They breed in the Palearctic and Nearctic, respectively, but due to transoceanic migrations come in contact in North America, Western Europe, and north-eastern Asia, where they occasionally hybridize. To estimate genomic divergence and study their population structure we sequenced mitochondrial (mt) DNA control region and obtained 3092 autosomal and 189 Z chromosome loci from double-digest restriction associated DNA sequencing (ddRAD-seq). Consistent with previous work with few nuclear loci, we observed discordant patterns between mtDNA and nuclear DNA divergence. Deeply divergent species-specific mtDNA haplogroups contrasted with low autosomal differentiation and moderate Z-sex chromosome divergence. Meanwhile, Z-linked differentiation (ФST = 0.192) between taxa was five times higher than differentiation of autosomal loci (ФST = 0.0386), with four fixed and eight nearly fixed differences in SNPs discovered in three and six Z-linked outlier loci, respectively. No species-specific SNP variants were found among 83 autosomal outlier loci. This elevated Z-chromosome differentiation is most likely the result of selection that has been important in speciation. The lack of population genetic structure within Eurasian wigeon and American wigeon supports the common notion that migratory waterfowl have high dispersal ability that contributes to strong genetic connectivity between geographic populations.
Diagnostic single nucleotide polymorphisms, elevated Z-sex chromosome divergence, mito-nuclear discordance, population genomics, population structure, wigeon
Eurasian wigeon (Mareca penelope) and American wigeon (Mareca americana) compose the “northern” wigeons (
Both species are highly migratory with well-defined migratory pathways. American wigeon migrates along all four North American flyways: Pacific, Central, Mississippi, and Atlantic (
American and Eurasian wigeon have diagnosable differences in male plumage and deeply divergent mtDNA haplogroups (
In the previous study, the only nuclear locus with high ФST of 0.415 was the Z-sex chromosome linked intron 19 of chromo-helicase-DNA binding protein gene 1, CHD1Z (
The objective of this study was to test for genome-wide genetic differentiation between the American and Eurasian wigeon and among populations of each species. We sampled both species widely across the Palearctic and Nearctic and applied ddRAD-seq (double digest restriction associated DNA sequencing). We combined ddRAD-seq data with mtDNA control region variability to answer the following questions: (a) are American and Eurasian wigeon populations genetically structured across the Holarctic, (b) is there congruence between different types of markers: mtDNA, sex chromosomes and autosomes, (c) is the Z chromosome more divergent than the autosomes, and (d) what is the number and distribution of divergent loci between these species, especially on the Z chromosome.
A total of 113 samples of Eurasian wigeon representing five populations and 92 samples of American wigeon from three flyways were included in analysis of mtDNA variability (Fig.
Approximate breeding distributions of the “northern” wigeons (modified from
For the analysis of nuclear DNA, we obtained tissue samples from 70 Eurasian wigeons and 40 American wigeons collected across Asia from Western Siberia to Western Beringia and across North America (Fig.
The raw Illumina reads were processed using the
The aligned sequences were then genotyped using python scripts available with the DaСosta and Sorenson (2014) pipeline. Homozygotes were defined when ≥93% of the reads were identical, whereas heterozygotes were scored if a second haplotype was represented by at least 29% of sequence reads or if as few as 20% of reads were consistent with a second allele and that haplotype was represented in other individuals. Genotypes were flagged if none of these criteria were met, or more than two haplotypes met the criteria, or if they were represented by ≤5 reads; for those genotypes, we retained only the allele represented by the majority of reads and scored the second allele as missing data. We retained all loci that contained ≤10% missing genotypes and ≤5% flagged genotypes.
A representative sequence from each of the final alignments was aligned to the reference mallard genome (assembly v.ZJU1.0, accession no. GCA_015476345.1). This was done to localize each locus and separate autosomal and Z-linked loci for downstream analyses. Only uniquely aligned reads were selected for further analysis.
Analysis of population structure was done using three methods. First, we used principal coordinates analysis (PCoA) based on the Euclidian distances between individual genotypes and implemented by dudi.pco in the R software package ADEGENET v.2.1.3 (
Finally, we utilized the R package POPGENOME v.2.7.7 (
We calculated the pairwise per locus values of ФST, dXY, and π in the r package POPGENOME as described above. Z-linked and some autosomal pairwise ФST values were plotted by chromosomal position in EXCEL (i.e., Manhattan plots). To identify putative loci under selection we used 16,548 SNPs that is quite sufficient number for the task (
American wigeon had almost two-fold higher nucleotide diversity for mtDNA control region than Eurasian wigeon (Table
We recovered two expected deeply divergent species-specific mtDNA haplogroups (
Nucleotide diversity (π), absolute divergence (dXY), and relative divergence (ФST) in Eurasian wigeon, Mareca penelope (M.p.) and American wigeon, M. americana (M.a.) calculated with introgressed haplotypes included.
| Loci | M.p. π | M.a. π | dXY | ФST |
| mtDNA | 0.00278 | 0.00509 | 0.0328 | 0.880 |
| A-loci | 0.00792 | 0.00791 | 0.0082 | 0.039 |
| Z-loci | 0.00385 | 0.00386 | 0.0048 | 0.192 |
After quality filtering, we recovered 3281 ddRAD-seq loci, with a mean depth of 199.1 reads per locus per individual. Among these loci, 3092 loci (388,590 aligned base pairs; 37,182 SNPs) were assigned to autosomes and 189 loci (23,513 aligned base pairs; 1567 SNPs) were assigned to the Z chromosome. The loci were evenly distributed across chromosomes (Table S5) with the number of loci per chromosome being proportional to the chromosome size (R2 = 0.974).
Eurasian and American wigeon had similar autosomal and Z-chromosome nucleotide diversities (Table
Plotting the first two principal coordinates from the PCoA clearly separated Eurasian and American wigeons with the first coordinate axis playing the main part in species separation (Fig.
ADMIXTURE results were based on a total of 15,991 biallelic autosomal SNPs and a total of 553 biallelic Z-chromosome SNPs. We also made ADMIXTURE analysis with a single biallelic SNP randomly chosen from each locus with a total of 2664 biallelic autosomal SNPs and 153 Z-chromosome SNPs (Fig.
ADMIXTURE assignment probabilities for Eurasian wigeon, Mareca penelope (M.p.) and American wigeon, M. americana (M.a.) for (a) 3092 autosomal and (b) 189 Z-linked ddRAD-seq loci for K population values of 2 and 3 and one randomly chosen SNP from each locus. I – M.p. Siberia, II – M.p. Russian Far East, III – M.p. Western Beringia, IV – M.p. NA Pacific Flyway, V – M.p. NA Atlantic Flyway, VI – M.a. NA Pacific Flyway, VII – M.a. NA Central Flyway, VIII – M.a. NA Atlantic Flyway.
A low level of population genetic differentiation across Z-chromosome loci was observed in both Eurasian and American wigeon. Thus, relative divergence ranged from –0.007 to 0.004 between Eurasian and from –0.001 to 0.005 between American populations, thus effectively zero in both species. In contrast, genetic differentiation between species was high and varied from 0.166 to 0.208 (Table S4a). Values of relative divergence across autosomal loci were higher than across Z-linked loci in both Eurasian and American wigeon and ranged from 0.004 to 0.021 (Table S4b). Lower levels of relative divergence were found in pairwise comparisons of NA Pacific and North Asian (Western Beringia and Far East) populations of Eurasian wigeons (ФST = 0.004–0.006) and between NA Pacific and Central populations of American wigeons, whereas the most differentiated Eurasian wigeon population was the NA Atlantic (ФST = 0.018–0.021). Values of autosomal genetic differentiation between the species were much lower than those of the Z chromosome (ФST = 0.037–0.058 vs. ФST = 0.166–0.208; Table S4a,b). Pairwise ФST values based on mtDNA, Z-chromosome, and autosomal loci were strongly and significantly correlated (simple Mantel test) with each other (Fig. S4). AMOVA showed that 95.8% and 79.7% of the autosomal and Z chromosome genetic variability, respectively, were due to variability within the populations, while 3.7% and 20.2% of variability were due to interspecies differences (Table S6). Nucleotide diversity values were similar for all populations of Eurasian and American wigeon (0.0037–0.0039 for Z chromosome loci and 0.0077–0.0079 for autosomal loci; Table S3).
Comparing Eurasian and American wigeons and analyzing Z-linked and autosomal markers together as well as separately, PCadapt detected 14 Z-linked loci (7.4%) and 60 autosomal loci (1.9%) as outliers using a false discovery rate (FDR) of 0.01, and 17 Z-linked loci (9.0%) and 88 autosomal loci (2.7%) as outliers using FDR of 0.05. BAYESCAN identified six Z-linked loci (3.2%) and 58 autosomal loci (1.9%) as outliers using an FDR of 0.01 and nine Z-linked loci (4.8%) and 83 autosomal loci (2.7%) as outliers using FDR of 0.05 (Figs
Diversity estimates across mitochondrial DNA haplotypes and nuclear ddRADseq-loci for American and Eurasian wigeons (Tables
However, mtDNA variability revealed subtle population structure in Eurasian wigeon. Samples from Siberia and the NA Atlantic Flyway were differentiated from the samples collected in Western Beringia, Russian Far East, and NA Pacific Flyway (ФST = 0.14–0.39; Table S4c), which altogether with AMOVA results supported the presence of some barrier to mtDNA gene flow between Siberia and North Asia. We believe it could be explained by common wintering grounds for these groups of populations. It is well known that formation of pairs in waterfowl usually takes place during wintering (
Both mtDNA and nuclear DNA confirm that American and Eurasian wigeons are genetically differentiated from each other. Each of two deeply divergent mtDNA haplogroups (Fig.
The previous study of divergence between American and Eurasian wigeon based on sequencing mtDNA control region and 20 nuclear introns (
Eurasian and American wigeon had five times higher differentiated Z chromosomes than autosomes (ФSTZ/ФSTA = 4.97), whereas genetic diversity of Z-linked loci was almost one half of genetic diversity of autosomal loci (Table
The percent of outlier loci under selection was slightly higher among Z-linked loci (4.8%) than among autosomal loci (2.7%). The difference between Z and autosomal loci was much less pronounced in wigeons than in species of the mallard group (
Nevertheless, Z-linked outliers had elevated absolute divergence (dXY), depressed within-species nucleotide diversity (π), and high ФST values (mean ФST = 0.93) in comparison to Z non-outlier loci, whereas the absolute divergence of autosomal outlier and non-outlier loci was nearly identical. Furthermore, within-species nucleotide diversity of outlier loci was slightly depressed only for the Eurasian wigeon, and ФST values (mean ФST for autosomal outliers = 0.37) were much lower than for Z-outliers. The signal of strong divergent selection across Z chromosome is also supported by the findings of three outlier loci with species diagnostic SNPs and six outliers with SNPs close to be species-specific with just one to three individuals having the SNP variant characteristic of the sister species (Fig.
Haplotype networks for outlier SNPs on Z-chromosome (SNP positions are shown above each network, Z-loci names and SNP variants below each network) of Eurasian wigeon (dark fill), American wigeon (light fill), and American x Eurasian wigeon hybrid (white fill). Each haplotype network includes one (females) or two (males) alleles per individual.
It is noteworthy to mention that alignment of nine Z-chromosome outliers to the mallard genome reveled four of them residing within introns of protein-coding genes and two Z outliers located in introns of long non-coding RNA genes (Table S7). In particular, two of the species diagnostic SNP variants are located in the intron of Janus Kinase 2 (JAK2), a non-receptor tyrosine kinase gene, which is an essential component of signal transduction of the class II cytokine receptors associated with growth and reproduction traits in chickens (
Overall, results obtained with ddRAD-seq support previously discovered mito-nuclear discordance in divergence between American and Eurasian wigeon. Estimates of relative divergence from ddRAD-seq were similar to those calculated from sequencing of 20 nuclear introns, which demonstrates the reliability of genetic parameters recovered with intron sequencing. However, it was not possible to differentiate species using the variability of intron sequences, whereas ddRAD genotypes allowed unambiguous assignment of individuals to their species and identification of admixed individuals or hybrids due to ddRAD-seq allele frequency differences. We also recovered several loci in nuclear genome detected to be under divergent selection that contained fixed or almost fixed differences in SNPs, and all such species-specific SNPs were located on the Z chromosome. Some of those SNPs reside in introns of protein-coding genes. The Z chromosome is notable for its role in speciation in birds with more and more studies demonstrating accumulation of loci under divergent sexual selection and involved in sex differences, species recognition, plumage color including sexually selected ornaments (
The raw ddRAD sequencing data generated in this study are available from the Dryad Digital Repository at https://doi.org/10.5061/dryad.8931zcs4x.
We thank Korobitsyn Igor, National Research Tomsk State University, for providing tissue samples of Eurasian wigeon from Siberia. The research was carried out within the state assignment of Ministry of Science and Higher Education of the Russian Federation (theme No. 124012200182-1). The results were obtained using the equipment of Shared Resource Center “Far Eastern Computing Resource” IACP FEB RAS (https://cc.dvo.ru). Thumbnail image is courtesy of Eric Planet Olympus from Pexels.
Figures S1–S5
Data type: .docx
Explanation notes: Figure S1. Cross-validation errors for autosomal and Z-chromosome ADMIXTURE results for K 1–10. — Figure S2. STRUCTURE assignment probabilities for Eurasian wigeon (Mareca penelope) and American wigeon (M. americana) for 3092 autosomal and 189 Z-linked ddRAD-seq loci with number of populations K of 2–4. — Figure S3. Delta K values for autosomal and Z-chromosome STRUCTURE results for K = 1–8. — Figure S4. Comparisons of pairwise ΦST values based on autosomal ddRAD loci, Z-linked ddRAD loci and mtDNA. — Figure S5. Manhattan plots of autosomal chromosomes with significant outliers as red circles (FDR = 0.01) and yellow circles (FDR = 0.05).
Tables S1–S8
Data type: .xlsx
Explanation notes: Table S1. Sample information on “population”, location, NCBI accession number for mtDNA dataset. — Table S2. Sample information on “population” and location for ddRAD-seq dataset. — Table S3. Nucleotide diversity (dX), gene diversity (H) and Tajima's D values in populations of Eurasian Wigeon (M.p.) and American Wigeon (M.a.). Far East – Russian Far East, West Ber – Western Beringia, Pac NA – Pacific Flyway (North America), Atl NA – Atlantic Flyway (North America), Cent NA – Central Flyway (North America). — Table S4. A-loci pairwise values of nucleotide divergence (dxy, above diagobal) and gene flow (Fst, below diagonal) between populations of Eurasian wigeon (M.p.) and Americam wigeon (M.a) based on (a) 3092 ddRAD autosomal loci; (b) 189 ddRAD Z-loci; (c) 661 bp mtDNA. — Table S5. Numbers of ddRAD loci aligned to Mallard chromosomes (reference genome GCA_015476345.1). — Table S6. AMOVA analysis of Eurasian and American wigeons. — Table S7. BLASTn searches for the Z chromosome loci detected at the outlier SNP analysis. — Table S8. BLASTn searches for the autosomal loci detected at the outlier SNP analysis.