Evidence of two deeply divergent co-existing mitochondrial genomes in the Tuatara reveals an extremely complex genomic organization

Macey, J. Robert; Pabinger, Stephan; Barbieri, Charles G.; Buring, Ella S.; Gonzalez, Vanessa L.; Mulcahy, Daniel G.; DeMeo, Dustin P.; Urban, Lara; Hime, Paul M.; Prost, Stefan; Elliott, Aaron N.; Gemmell, Neil J.

doi:10.1038/s42003-020-01639-0

Download PDF

Article
Open access
Published: 29 January 2021

Evidence of two deeply divergent co-existing mitochondrial genomes in the Tuatara reveals an extremely complex genomic organization

Communications Biology volume 4, Article number: 116 (2021) Cite this article

9884 Accesses
12 Citations
125 Altmetric
Metrics details

Subjects

Abstract

Animal mitochondrial genomic polymorphism occurs as low-level mitochondrial heteroplasmy and deeply divergent co-existing molecules. The latter is rare, known only in bivalvian mollusks. Here we show two deeply divergent co-existing mt-genomes in a vertebrate through genomic sequencing of the Tuatara (Sphenodon punctatus), the sole-representative of an ancient reptilian Order. The two molecules, revealed using a combination of short-read and long-read sequencing technologies, differ by 10.4% nucleotide divergence. A single long-read covers an entire mt-molecule for both strands. Phylogenetic analyses suggest a 7–8 million-year divergence between genomes. Contrary to earlier reports, all 37 genes typical of animal mitochondria, with drastic gene rearrangements, are confirmed for both mt-genomes. Also unique to vertebrates, concerted evolution drives three near-identical putative Control Region non-coding blocks. Evidence of positive selection at sites linked to metabolically important transmembrane regions of encoded proteins suggests these two mt-genomes may confer an adaptive advantage for an unusually cold-tolerant reptile.

Complexity of avian evolution revealed by family-level genomes

Article 01 April 2024

Josefin Stiller, Shaohong Feng, … Guojie Zhang

The variation and evolution of complete human centromeres

Article Open access 03 April 2024

Glennis A. Logsdon, Allison N. Rozanski, … Evan E. Eichler

Evolution of tissue-specific expression of ancestral genes across vertebrates and insects

Article 15 April 2024

Federica Mantica, Luis P. Iñiguez, … Manuel Irimia

Introduction

Mitochondrial polymorphisms in animals exist as low-level mitochondrial heteroplasmy and as deeply divergent co-existing mitochondrial genomes. Low-level mitochondrial heteroplasmy typically arises via shallow historical mutation with the maternal passage leading to generational fixation^1,2. Alternative mechanisms are mitochondrial degradation forming heteroplasmy^2,3 that is generally not inherited and paternal leakage⁴. Instances of paternal leakage causing apparent heteroplasmy have been suggested to be mtDNA fragments recently integrated into the nuclear genome⁴. Deeply divergent co-existing mt-genomes are previously only known among bivalvian mollusks, where one mt-genome is typically maternally inherited and the other paternally inherited, a phenomenon known as doubly uniparental inheritance (DUI)⁵.

The vertebrate mt-genome is small in size (15–26 kb) with most taxa sharing a common gene order⁶. Among the more structurally complicated mt-genomes described is that of the Tuatara (Sphenodon punctatus, Fig. 1). A highly threatened reptile, the Tuatara remains on 32 small islands in New Zealand, and represents the sole-surviving taxon of a deep vertebrate lineage (Order: Rhynchocephalia), which diverged from squamate reptiles 220–250 million years ago (MYA)⁷. Previous studies reporting the first ‘complete’ mt-genome⁸ and subsequent 34 additional population-level Tuatara mt-genomes^9,10 suggests the Tuatara mt-genome is missing three genes that encode ND5, tRNA^His, and tRNA^Thr. That finding is striking because absent tRNA genes have transcriptional implications for all mt-encoded proteins¹¹.

**Fig. 1: Mitochondrial gene organization in the Tuatara.**

In order to investigate the true Tuatara mt-genome composition an array of sequencing techniques including Illumina whole-genome shotgun, Oxford Nanopore, PacBio, and PCR-based Sanger sequencing are used on individuals sampled across extremes of the Tuatara’s geographic distribution: Stephens Island (SI-1–4) and Lady Alice Island (LAI) samples, the latter sequenced for a whole-genome project¹². In doing so, two deeply divergent mt-molecules are discovered in the Tuatara (LAI), each containing the three genes previously reported as missing, albeit bounded by a series of repeated Control Region copies. The Tuatara is exceptional in being a large-bodied reptile metabolizing in cold environments, in which mitochondrial ATP synthesis is conducted^13,14. The likely time of divergence among these molecules and the selective pressures that may have led to maintain two deeply divergent, coexisting mitochondrial genomes is explored.

Results

The discovery of two mitochondrial genomes in the Tuatara

The discovery of two mt-genomes in the LAI individual is established via complementary Illumina (high coverage short-read) and Oxford Nanopore (long-read) data that identify two groups of DNA sequencing reads distinguishable by >5% sequence divergence. Illumina assembly of the LAI molecule 1 (M1) is 18,078 bases and molecule 2 (M2) 18,315 bases, each containing all 37 genes typical of animal mt-genomes. Molecular features implicate both genomes as mtDNA and not nuclear DNA. These include strong strand-bias against guanine (13.8% M1 and 14.6% M2; Supplementary Note 1), all protein-coding genes translate with no internal stop codons, tRNA genes encode tRNAs with stable secondary structures containing recognized anti-codons⁶, and no sequencing reads, whether short or long-read data, were flanked by nuclear DNA. The two mt-genomes are further confirmed via Oxford Nanopore sequencing of LAI total genomic DNA producing 9.46 Gb of sequence data in 7,229.48 K reads (Fig. 2, Table 1, and Supplementary Note 2). Filtering out mtDNA reads with assignments to M1 or M2, results in 114 mtDNA reads that map to 100% of the LAI M1 Illumina assembly reference. Thirty-two LAI M2 assigned reads map to 95.8% of the LAI M2 Illumina assembly reference. A single complete mt-genomic Oxford Nanopore read is obtained for LAI M1 from an mt-molecule naturally nicked, with both native DNA complement strands end-connected via laboratory manipulation, and sequenced in a single reaction (Supplementary Note 3 and Supplementary Fig. 1).

**Fig. 2: Oxford Nanopore coverage of Lady Alice Island mt-genomes (M1 and M2).**

Table 1 Data obtained from Oxford Nanopore runs both mined and retrieved mitochondrial DNA (see text for protocol information).

Full size table

A highly divergent second mt-genome via phylogenetics

Phylogenetic analysis of protein-coding sequences from LAI M1 and M2 with published^9,10 mt-genome sequences (lacking ND5) places LAI M2 outside all extant populations (Fig. 3). The rooting position of M2 between northern and southern populations forms two groups of M1 separated by the Cook Strait with 1.0% sequence divergence. In contrast, all M1 sequences are on average 11.1% divergent to M2, further negating the possibility of M2 being a recent nuclear-integrated copy. Monophyly of LAI genomes (M1 and M2) is statistically rejected. The well-dated ND1–COI section in amphibians and reptiles is estimated to have a pair-wise sequence divergence of 1.3% per million years^15,16,17. Application of this divergence rate to the Tuatara mt-genomes indicates a 7.8-million-year separation between M1 and M2 (10.1% sequence divergence), with a 1.2-million-year separation between northern and southern population M1 genomes (1.6% average sequence divergence), indicating a deeply divergent second mt-molecule (Supplementary Note 4).

**Fig. 3: Phylogenetic relationships of Tuatara populations in relation to Lady Alice Island molecules (M1 and M2).**

An extremely divergent genomic organization among vertebrates

The most complex and rearranged vertebrate mt-genome is discovered in the Tuatara. All 37 mt-genes (13 protein-coding, 2 rRNA, and 22 tRNA) are present with the addition of three putative Control Region non-coding blocks (NC1–3), and duplicate tRNA^Lys and tRNA^Leu(CUN) copies (Fig. 1). A newly identified segment containing ND5, tRNA^Thr, tRNA^His, NC2, and tRNA^Leu(CUN) second copy is discovered that was not reported in all 35 previously published Tuatara mt-genomes^8,9,10. In contrast to the standard vertebrate mt-gene order, the section between ND4 and tRNA^Phe contains the highly rearranged section of ND6, tRNA^Glu, NC1, tRNA^Leu(CUN) first copy, ND5, tRNA^Thr, tRNA^His, NC2, tRNA^Leu(CUN) second copy, Cytb, tRNA^Pro, tRNA^Ser(AGY), and NC3. This large-scale regional genic rearrangement with NC duplications involves protein-coding genes and numerous tRNA gene movements that cannot be explained by a simple duplication and deletion of redundant sequence model⁶. The Tuatara mt-genome architecture is also confirmed in a PacBio sequenced individual (SI-3) and our mining of a transcriptome library¹⁸ (SI-4), Sanger sequencing of PCR-amplified products from SI-2 and SI-3 with forward and reverse primers extending inside ND5 further support that gene and other gene junctions are in the mt-genome (Supplementary Note 5).

Unusual replication origins

The replication origin for the light strand (O_L), missing between tRNA^Asn and tRNA^Cys^6,19, has an O_L-like structure overlapping 14 bases with tRNA^Asn (normally none) and two bases with tRNA^Cys (normally two to four). An associated structure of the adjacent tRNA^Cys encoding a tRNA lacking a D-arm that instead contains a D-arm replacement loop^19,20 may provide an alternative O_L initiation site⁶ (Fig. 4). Concerted evolution of two Control Region sequences in which evolution of duplicated segments of DNA undergo rapid replacement during replication to either create identical or near identical copies through time²¹ is observed in several independent reptiles lineages^22,23,24. The Tuatara three non-coding blocks (NC1–3) show features consistent with Control Region copies that are nearly identical suggestive of concerted evolution (Supplementary Note 6). Phylogenetic analysis of duplicated non-coding blocks of the two LAI mt-genomes and the SI-4 mt-genome produces two equally parsimonious trees showing duplicated regions within an mt-genome clustering (Fig. 5). Monophyly of NC1, NC2 or NC3 sequences is each statistically rejected, a result consistent with concerted evolution in which identical or near identical sequences in repeated regions are propagated²¹.

**Fig. 4: Secondary structures in the region of the light strand replication origin (O_L) in the Tuatara (*Sphenodon punctatus*).**

**Fig. 5: Phylogenetic tree depicting relationships among non-coding blocks (NC1–3) using Lady Alice Island M1 and M2 with Stephans Island sample 4.**

Inheritance of two mt-genomes

The LAI individual exhibiting two mt-genomes (M1, M2) is a male, which received genome-wide sequencing efforts¹², while all other samples were female and not as extensively sequenced. Mined Illumina mtDNA reads totaling 209,650 from LAI, identify 30,005 as M2, and 176,980 as M1, resulting in 206,985 reads assignable to either M1 or M2. Thus, M2 is represented by 14.5% (1/7^th concentration) of assignable reads, relative to the dominant M1 copy. Among bivalves, males inherit and carry mtDNA from both parents, while females only carry mtDNA from the mother⁵. In males, somatic tissues are variable for the concentration of the male-type mt-genome within individuals and between species²⁵, but generally in low concentration in somatic tissues^5,26. A male inherited Tuatara copy is possible. The Tuatara M2 genome is detected only in the single male sampled and DNA was extracted from blood. No other male samples were available, and the M2 genome was not detected in any females sampled, although not subjected to genome-level sequencing.

Structural content of amino acids in two molecules

The Tuatara is a large-bodied unusually cold-tolerant reptile with a low standard metabolic rate (13 °C)^13,14. Two co-occurring mt-genomes may therefore be advantageous for metabolic flexibility in cool environments. Adaptations in transmembrane proteins have been shown to have a high bearing on environmental fitness in extreme environments²⁷. Selection analysis of codon positions with PAML (Supplementary Table 1) identifies a consistent pattern of purifying selection in all mitochondrial-encoded proteins, with an average ratio of non-synonymous to synonymous changes of ω = 0.096 ± 0.016 (Supplementary Table 2). However, PAML and the fixed effects approach in FEL detect a series of amino acids under putative positive selection in LAI M2 and show that the majority of them lie in transmembrane regions. A concentration of 43 out of 55 detected amino acid sites under putative positive selection (78%) reside in transmembrane regions of encoded ND1, ND2, ND4, COII, COIII, and Cytb (FDR ≤ 0.05; Fig. 6; Supplementary Table 3), contrasting typical patterns of mammalian evolution with strong enrichment of adaptive variation in loop regions²⁸. As amino acid changes in transmembrane regions have a stronger impact on protein function than in loop regions²⁹, the selective enrichment of transmembrane amino acid changes suggests divergence of molecules in relation to the species’ life history. Further work is needed to understand the biological implications of two deeply divergent Tuatara mt-genomes.

**Fig. 6: Positive selection of amino acids in protein structure.**

Discussion

The discovery of two deeply-divergent mt-genomes in the Tuatara has profound implications for our understanding of animal mitochondrial genome organization, inheritance, and evolution. The Tuatara is the sole surviving member of an ancient vertebrate lineage (Rhynchocephalia) providing a long period of isolated evolution (220–250 million years)⁷. Whether duplication of Tuatara mt-genomes arose via uni- or biparental mechanisms, and how they have been maintained for a 7–8 million-year period through a potential adaptive advantage is a dynamic new area for research.

DUI is the only well-studied mode of co-existing, deeply divergent mt-molecules found in single individuals, and thus far only confirmed in bivalvian mollusks⁵. In mollusks, the maternal copy is present in somatic tissues, whereas the paternal copy is found primarily in germ-line cells^5,26. Our discovery of two deeply divergent (10.4% sequence divergence) mt-genomes in the Tuatara is intriguing, but the origin, inheritance mechanism, and maintenance of the two molecules remains unknown. Thus, for now, our results do not conclusively support, nor refute a DUI hypothesis.

The two divergent mt-genomes were identified in a single male (the second mt-molecule at 1/7^th the concentration of the first) and their discovery was only achievable in combination with genome-wide sequencing¹² of DNA from blood. Additional samples were screened for a second mt-genome via long-read PacBio sequencing and molecule-specific PCR primers, yet were unsuccessful. These additional individuals were female, high molecular weight DNA extracted from liver was used, and these samples were not subjected to the same exhaustive genome-scale sequencing as our male sample. Additionally, the published transcriptome library¹⁸ was consolidated from multiple individual embryos sequenced at modest depth, and no signal of a second mt-genome was herein detected. The lack of detection in female samples could be attributable to simple unsuccessful attempts of PCR-primer specificity/preferential annealing and/or non-exhaustive genomic sequencing efforts.

Future studies seeking to confirm the widespread presence of two mt-molecules in the Tuatara should examine multiple individuals, of both sexes, across the geographic range of the Tuatara. Ideally, studies would be conducted in pedigrees to explore the inheritance of these molecules, and across multiple tissues to explore any tissue-specific patterns. Such a study is not trivial for a species that is protected under New Zealand and international law, and that is a taonga, or special treasure for those Maori iwi that are the kaitiaki, guardians, of the Tuatara. Our genome sequencing work was undertaken in partnership with Ngatiwai iwi, but any future studies in other locations would require further partnerships with Ngatiwai and another iwi across Aotearoa New Zealand to gain permissions to work on, sample, and sequence these taonga.

Tuatara mt-genomes show extreme rearrangement relative to other vertebrate mt-genomes. Tuatara mt-gene rearrangements include tRNA genes often found in mt-genomic rearrangements, but in the Tuatara with drastic distant locations relative to the standard vertebrate gene order⁶. Unlike most rearrangements among vertebrates, protein-coding and tRNA genes are switched and shuffled among duplicated origins of replication for the heavy strand (putative Control Regions, O_H). A third of the mt-genome is significantly rearranged, two protein-coding genes are switched (ND5 and ND6), tRNA genes are drastically reshuffled, and putative origins of replication are duplicated and inserted in this region (Fig. 1). Putative Control Regions are observed in triplicate going through concerted evolution, and an unusual stem-and-loop structure is discovered with overlapping sequences of tRNA^Asn and tRNA^Cys in the location of light-strand replication (O_L). These observations in totality reinforce the idea of replication errors driving mt-genomic rearrangement⁶.

Tandem complementary efforts of high-throughput short-read sequencing with newly advanced long-read sequencing identify two deeply divergent mt-genomes escaping science for decades. The genomic structure is elucidated via these techniques with a single Oxford Nanopore read covering both DNA strands, signaling a new era of organellar genomics with whole molecule sequencing.

Methods

Sampling

The LAI sample is a male and NCBI Biosample SAMN08793959 with the four Stephens Island samples all being female: (SI-1) SAMN10598677, (SI-2) SAMN10598679, (SI-3) SAMN10598680, and (SI-4) SAMN00855319 with published transcriptome data in SRA051647¹⁸. Blood of LAI was collected with ethical permissions from Victoria University of Wellington and under permits supplied by the Department of Conservation, New Zealand.

Sanger sequencing of Stephens Island samples (SI-1–3)

Shotgun Sanger sequencing of SI-1 PCR fragments from COIII to 12 S rRNA follows methods previously described³⁰, with amplifications conducted using L9940 5’-GCAGCATGATACTGACACTTYGT-3’ and H1067⁶. A MegaBACE 1000 (Amersham) DNA sequencer ran 768-reads that were assembled using Phrap. Sanger sequencing of SI-2 and SI-3 used two forward REX26_ND5F1 5’-GTGCACTAACACAAAACGATATC-3’ and REX27_ND5F1 5’-GCGCACTGACACAAAATGATATT-3’, and two reverse primers REX26_ND5R1 5’-GGATTCCTCCTATTTTTCGAATG-3’ and REX27_ND5R1 5’- GGATTCCTCCTATTTTTCAGATA-3’ designed in ND5 from SI-1 sequences. Amplifications applied forward ND5 primers with H1067⁶ and reverse ND5 primers with L9940. End-sequencing was done on all PCR fragments with internal reactions using “ND4” ³¹ in ND4 and “IguaCytBR2”³² in Cytb. Reactions were run on an ABI3730 Sequencer (2011 Life Technologies) with 900 chemistry (Supplementary Method 1).

Illumina data collection of the LAI sample

Total genomic DNA was extracted using proteinase K digestion and Phenol-Chloroform extraction from blood. Sequencing was undertaken using the Illumina HiSeq 2000 and 2500 as well as MiSeq sequencing platforms (Illumina, San Diego). Sequencing libraries consisted of paired-end (PE) libraries with estimated insert sizes of 180 bp, 350 bp, and 550 bp and three mate-paired (MP) libraries with estimated insert sizes of 2500 bp, 5000 bp, and 8000 bp. The PE libraries were prepared using the Illumina TruSeq PCR-Free DNA library kit, while the mate-pair libraries were prepared using the Illumina TruSeq DNA library kit as per the manufacturer’s instructions. These libraries were normalized and pooled across 32 lanes on an Illumina HiSeq 2000 or 2500 using 2 × 100 bp PE sequencing at New Zealand Genomics Ltd., Dunedin. We further supplemented these data with additional Illumina TruSeq and Kappa DNA libraries, with insert sizes of 400 bp and 480 bp, respectively. These libraries were normalized and pooled across five Illumina MiSeq 2 × 250 bp runs via New Zealand Genomics Ltd., Dunedin (Supplementary Method 2).

Illumina assembly of LAI

Illumina reads were mined from total genomic shotgun data for mtDNA reads using the first previously published mt-genome⁸ (AF53439) and initial contigs from two PCR fragments obtained from Sanger sequencing of SI-1 with ND5, tRNA^His, and tRNA^Thr herein reported. Illumina HiSeq reads were extracted from eleven PE 100 bp data-sets with an insert size of 180 bp (totaling ~1.9 billion read-pairs) and two additional data-sets with an insert size of 2500 bp (totaling ~47 million read-pairs), and one data-set of 5000 bp (~128 million read-pairs), respectively, using Bowtie 2³³. Extracted reads were cleaned and only properly paired read-pairs were kept for further analysis. In total, 156,012 reads were obtained that could be used for the assembly of mtDNA. First, the insert size for all mapped reads was calculated and only reads where the determined insert size ranged between 100–200 bp (for the 180 bp library) and 2000–3000 bp (for the 2500 bp library) were used to perform the assembly. The initial assembly was created using MaSuRCA assembler³⁴. Next, all contigs mapping to the two PCR fragments obtained from Sanger sequencing of SI-1 reported here were used to fine-tune the Illumina assembly using Minimus³⁵. This draft assembly of Illumina HiSeq reads had an average coverage of 739.2 reads (std dev = 200.7; min = 14, max = 1625). In order to evaluate the Minimus assembly, 71mers were counted with one PE 180 bp data-set using Jellyfish³⁶ that allowed the identification of a repeat region missed by the initial assembly. After including the repeat into the mtDNA sequence, we obtained a final sequence containing 18,078 bp. The final assembly had an average coverage of 723.2 reads (std dev = 178.3; min = 12, max = 1502). Further examination of Illumina data suggested there may be a second copy of the mt-genome. Initially, all reads mentioned above were de novo assembled in Geneious v10.2.4 (Biomatters, Auckland) with the highest sensitivity producing 937 contigs. Recovered contigs were backmapped to the initial Illumina LAI M1 assembly. Three of the 937 contigs nearly covered the entire molecule (M1) with ~98% identity. Twenty-seven additional contigs that mapped were within 90% identity to the initial LAI M1 mt-genome. These contigs were manually assembled to construct a nearly complete second mt-genome (M2). Therefore, all available Illumina reads were re-mined for mtDNA from Illumina HiSeq PE 100 bp reads from (a) eleven data-sets with an insert size of 180 bp (totaling ~1.9 billion read-pairs), (b) eight data-sets with an insert size of 350 bp (totaling ~446 million read-pairs), (c) eight data-sets with an insert size of 550 bp (totaling ~316 million read-pairs), (d) one data-set with an insert size of 2500 bp (~47 million read-pairs), (e) three data-sets with an insert size of 5000 bp (totaling ~307 million read-pairs), (f) four data-sets with an insert size of 8000 bp (totaling ~490 million read-pairs); and Illumina MiSeq PE 250 bp reads from (g) four data-sets with an insert size of 400 bp (totaling ~50 million read-pairs), and (h) one data-set with an insert size of 480 bp (~17 million read-pairs). This produced 104,825 paired-read sets with a total of 209,650 reads matching mtDNA. The 209,650 reads recovered were de novo assembled in Geneious with the highest sensitivity level, using all paired-read information that produced 856 contigs. Consensus sequences from these contigs were then backmapped to the 18,078 bp draft LAI M1 mt-genome obtained from the above assembly and to the draft M2 in Geneious. Contigs were screened for a minimum of 5% sequence divergence to either the 18,078 bp draft LAI M1 or the draft M2. This produced 19 contigs assigned to M2 for further evaluation. The 19 contigs containing raw reads were separately assembled producing 5 contigs that were manually assembled following manual trimming. These final contigs were backmapped to the draft M2, forming a contig containing 30,005 reads with an average coverage of 169.2 reads (std dev = 102.7; min = 3, max = 508). Ambiguities in M2 were resolved and a final second mt-genome was identified and annotated. Of the 856 contigs discovered above not assigned to M2, nine contigs were deemed unusable. The remaining 828 contigs were backmapped to LAI M1 producing a contig with 176,980 reads having an average coverage of 1007.5 reads (std dev = 213.8; min = 25, max = 1791).

DNA preparation for Oxford Nanopore sequencing of LAI

Oxford Nanopore work relied on the sample being subsequently shipped between the University of Otago, New Zealand and San Diego Zoo, California, USA using CITES institutional transfers, together with supporting permits to export and import from the New Zealand Department of Conservation and US Fish and Wildlife Service, respectively. A partial sample was shipped to Dovetail (Santa Cruz, CA) for work on the Tuatara genome. Dovetail extracted DNA from ~100 µl of snap-frozen blood using the Qiagen Blood & Cell Culture DNA Midi Kit (Hilden, Germany), yielding 242 ng/µl provided in TE buffer (runs 1–6). A partial blood sample was shipped to SeqMatic LLC (Fremont, CA) for work on the Tuatara genome. SeqMatic (Fremont, CA) extracted DNA from ~50 µl of snap-frozen blood using an enzymatic DNA extraction with (run 7) and without phenol-chloroform (run 8), yielding 131 ng/µl and 79 ng/µl in TE buffer. Following extractions, DNA was stored and transferred to the Peralta Genomics Laboratory (Alameda, CA) at 4°C and never frozen (Supplementary Method 3).

Oxford Nanopore sequencing of LAI

Eight Oxford Nanopore runs using R9.4 chemistry were conducted applying Minion protocol version GDE_9002_v108_revT_18Oct2016 using a nick-repair enzyme (NEBNext FFPE Repair Mix). To increase library yield, NEBNext Ultra II End Repair/dA-Tailing Module was used to eliminate an additional Solid Phase Reversible Immobilization (SPRI)³⁷ cleanup step with Agencourt AMPure XP beads. To increase library yield, 2 µl of concentrated NEB T4 DNA ligase was added during adaptor ligation. The general protocol consists of four steps. Step 1, DNA Nick-Repair, Blunt-Ending, and End Repair: Add in a clean low bind PCR tube, (a) ~ 2 µg genomic Tuatara DNA as described above of 8.27 µl, (b) NEBNext FFPE Repair Mix of 3 µl, (c) NEBNext Ultra II End Repair/dA-Tailing Mix of 3 µl, (d) NEBNext Ultra II End Repair/dA-Tailing Buffer of 7 µl, (e) 100X NAD + of 0.6 µl, and (f) 10 mM Tris HCl pH 8.5 buffer of 38.13 µl for a total volume of 60 µl. This is mixed gently via inversion and incubated at 20 °C for 60 min and 65 °C for 30 min in a thermocycler. Step 2, SPRI Cleanup: add to the above (a) Agencourt AMPure XP beads of 60 µl and let stand for 2 min followed by discarding supernatant while on the magnet, (b) wash with 70% EtOH of 140 µl X 2, and (c) elute in Nuclease-free water of 31 µl. Step 3, Adaptor Ligation: add to (a) eluted end-prepped DNA above of 30 µl, (b) 1D Adapter Mix of 20 µl, and (c) NEB Instant Sticky End Ligase of 50 µl, for a total volume of 100 µl. This is mixed gently via inversion and incubated at 20 °C for 10 min in a thermocycler. Step 4, Library Purification, with additional SPRI Cleanup: add to the above (a) Agencourt AMPure XP beads of 40 µl and let stand for 2 min followed by discarding supernatant while on the magnet, (b) wash with Adaptor Bead Binding Buffer of 140 µl X 2, and (c) elute in Oxford Nanopore Elution Buffer of 25 µl for DNA sequencing. Deviations to the above protocol are described. Deviations I, DNA Integrity and Sizing: DNA was kept in high integrity for runs 3, 5, 6, 7, and 8 but DNA shearing was conducted in two ways for runs 1, 2, and 4. Run number 4 sheared DNA to a target of 5 kb from ~ 2 µg of Tuatara DNA using a Covaris LE220R with (a) Peak Incident Power (W) 100, (b) Duty Factor 20%, (c) Cycles per Burst 1000, and (d) Treatment Time (s) 600. Runs 1 and 2 sheared DNA to a target of 10 kb from ~ 2 µg of Tuatara DNA using a Covaris g-tube. It is noteworthy that while subjecting DNA to shearing a direct following step implementing nick-repair was applied as described above. Final products from shearing procedures required volume adjustments between input DNA (from shearing in buffer) and 10 mM Tris HCl pH 8.5 buffer in the starting protocol of step 1 above to keep relative concentrations equivalent. Deviations II, Enzymatic Cleanup with Additional SPRI Cleanup: For runs 2, 5, 6, 7, and 8, replacements steps from steps above apply an activated immobilized trypsin resin (ThermoFisher Scientific). Step 2.1, to activate trypsin resin, the resin was gently mixed adding 20 µl of resin to 100 µl of 10 mM Tris HCl pH 8.5 buffer in a 1.5 ml tube. This was inverted for mixing followed by a 6000 rpm spin, with the supernatant discarded; this was repeated two additional times, with 20 µl of 10 mM Tris HCl pH 8.5 buffer added to the end product. Step 2.2, the (a) end-repaired DNA from step 1 of 60 µl, and (b) activated trypsin resin of 20 µl from step 2.1 were (c) incubated at 37 °C for 30 min. Only 60 µl of the total volume of 80 µl was retained leaving 20 µl of pelleted resin behind. Step 3, add to (a) eluted end-prepped DNA above of 60 µl, (b) 1D Adapter Mix of 20 µl, and (c) NEB Instant Sticky End Ligase of 80 µl, for a total volume of 160 µl. This is mixed gently via inversion and incubated at 20 °C for 10 min in a thermocycler. Step 4, add to the above (a) Agencourt AMPure XP beads of 64 µl and let stand for 2 min followed by discarding supernatant while on the magnet, (b) wash with Adaptor Bead Binding Buffer of 140 µl X 2, and (c) elute in Oxford Nanopore Elution Buffer of 25 µl for DNA sequencing. Deviations III, 2D Ligation and Library Preparation: For run 3, 2D chemistry requires replacement steps. Step 3, add to (a) eluted end-prepped DNA above of 30 µl, (b) 2D Adapter Mix of 10 µl, (c) dH2O of 5 µl, (d) HP Adapter of 2 µl with (d) NEB Instant Sticky End Ligase of 50 µl and (e) incubated for 10 min at room temp; followed with (f) HP tether of 1 µl and (g) 2 µl of concentrated NEB T4 DNA ligase added with (h) additional incubation at room temp for 10 min. The 2 µl of concentrated NEB T4 DNA ligase was added during adaptor ligation with the HP Tether to increase library yield. The total yield is 100 µl for this step. Step 4.1, MyOne C1 Streptavidin bead preparation. Add in a 1.5 ml Eppendorf DNA LoBind tube (a) MyOne C1 Streptavidin beads (Invitrogen) of 50 µl and (b) pellet beads on the magnet for 2 min; (c) discard supernatant and (d) add Oxford Nanopore Bead Binding Buffer of 140 µl; (e) vortex until homogeneous and (f) pellet on a magnet for 2 min; (g) discard supernatant and (h) repeat Oxford Nanopore Bead Binding Buffer wash step of 140 µl with pelleting on a magnet for 2 min X 2; (i) add Bead Binding Buffer of 100 µl and (j) label tube as “Washed Beads” for binding step. Step 4.2, for binding add (a) Washed Beads from previous step 4.1 of 100 µl to the tube containing the Ligated DNA in step 3 of 100 µl and (b) incubate at room temperature for 5 min; for elution add (c) Oxford Nanopore elution buffer to DNA-bound beads of 25 µl, (d) incubate tube on the hot block at 37 °C for 10 min, (e) pellet beads on the magnet for 2 min, and (f) transfer supernatant containing library into a clean 1.5 ml Eppendorf DNA LoBind tube for DNA sequencing. Libraries were run on SQK-108, SQK-208, and SQK-LSK-109 flow-cells.

Oxford Nanopore data and evaluation of LAI

Oxford Nanopore 2D reads were extracted using Nanopolish³⁸. By default, Nanopolish extracts reads for export either as a 2D consensus read or a 1D template read; hence there is only a single read per double-stranded (2D) or single-stranded (1D) read. Guppy (Oxford Nanopore) was only used to process 1D reads. All Oxford Nanopore reads obtained from genomic data were compiled into a single database and the Tuatara mt-genomic data were blasted against this database using the default parameters for Blastn and Megablast. This was done with both LAI M1 and M2 Illumina data assemblies reported here in separate searches, as the Rest et al⁸. mt-genome was discovered to lack a segment of the mt-genome. Following this 1.5 kb, 3 kb, and 6 kb random seeds were applied from the LAI M1 and M2 Illumina data assemblies reported here in separate searches to further maximize mitochondrial Oxford Nanopore read recovery. Reads less than 500 bp in length were no longer considered. To evaluate if recovered mt-genomic Oxford Nanopore reads were M1 or M2, matches were made to both the LAI Illumina assembly of M1 or the LAI Illumina assembly of M2, separately. This was done in three separate iterations per molecule using NCBI (a) Blastn, (b) Megablast, and (c) Discontiguous Megablast. A cut-off of 5% difference between molecules was adopted because the two Illumina draft molecules are ~10% sequence divergence and Oxford Nanopore reads have a bias of including numerous gaps representing unread bases. Oxford Nanopore sequences assigned to M1 or M2 were separately backmapped to their respective Illumina draft mt-genome molecules using Geneious with default parameters to evaluate depth coverage.

PacBio sequencing and assembly of Stephen Island sample 3 (SI-3)

Genomic DNA was extracted using the Qiagen Genomic Tip DNA extraction kit from ~20 mg of liver tissue. A PacBio SMRTbell library was prepared using the SMRTbell Express Template Preparation Kit (Pacific Biosciences). A 12 kb and above size selection with the Sage Science BluePippin system was implemented. Prepared libraries were run on the PacBio Sequel platform using version 2.1 chemistry. Two Single-Molecule Real-Time (SMRT) Cells were sequenced, where each library was sequenced on one SMRT cell with 360 min movie lengths. PacBio raw reads were assembled in CANU version 1.8 (Supplementary Method 4).

Transcriptome library mining

The published Tuatara transcriptome library (SI-4)¹⁸ was downloaded. Initial work discovered contaminated Rattus leucopus DNA sequences. The mt-genome of R. leucopus³⁹ was used to purge Rattus mt-DNA from the Tuatara transcriptome library by using the ‘Map to Reference’ function in Geneious and saving a list of unused reads. The Rattus-free mt-DNA reads were mapped to LAI M1 producing a full-coverage of the mt-genome.

Mt-genome alignment for sequence divergence

Complete mt-genome alignments for pair-wise sequence divergence were conducted in Geneious using MUSCLE version 3.8.425, with adjustments to comply with secondary structures of encoding tRNA genes⁴⁰. The end of the third non-coding block was not alignable and excluded (last 404 bp of NC3 in LAI M2, positions 17912–18315; last 230 bp of NC3 in LAI M1, positions 17849–18078; last 230 bp of NC3 in SI-3, positions 17849–18078; last 230 bp of NC3 in SI-4, positions 17842–18071).

Phylogenetic analysis of duplicate mt-genomes and duplicate non-coding blocks

Phylogenetic sampling used LAI M1 and M2 with samples from eight northern islands (Plate, Cuvier, Green Mercury, Middle Mercury, Stanley, Red Mercury, Hen, and Poor Knight) and two southern islands (Stephens and North Brother); GenBank numbers KP996609–19, 40–41^9,10 (Supplementary Method 5). The LAI M2 is 7 encoded amino acid positions longer in Cytb than sequences deemed molecule 1. Gaps were placed in all molecule 1 Cytb sequences: (a) after encoded amino acid position 3 (nucleotide position 9) 5 encoded amino acid positions were entered as gaps (15 nucleotide positions), and (b) after encoded amino acid position 291 (nucleotide position 873), and 2 encoded amino acid positions were entered as gaps (6 nucleotide positions). ND5 was not included because it was not reported in all previous studies^8,9,10. Overlapping regions of bicistronic-encoded genes were excluded: Atp8 and Atp6 (last 96 nucleotides of Atp8), and ND4L and ND4 (last 7 nucleotides of ND4L). To evaluate duplicated non-coding block sequences (NC1–3) from both LAI mt-genomes, they were compared to sequences from SI-4 of the published transcriptome library (Supplementary Method 6). The SI-3 sample was not included because of ambiguities in sequence. Sequences were aligned in Geneious using MUSCLE version 3.8.425. Only areas in common between NC1–3 sequences were evaluated. Phylogenetic estimates were conducted using branch-and-bound searches in PAUP* version 4.0⁴¹ with bootstraps generated via 1000 replicates. Decay indices were generated with successive searches retaining suboptimal trees, and computing tree length difference to the overall shortest tree(s). Two-tailed Wilcoxon signed-ranks tests^42,43 incorporating a correction for tied ranks were used to examine the statistical significance of alternative phylogenetic hypotheses with the shortest estimate(s). Alternative phylogenetic estimates were discovered via searches applying constraint trees that saved suboptimal trees compatible with an alternative phylogenetic hypothesis, using MacClade version 4.03⁴⁴ for constraint tree construction and search implementation in PAUP* version 4.0⁴¹.

Protein evaluation

Aligned protein-coding regions of LAI M1 & M2, SI-3, SI-4, and GenBank KP996609–19, 40–41^9,10 were subjected to PAML-CodeML⁴⁵ (Supplementary Table 1) using ETE3⁴⁶ and to FEL⁴⁷ using HyPhy⁴⁸ to detect selective pressures on individual sites of the LAI M2 lineage. For ND5, the only available LAI M1 & M2, SI-3, and SI-4 sequences were used. Within PAML, we used Bayes Empirical Bayes (BEB)⁴⁹ with a threshold of P > 95% as evidence for sites under putative positive selection. Within FEL, the resulting p values were adjusted for multiple testing using FDR (false discovery rate⁵⁰) across sites. For protein structure visualization, ConSurf Server^51,52 uses HMMER to search for homologs in the UNIREF-90 database (E-value cut-off of 0.0001), and 150 sampled protein sequences (min/max identity of 35%/95%) are aligned using MAFFT-L-INS-i. HHPred and MODELER then apply the best evolutionary substitution model (JTT) to predict the 3D structure of proteins⁵². The structures are visualized using PyMOL⁵³ (see Supplementary Method 7 and Supplementary Tables 1–3).

Statistics and reproducibility

Phylogenetic statistical analyses are described in Methods. Mt-genome coverage statistics (e.g., average, min/max, depth) were calculated in Geneious. Protein evaluation statistical analyses used Python (http://www.python.org), applying multiple tests library for multiple testing correction. ETE3⁴⁶ was used to perform PAML-CodeML⁴⁵ selection analyses, and HyPhy⁴⁸ to perform FEL⁴⁷ selection analyses. Additional details of all experiments are presented in Supplementary Information.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

New DNA sequences generated in this study are deposited as GenBank MN864228– MN864230 and Sequence Read Archive (SRA) PRJNA445603. Published transcriptome sequences¹⁸ used in this study are from SRA SRA051647 and the complete mt-genome from that transcriptome is deposited as a Third Party Annotation in the DDBJ/ENA/GenBank databases as TPA: BK012001.

References

Breton, S. & Stewart, D. T. Atypical mitochondrial inheritance patterns in eukaryotes. Genome 58, 423–431 (2015).
Article CAS PubMed Google Scholar
Hedberg, A. et al. Cancer-specific SNPs originate from low-level heteroplasmic variants in human mitochondrial genomes of a matched cell line pair. Mitochondrial DNA 30, 82–91 (2019).
Article CAS PubMed Google Scholar
Bratic, A. & Larsson, N.-G. The role of mitochondria in aging. J. Clin. Invest. 123, 951–957 (2013).
Article CAS PubMed PubMed Central Google Scholar
Wei, W. et al. Nuclear-mitochondrial DNA segments resemble paternally inherited mitochondrial DNA in humans. Nat. Commun. 11, 1740 (2020).
Article CAS PubMed PubMed Central Google Scholar
Zouros, E. Biparental inheritance through uniparental Transmission: the doubly uniparental inheritance (DUI) of mitochondrial DNA. Evol. Biol. 40, 1–31 (2013).
Article Google Scholar
Macey, J. R., Larson, A., Ananjeva, N. B., Fang, Z. & Papenfuss, T. J. Two novel gene orders and the role of light-strand replication in rearrangement of the vertebrate mitochondrial genome. Mol. Biol. Evol. 14, 91–104 (1997).
Article CAS PubMed Google Scholar
Daugherty, C. H., Patterson, G. B. & Hitchmough, R. A. Taxonomic and conservation review of the New Zealand herpetofauna. N.Z. J. Zool. 21, 317–323 (1994).
Article Google Scholar
Rest, J. S. et al. Molecular systematics of primary reptilian lineages and the Tuatara mitochondrial genome. Mol. Phylogenet. Evol. 29, 289–297 (2003).
Article CAS PubMed Google Scholar
Mohandesan, E., Subramanian, S., Millar, C. D. & Lambert, D. M. Complete mitochondrial genomes of Tuatara endemic to different islands of New Zealand. Mitochondrial DNA 26, 25–26 (2015). online 2013.
Article CAS PubMed Google Scholar
Subramanian, S., Mohandesan, E., Millar, C. D. & Lambert, D. M. Distance-dependent patterns of molecular divergences in Tuatara mitogenomes. Sci. Rep. 5, 8703 (2015).
Article CAS PubMed PubMed Central Google Scholar
Pett, W. & Lavrov, D. V. Cytonuclear interactions in the evolution of animal mitochondrial tRNA metabolism. Genome Biol. Evol. 7, 2089–2101 (2015).
Article CAS PubMed PubMed Central Google Scholar
Gemmell et al. The tuatara genome reveals ancient features of amniote evolution. Nature 584, 403–409 (2020).
Article CAS PubMed PubMed Central Google Scholar
Thompson, M. B. & Daugherty, C. H. Metabolism of Tuatara, Sphenodon punctatus. Comp. Biochem. Physiol. 119A, 519–522 (1998).
Article CAS Google Scholar
Jarvie, J., Jowett, T., Thompson, M. B., Seddon, P. J. & Cree, A. Effects of warm temperatures on metabolic rate and evaporative water loss in Tuatara, a cool-climate Rhynchocephalian survivor. Physiol. Biochem. Zool. 91, 950–966 (2018).
Article PubMed Google Scholar
Macey, J. R. et al. Phylogenetic relationships among Agamid lizards of the Laudakia caucasia species group: testing hypotheses of biogeographic fragmentation and an area cladogram for the Iranian Plateau. Mol. Phylogenet. Evol. 10, 118–131 (1998).
Article CAS PubMed Google Scholar
Macey, J. R. et al. A molecular phylogenetic hypothesis for the Asian agamid lizard genus Phrynocephalus reveals discrete biogeographic clades implicated by plate tectonics. Zootaxa 4467, 1–81 (2018).
Article PubMed Google Scholar
Weisrock, D. W., Macey, J. R., Ugurtas, I. H., Larson, A. & Papenfuss, T. J. Molecular phylogenetics and historical biogeography among salamandrids of the "true" salamander clade: rapid branching of numerous highly divergent lineages in Mertensiella luschani associated with the rise of Anatolia. Mol. Phylogenet. Evol. 18, 434–448 (2001).
Article CAS PubMed Google Scholar
Miller, H. C., Biggs, P. J., Voelckel, C. & Nelson, N. J. De novo sequence assembly and characterisation of a partial transcriptome for an evolutionarily distinct reptile, the tuatara (Sphenodon punctatus). BMC. Genom. 13, 439 (2012).
Article CAS Google Scholar
Seutin, G., Lang, B. E., Mindell, D. I. & Morais, R. Evolution of the WANCY region in amniote mitochondrial DNA. Mol. Biol. Evol. 11, 329–340 (1994).
CAS PubMed Google Scholar
Macey, J. R., Larson, A., Ananjeva, N. B. & Papenfuss, T. J. Replication slippage may cause parallel evolution in the secondary structures of mitochondrial transfer RNAs. Mol. Biol. Evol. 14, 30–39 (1997).
Article CAS PubMed Google Scholar
Arnheim, N. et al. Molecular evidence for genetic exchanges among ribosomal genes on nonhomologous chromosomes in man and apes. Proc. Natl Acad. Sci. USA 77, 7323–7327 (1980).
Article CAS PubMed PubMed Central Google Scholar
Kumazawa, Y., Ota, H., Nishida, M. & Ozawa, T. Gene rearrangements in snake mitochondrial genomes: highly concerted evolution of control-region-like sequences duplicated and inserted into a tRNA gene cluster. Mol. Biol. Evol. 13, 1242–1254 (1996).
Article CAS PubMed Google Scholar
Castoe, T. A., Jiang, Z. J., Gu, W., Wang, Z. O. & Pollock, D. D. Adaptive evolution and functional redesign of core metabolic proteins in snakes. PLoS One 3(5), e2201 (2008).
Article PubMed PubMed Central Google Scholar
Ujvari, B., Dowton, M. & Madsen, T. Mitochondrial DNA recombination in a free-ranging Australian lizard. Biol. Lett. 3, 189–192 (2007).
Article CAS PubMed PubMed Central Google Scholar
Passamonti, M. An unusual case of gender-associated mitochondrial DNA heteroplasmy: the mytilid Musculista senhousia (Mollusca Bivalvia). BMC Evol. Biol. 7(Suppl. 2), S7 (2007).
Article PubMed PubMed Central Google Scholar
Ghiselli, F., Milani, L. & Passamonti, M. Strict sex-specific mtDNA segregation in the germ line of the DUI species Venerupis philippinarum (Bivalvia: Veneridae). Mol. Biol. Evol. 28, 949–961 (2011).
Article CAS PubMed Google Scholar
Morin, P. A. et al. Demography or selection on linked cultural traits or genes? Investigating the driver of low mtDNA diversity in the sperm whale using complementary mitochondrial and nuclear genome analyses. Mol. Ecol. 27, 2604–2619 (2018).
Article PubMed Google Scholar
da Fonseca, R. R., Johnson, W. E., O’Brien, S. J., Ramos, M. J. & Antunes, A. The adaptive evolution of the mammalian mitochondrial genome. BMC Genomics 9, 119 (2008).
Article PubMed PubMed Central Google Scholar
Saier, M. H. Computer-aided analyses of transport protein sequences: gleaning evidence concerning function, structure, biogenesis, and evolution. Microbiol. Rev. 58, 71–93 (1994).
Article CAS PubMed PubMed Central Google Scholar
Macey, J. R., Papenfuss, T. J., Kuehl, J. V., Fourcade, H. M. & Boore, J. L. Phylogenetic relationships among amphisbaenian reptiles based on complete mitochondrial genomic sequences. Mol. Phylogenet. Evol. 33, 22–31 (2004).
Article CAS PubMed Google Scholar
Arevalo, E., Davis, S. K. & Sites, J. W. Mitochondrial DNA sequence divergence and phylogenetic relationships among eight chromosome races of the Sceloporus grammicus complex (Phrynosomatidae) in central Mexico. Syst. Biol. 43, 387–418 (1994).
Article Google Scholar
Corl, A., Davis, A. R., Kuchta, S. R., Comendant, T. & Sinervo, B. Alternative mating strategies and the evolution of sexual size dimorphism in the side-blotched lizard, Uta stansburiana: a population-level comparative analysis. Evolution 64, 79–96 (2010).
Article PubMed Google Scholar
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Article CAS PubMed PubMed Central Google Scholar
Zimin, A. V. et al. The MaSuRCA genome assembler. Bioinformatics 29, 2669–2677 (2013).
Article CAS PubMed PubMed Central Google Scholar
Sommer, D. D., Delcher, A. L., Salzberg, S. L. & Pop, M. Minimus: a fast, lightweight genome assembler. BMC Bioinformatics 8, 64 (2007).
Article PubMed PubMed Central Google Scholar
Marçais, G. & Kingsford, C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics 27, 764–770 (2011).
Article PubMed PubMed Central Google Scholar
Hawkins, T. L., O’Connor-Morin, T., Roy, A. & Santillan, C. DNA purification and isolation using a solid-phase. Nucleic Acids Res. 22, 4543–4544 (1994).
Article CAS PubMed PubMed Central Google Scholar
Loman, N. J., Quick, J. & Simpson, J. T. A complete bacterial genome assembled de novo using only nanopore sequencing data. Nat. Methods 12, 733–735 (2015).
Article CAS PubMed Google Scholar
Robins, J. H. et al. Evolutionary relationships and divergence times among the native rats of Australia. BMC Evol. Biol. 10, 375 (2010).
Article CAS PubMed PubMed Central Google Scholar
Macey, J. R. & Verma, A. Homology in phylogenetic analysis: alignment of transfer RNA genes and the phylogenetic position of snakes. Mol. Phylogenet. Evol. 7, 272–279 (1997).
Article CAS PubMed Google Scholar
Swofford, D. L. PAUP* Phylogenetic Analysis Using Parsimony (* and Other Methods), Beta Version 4.0 (Sinauer, Sunderland, MA, 2002).
Google Scholar
Templeton, A. R. Phylogenetic inference from restriction endonuclease cleavage site maps with particular reference to the evolution of humans and the apes. Evolution 37, 221–244 (1983).
Article CAS PubMed Google Scholar
Felsenstein, J. Confidence limits on phylogenies with a molecular clock. Syst. Zool. 34, 152–161 (1985).
Article Google Scholar
Maddison, W. P. & Maddison, D. R. MacClade, Analysis of Phylogeny and Character Evolution, Version 4.03 (Sinauer, Sunderland, MA, 2001).
Google Scholar
Yang, Z. H. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591 (2007).
Article CAS PubMed Google Scholar
Huerta-Cepas, J., Serra, F. & Bork, P. ETE 3: reconstruction, analysis, and visualization of phylogenomic data. Mol. Biol. Evol. 33, 1635–1638 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kosakovsky Pond, S. L. & Frost, S. D. W. Not so different after all: a comparison of methods for detecting amino acid sites under selection. Mol. Biol. Evol. 22, 1208–1222 (2005).
Article PubMed Google Scholar
Pond, S. L., Frost, S. D. & Muse, S. V. HyPhy: hypothesis testing using phylogenies. Bioinformatics 21, 676–679 (2005).
Article CAS PubMed Google Scholar
Yang, Z., Wong, W. S. & Nielsen, R. Bayes empirical bayes inference of amino acid sites under positive selection. Mol. Biol. Evol. 22, 1107–1118 (2005).
Article CAS PubMed Google Scholar
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. 57, 289–300 (1995).
Google Scholar
Landau, M. et al. ConSurf 2005: the projection of evolutionary conservation scores of residues on protein structures. Nucleic Acids Res. 33, W299–W302 (2005).
Article CAS PubMed PubMed Central Google Scholar
Ashkenazy, H. et al. ConSurf 2016: an improved methodology to estimate and visualize evolutionary conservation in macromolecules. Nucleic Acids Res. 44, W344–W350 (2016).
Article CAS PubMed PubMed Central Google Scholar
The PyMOL Molecular Graphics System, Version 2.0 Schrödinger, LLC.
Macey, J. R., Larson, A., Ananjeva, N. B. & Papenfuss, T. J. Evolutionary shifts in three major structural features of the mitochondrial genome among Iguanian lizards. J. Mol. Evol. 44, 660–674 (1997).
Article CAS PubMed Google Scholar
Brennicke, A. & Clayton, D. A. Nucleotide assignment of alkali-sensitive sites in mouse mitochondrial DNA. J. Biol. Chem. 256, 10613–10617 (1981).
Article CAS PubMed Google Scholar
Hixson, J. E., Wong, T. W. & Clayton, D. A. Both the conserved stem-loop and divergent 5'-flanking sequences are required for initiation at the human mitochondrial origin of light-strand DNA replication. J. Biol. Chem. 261, 2384–2390 (1986).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We thank Clive Stone (Ngatiwai) cultural liaison for the Tuatara genome project for his support and cultural leadership. We thank Ngatiwai, tangata whenua of the rohe from which the Tuatara for genome sequencing was sampled, for supporting the permits and permissions needed to collect, export and sequence Tuatara blood samples. We also thank Lindsay Anderson and Dr. Nicola Nelson (VUW) who collected the blood sample used for the Tuatara genome. Tuatara genome sequencing was supported by grants from the Allan Wilson Center for Molecular Ecology and Evolution, and the University of Otago to NJG. We thank Oliver Ryder (San Diego Zoo) for facilitating sample import to the USA and Richard E. Green, Margot Hartley and Michelle Vierra of Dovetail Genomics (Santa Cruz, CA) for providing samples and details on DNA extraction. Steven L. Salzberg and Art Delcher provided valuable assistance in the assembly of Illumina data during early stages. Danny Lee at Seqmatic (Fremont, CA) provided laboratory assistance. Alex Copeland provided computational assistance. Laboratory advice was provided by Chris Bolds (Sage Science, Boston) and Damon Tigh (BioRad, CA). Marco Passamonti provided valuable information on DUI inheritance in mollusks. We are grateful to Johnathan B. Losos (Living Earth Collaborative, St. Louis) and the St. Louis Zoo for providing Stephens Island samples. We thank staff of the Laboratories of Analytical Biology, National Museum of Natural History (NMNH), Smithsonian Institution for the use of their computer resources, particularly Matt Kweskin. We thank Robert Costello, Smithsonian’s Q?rius, and Sydney Bergman for supporting ESB in this project. Kim Rutherford provided critical data transfer and associated files.

Author information

Authors and Affiliations

Peralta Genomics Institute, Chancellor’s Office, Peralta Community College District, 333 East 8th Street, Oakland, CA, 94606, USA
J. Robert Macey, Charles G. Barbieri, Dustin P. DeMeo & Aaron N. Elliott
AIT Austrian Institute of Technology, Center for Health and Bioresources, Molecular Diagnostics, Giefinggasse 4, 1210, Vienna, Austria
Stephan Pabinger
Global Genome Initiative, National Museum of Natural History, Smithsonian Institution, 1000 Constitution Ave., Washington, DC, 20560, USA
Ella S. Buring, Vanessa L. Gonzalez & Daniel G. Mulcahy
Department of Anatomy, University of Otago, PO Box 913, Dunedin, 9054, New Zealand
Lara Urban & Neil J. Gemmell
Biodiversity Institute and Natural History Museum, University of Kansas, 1345 Jayhawk Blvd., Lawrence, KS, 66045, USA
Paul M. Hime
LOEWE-Center for Translational Biodiversity Genomics, Senckenberg Museum, 60325, Frankfurt, Germany
Stefan Prost
South African National Biodiversity Institute, National Zoological Garden, Pretoria, 0184, South Africa
Stefan Prost

Authors

J. Robert Macey
View author publications
You can also search for this author in PubMed Google Scholar
Stephan Pabinger
View author publications
You can also search for this author in PubMed Google Scholar
Charles G. Barbieri
View author publications
You can also search for this author in PubMed Google Scholar
Ella S. Buring
View author publications
You can also search for this author in PubMed Google Scholar
Vanessa L. Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar
Daniel G. Mulcahy
View author publications
You can also search for this author in PubMed Google Scholar
Dustin P. DeMeo
View author publications
You can also search for this author in PubMed Google Scholar
Lara Urban
View author publications
You can also search for this author in PubMed Google Scholar
Paul M. Hime
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Prost
View author publications
You can also search for this author in PubMed Google Scholar
Aaron N. Elliott
View author publications
You can also search for this author in PubMed Google Scholar
Neil J. Gemmell
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Study design and supervision: J.R.M., D.G.M., and N.J.G. Writing: J.R.M., D.G.M., L.U., S.Pa., V.L.G., and N.J.G. Long-read sequencing: C.G.B., J.R.M., E.S.B., V.L.G., and D.G.M. Transcriptome analysis: E.S.B., V.L.G., and D.G.M. Protein analysis: L.U., J.R.M., D.G.M., and N.J.G. Data handling, deposition, and analysis: S.Pa., D.G.M., C.G.B., D.P.D., E.S.B., V.L.G., P.M.H., S.Pr., A.N.E., J.R.M., and N.J.G.

Corresponding author

Correspondence to J. Robert Macey.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Macey, J.R., Pabinger, S., Barbieri, C.G. et al. Evidence of two deeply divergent co-existing mitochondrial genomes in the Tuatara reveals an extremely complex genomic organization. Commun Biol 4, 116 (2021). https://doi.org/10.1038/s42003-020-01639-0

Download citation

Received: 13 May 2020
Accepted: 21 December 2020
Published: 29 January 2021
DOI: https://doi.org/10.1038/s42003-020-01639-0

This article is cited by

The invasive land flatworm Arthurdendyus triangulatus has repeated sequences in the mitogenome, extra-long cox2 gene and paralogous nuclear rRNA clusters
- Romain Gastineau
- Claude Lemieux
- Jean-Lou Justine
Scientific Reports (2024)
Novel mitochondrial genome rearrangements including duplications and extensive heteroplasmy could underlie temperature adaptations in Antarctic notothenioid fishes
- Bushra Fazal Minhas
- Emily A. Beck
- Julian Catchen
Scientific Reports (2023)
Signatures of positive selection in the mitochondrial genome of neotropical freshwater stingrays provide clues about the transition from saltwater to freshwater environment
- P. G. Nachtigall
- T. S. Loboda
- D. Pinhal
Molecular Genetics and Genomics (2023)
Inheritance through the cytoplasm
- M. Florencia Camus
- Bridie Alexander-Lawrie
- Gregory D. D. Hurst
Heredity (2022)
Comparative genomic analysis of vertebrate mitochondrial reveals a differential of rearrangements rate between taxonomic class
- Paula Montaña-Lozano
- Manuela Moreno-Carmona
- Carlos F. Prada
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.