Supplemental Information. Pervasive within-mitochondrion. Single-Nucleotide Variant Heteroplasmy. as Revealed by Single-Mitochondrion Sequencing

Hasonló dokumentumok
Mapping Sequencing Reads to a Reference Genome

Supporting Information

Supplementary Figure 1

Supplementary Table 1. Cystometric parameters in sham-operated wild type and Trpv4 -/- rats during saline infusion and

Supplementary materials to: Whole-mount single molecule FISH method for zebrafish embryo

SOLiD Technology. library preparation & Sequencing Chemistry (sequencing by ligation!) Imaging and analysis. Application specific sample preparation

Trinucleotide Repeat Diseases: CRISPR Cas9 PacBio no PCR Sequencing MFMER slide-1

University of Bristol - Explore Bristol Research

Correlation & Linear Regression in SPSS

Correlation & Linear Regression in SPSS

Statistical Inference

Involvement of ER Stress in Dysmyelination of Pelizaeus-Merzbacher Disease with PLP1 Missense Mutations Shown by ipsc-derived Oligodendrocytes

Forensic SNP Genotyping using Nanopore MinION Sequencing

Miskolci Egyetem Gazdaságtudományi Kar Üzleti Információgazdálkodási és Módszertani Intézet. Correlation & Linear. Petra Petrovics.

Miskolci Egyetem Gazdaságtudományi Kar Üzleti Információgazdálkodási és Módszertani Intézet Nonparametric Tests

Genome 373: Hidden Markov Models I. Doug Fowler

FAMILY STRUCTURES THROUGH THE LIFE CYCLE

discosnp demo - Peterlongo Pierre 1 DISCOSNP++: Live demo

Expression analysis of PIN genes in root tips and nodules of Lotus japonicus

Széchenyi István Egyetem

EN United in diversity EN A8-0206/419. Amendment

Miskolci Egyetem Gazdaságtudományi Kar Üzleti Információgazdálkodási és Módszertani Intézet. Hypothesis Testing. Petra Petrovics.

Miskolci Egyetem Gazdaságtudományi Kar Üzleti Információgazdálkodási és Módszertani Intézet. Nonparametric Tests. Petra Petrovics.

Statistical Dependence

A cell-based screening system for RNA Polymerase I inhibitors

Miskolci Egyetem Gazdaságtudományi Kar Üzleti Információgazdálkodási és Módszertani Intézet Factor Analysis

Supporting Information

Construction of a cube given with its centre and a sideline

T-helper Type 2 driven Inflammation Defines Major Subphenotypes of Asthma

A rosszindulatú daganatos halálozás változása 1975 és 2001 között Magyarországon

FÖLDRAJZ ANGOL NYELVEN

STUDENT LOGBOOK. 1 week general practice course for the 6 th year medical students SEMMELWEIS EGYETEM. Name of the student:

On The Number Of Slim Semimodular Lattices

Nan Wang, Qingming Dong, Jingjing Li, Rohit K. Jangra, Meiyun Fan, Allan R. Brasier, Stanley M. Lemon, Lawrence M. Pfeffer, Kui Li

16F628A megszakítás kezelése

Modular Optimization of Hemicellulose-utilizing Pathway in. Corynebacterium glutamicum for Consolidated Bioprocessing of

Limitations and challenges of genetic barcode quantification

Phenotype. Genotype. It is like any other experiment! What is a bioinformatics experiment? Remember the Goal. Infectious Disease Paradigm

PIACI HIRDETMÉNY / MARKET NOTICE

Tudományos Ismeretterjesztő Társulat

Flowering time. Col C24 Cvi C24xCol C24xCvi ColxCvi

Márkaépítés a YouTube-on

CLUSTALW Multiple Sequence Alignment

Cashback 2015 Deposit Promotion teljes szabályzat

Gottsegen National Institute of Cardiology. Prof. A. JÁNOSI

4 vana, vanb, vanc1, vanc2

ENROLLMENT FORM / BEIRATKOZÁSI ADATLAP

INDEXSTRUKTÚRÁK III.

Report on the main results of the surveillance under article 11 for annex II, IV and V species (Annex B)

Supporting Information

(NGB_TA024_1) MÉRÉSI JEGYZŐKÖNYV

Proxer 7 Manager szoftver felhasználói leírás

Miskolci Egyetem Gazdaságtudományi Kar Üzleti Információgazdálkodási és Módszertani Intézet. Correlation & Regression

Rezgésdiagnosztika. Diagnosztika

The role of aristolochene synthase in diphosphate activation

Mark Auspitz, Fayez Quereshy, Allan Okrainec, Alvina Tse, Sanjeev Sockalingam, Michelle Cleghorn, Timothy Jackson

University of Bristol - Explore Bristol Research

Report on the main results of the surveillance under article 11 for annex II, IV and V species (Annex B)

T Á J É K O Z T A T Ó. A 1108INT számú nyomtatvány a webcímen a Letöltések Nyomtatványkitöltő programok fülön érhető el.

Személyes adatváltoztatási formanyomtatvány- Magyarország / Personal Data Change Form - Hungary

Dr. Sasvári Péter Egyetemi docens

Cluster Analysis. Potyó László

Bird species status and trends reporting format for the period (Annex 2)

A vitorlázás versenyszabályai a évekre angol-magyar nyelvű kiadásának változási és hibajegyzéke

Performance Modeling of Intelligent Car Parking Systems

Bioinformatics: Blending. Biology and Computer Science

A BÜKKI KARSZTVÍZSZINT ÉSZLELŐ RENDSZER KERETÉBEN GYŰJTÖTT HIDROMETEOROLÓGIAI ADATOK ELEMZÉSE

Manuscript Title: Identification of a thermostable fungal lytic polysaccharide monooxygenase and

Report on the main results of the surveillance under article 11 for annex II, IV and V species (Annex B)

SAJTÓKÖZLEMÉNY Budapest július 13.

Suppl. Materials. Polyhydroxyalkanoate (PHA) Granules Have no Phospholipids. Germany

7 th Iron Smelting Symposium 2010, Holland

Report on the main results of the surveillance under article 11 for annex II, IV and V species (Annex B)

Using the CW-Net in a user defined IP network

Report on the main results of the surveillance under article 11 for annex II, IV and V species (Annex B)

Computer Architecture

The beet R locus encodes a new cytochrome P450 required for red. betalain production.

ELEKTRONIKAI ALAPISMERETEK ANGOL NYELVEN

Report on the main results of the surveillance under article 11 for annex II, IV and V species (Annex B)

Report on the main results of the surveillance under article 11 for annex II, IV and V species (Annex B)

Utasítások. Üzembe helyezés

Contact us Toll free (800) fax (800)

A magyarországi Gauss-Krüger-vetületû katonai topográfiai térképek dátumparaméterei

Report on the main results of the surveillance under article 11 for annex II, IV and V species (Annex B)

Választási modellek 3

Angol Középfokú Nyelvvizsgázók Bibliája: Nyelvtani összefoglalás, 30 kidolgozott szóbeli tétel, esszé és minta levelek + rendhagyó igék jelentéssel

ELEKTRONIKAI ALAPISMERETEK ANGOL NYELVEN

Report on the main results of the surveillance under article 11 for annex II, IV and V species (Annex B)

DECLARATION OF PERFORMANCE No. GST REV 1.03 According to Construction Products Regulation EU No. 305/2011

ANNEX V / V, MELLÉKLET

Report on the main results of the surveillance under article 11 for annex II, IV and V species (Annex B)

AZ ERDÕ NÖVEKEDÉSÉNEK VIZSGÁLATA TÉRINFORMATIKAI ÉS FOTOGRAMMETRIAI MÓDSZEREKKEL KARSZTOS MINTATERÜLETEN

A jövőbeli hatások vizsgálatához felhasznált klímamodell-adatok Climate model data used for future impact studies Szépszó Gabriella

2016 év eleji ajánlatok. Speciális áraink 2016 május 31-ig érvényesek!

First experiences with Gd fuel assemblies in. Tamás Parkó, Botond Beliczai AER Symposium

Téli kedvezményes ajánlataink

FÖLDRAJZ ANGOL NYELVEN GEOGRAPHY

Report on the main results of the surveillance under article 11 for annex II, IV and V species (Annex B)

Szoftver-technológia II. Tervezési minták. Irodalom. Szoftver-technológia II.

Report on the main results of the surveillance under article 11 for annex II, IV and V species (Annex B)

Átírás:

ell Reports, Volume 21 Supplemental Information Pervasive within-mitochondrion Single-Nucleotide Variant Heteroplasmy as Revealed by Single-Mitochondrion Sequencing Jacqueline Morris, Young-Ji Na, Hua Zhu, Jae-Hee Lee, Hoa iang, Alexandra V. Ulyanova, ordon H. Baltuch, Steven Brem, H. Isaac hen, David K. Kung, imothy H. Lucas, Donald M. O'Rourke, John A. Wolf, M. Sean rady, Jai-Yoon Sul, Junhyong Kim, and James Eberwine

Supplemental Materials: Inventory of Supplement: Supplemental Figure 1. Sample lineage tree depicting the relationship between samples. his figure shows the pedigree of all of the samples processed in this study, with Figure 2 in the main text highlighting important relationships between a subset of samples derived from the same pedigree (mother). Supplemental Figure 2. Sample lineage tree scaled to SNV events along each lineage. his figure shows the mutational burden in each sample in the context of the relationships between samples (pedigree), with Figure 2 in the main text highlighting pedigrees with unusual mutational burden compared to the others. Supplemental Figure 3. Relationship between mouse or human single mitochondrion samples and allele identities at high confidence positions. his figure shows the mutations present for mother and pup at the high confidence positions identified in both mouse and human, which are characterized further in Figure 2 and Figure 3 in the main text, respectively. Supplemental able 1. Primer sequences used for PR amplification of mouse single mitochondrion genomic samples. his table provides the sequences of primers used to achieve the amplification strategy highlighted in Figure 1. Supplemental able 2. Primer sequences used for PR amplification of human single mitochondrion genomic samples. his table provides the sequences of primers used to achieve the amplification strategy highlighted in Figure 1. Supplemental Information. Experimental Methods and PR error model for calling single nucleotide variants. Additional experimental methods used in this work are presented here. Additionally, this supplement provides more detailed mathematical descriptions of how a threshold was set for calling a mutation as a single nucleotide variant.

Fig. S1. Sample lineage tree depicting the relationship between samples. Related to Figure 2. Each sample is represented by the generic code MxxDx[N/A/][h/c/b] where x is a number. Mx = mother ID. x = cell ID. x is appended with a lowercase letter for samples isolated from the same cell. Dx = dendrite ID, Dx = 0 for astrocytes. [N/A/] = cell type, where N=neuron, A=astrocyte, =population. [h/c/b] = brain region from which the sample was derived: h=hippocampus, c=cortex, b=cerebellum. A reference population sample is grouped by itself on the left of the single mitochondrion samples

Fig. S2. Sample lineage tree scaled to SNV events along each lineage. Related to Figure 2. he branch lengths of the pictured cell lineage tree are scaled by estimated mutation events along each branch. he Maximum Parsimony method (see Methods) of character reconstruction was used to estimate for each branch the mutation events for the positions with changes in the major allele as well as those positions with changes in the minor allele.

A I I () I I A() I,() I (), I A() I,() I (), M27178aD0Ac M27178bD0Ac M27179D0Ac M27186D4Nc M27186D1Nc M27181D0Ac M27182bD0Ac M27182aD0Ac M27182cD0Ac M27184cD1Nc M27184bD1Nc M27184aD1Nc M27185D1Nc M27183aD1Nc M27183bD1Nc M29188D0Ac M29190D0Ac M29189D1Nc M29187D0Ac M30191aD0Ac M30191bD0Ac 3816() 9027() A (A) A() A() A 9461() () () () () () () () () () () () () 16265() () () () () () () () () () B M11P1N (S1) M11P1N (S2) M11P1N (S3) M11P1N (S4) M11P1N (S5) M13X (S6) M12P1N (S7) M12P1N (S8) M15P1N (S9) M15P1N (S10) M15P1N (S11) M15P1N (S12) M15P1N (S13) M15P1N (S14) M15P1N (S15) M15P1N (S16) M15P1N (S17) M16X (S18) M17X (S19) M19X (S20) M110X (S21) 309() () () () () () () () () () () () () () () () () () () () () () Fig. S3. Relationship between mouse or human single mitochondrion samples and allele identities at high confidence positions. Related to Figure 2 and Figure 3. Sample lineage trees of mouse (A) or human (B) single mitochondrion with Major(minor) alleles indicated for each high confidence position. he reference allele is indicated in parentheses next to each position label. Major(minor) alleles are listed for mouse mother and pup at the beginning of each branch point for each mother for each position. here is a single entry when the mother and pup alleles agreed. Human sample code format is MxxPx where M =mother ID, = cell ID, P = process ID, =cell type where N = neuronal-like or X = non-neuronal, with simplified codes in parentheses.

able S1. Primer sequences used for PR amplification of mouse single mitochondrion genomic samples. Related to Figure 1. Version 1 Long PR Primers Forward Reverse Amplicon 1 AAAAAAAA AAAAA Amplicon 2 AAAAA AAAAA Amplicon 3 AAA AAAA Version 2 Long PR Primers Forward Reverse Amplicon 1 AAAAAAA AAAAAA Amplicon 2 AAAAA AAAA Amplicon 3 AAAAAAA AA Short PR primers Forward Reverse Set 1 AAAAAAAA AAA Set 2 AAAAA AA Set 3 AA AAAAA Set 4 AAAAA AAAAAAAAA Set 5 AAAA AAAAAAAAAA Set 6 AAAAA AAAAA Set 7 AAA AAA Set 8 AAAAA AAA Set 9 AAA AAAA Set 10 AAAAAAAA AA Set 11 AAAAAAA AA Set 12 AAAAAA AAA Set 13 AAAAAAA AAAAA Set 14 AAAAA AAAAA Set 15 AAA AA Set 16 AAAAAAAAAA AAAA Set 17 AAA AAAAA Set 18 AAA AA Set 19 AAAAAA AA Set 20 AAAAAAAAA AAAAA Set 21 AAAA AAAA Set 22 AAAAAAAA AA Set 23 AAAAA AAAAAAA Set 24 AAAAAA AAAAA Set 25 AAAAA AAAAAA Set 26 AAAAA AAAAA Set 27 AAAAAA AAAAA Set 28 AAA AAAA Set 29 AAAAAAA AAAAA Set 30 AAAAA AAAA Set 31 AAAAAAA AAAAA Set 32 AAAAAAA AAAAA Set 33 AAAAA AAAAAA Set 34 AAAAA AAAAAAA Set 35 AAAAA AAAA Set 36 AAAAA AAAA Set 37 AAAAAAA AAAAAA Set 38 AAAAAAAAA AAAA Set 39 AAAAA AA Set 40 AAAAAAA AAA Set 41 AAAAAA AAAA Set 42 AAAAA AAAAAA Set 43 AAAAA AAAAAAA Set 44 AAAA AAAAAAA Set 45 AAAAA AAAA Set 46 AAAAAAA AAA Set 47 AAA AAAAAA Set 48 AAAAAA AA Set 49 AAAAAAA AAAA Set 50 AAA AAAA Set 51 AAAAA AAAAAA Set 52 AAAAA AAAA Set 53 AAAAAA AAAA Set 54 AAAAAAA AAAA Set 55 AAAAAAA AAAAA Set 56 AAAAAA AA Forward and reverse primer sequences are listed for both the long PR (Round 1 PR) and short PR (Round 2 PR) reactions.

able S2. Primer sequences used for PR amplification of human single mitochondrion genomic samples. Related to Figure 1. Long PR Primers Forward Reverse Amplicon 1 AAAAA AAAA Amplicon 2 AAAAAAAA AAAAAA Amplicon 3 AAAAAA AAAAAAA Short PR primers Forward Reverse Set 1 AAAAA AA Set 2 AAAAAA AAAA Set 3 AAAA AAAA Set 4 AAAAAAAA AAAAAAAAA Set 5 AAAAA AAAA Set 6 AAAAA AAAAAA Set 7 AAAAAA AAAAAA Set 8 AAAAA AAAAA Set 9 AAAAAAAA AAAAAAA Set 10 AAAAA AAAAAA Set 11 AAAAAA AAAA Set 12 AAAAA AAAA Set 13 AAAAA AAAA Set 14 AAAAAAAA AAAA Set 15 AAAAA AAAA Set 16 AAAAA AAA Set 17 AAAA AAAAAA Set 18 AAAAAA A Set 19 AAAA AAAA Set 20 AAAAAA AAAAAAA Set 21 AAAAA AAAA Set 22 AAAAAA AA Set 23 AAA AAAA Set 24 AAAAAAAA AAAAA Set 25 AAAAAAA AA Set 26 AAAA AAAAAA Set 27 AAAAAA AA Set 28 AAAAAA AA Set 29 AAAAA AAAAAAA Forward and reverse primer sequences are listed for both the long PR (Round1 PR) and short PR (Round 2 PR) reactions.

Supplemental Experimental Procedures: Subjects. Mouse tissue was collected under University of Pennsylvania IAU Protocol# 804867. Postnatal day 1 pups from which cultures were derived were not sexed for these studies. Pups were not sexed because cultures were derived from multiple pups provided by a shared resource. Human brain tissue was collected at the Hospital of the University of Pennsylvania under Institutional Review Board approval # 816223, using standard operating procedures for enrollment and consent of patients for patient numbers 50 (63 year old female) and 8 (65 year old female). See Supplemental Materials. ell ulture. Primary cortical neuronal, astrocyte or combination cultures were plated and maintained as described previously with some modifications (Kaech and Banker, 2006). Briefly, cortical neurons and/or astrocytes were isolated from postnatal day 1 mouse pups and dissociated with trypsin followed by pipetting prior to live cell counting with trypan blue and plating on poly-d-lysine/laminin coated 12-mm coverslips. Neuronal cultures were maintained in MEM with B27 supplement, astrocyte cultures in MEM with 10% FBS, co-cultures in MEM with B27 supplement and 1% FBS. ultures of human cells were prepared and maintained as described previously (Spaethling et al., 2017). A 5 5 5 mm block of cortical tissue was resected as part of a neurosurgical procedure for the treatment of epilepsy or brain tumors. For transport (approximately 10 minutes from surgery to lab), this tissue was immediately transferred to a sterile tube containing ice-cold asf (2 mm al 2-2H 2 O, 10 mm glucose, 3 mm Kl, 26 mm NaHO 3, 2.5 mm NaH 2 PO 4, 1 mm Mgl 2-6H 2 O, and 202 mm sucrose, perfused with 95%/5% O 2 /O 2 ). Upon receipt, tissue was digested with papain (20 U, Worthington Biochemical) and incubated for 10 15 min at 37. he reaction was stopped by addition of Leupeptin (a papain inhibitor, 100 µm, Sigma-Aldrich. Dissociated tissue was centrifuged (1,500 rpm for 3 min) and the supernatant subjected to gentle mechanical dissociation with a fire-polished glass Pasteur pipette. Live cells were stained with trypan blue and counted using an Autocounter (Invitrogen). ells were plated on poly-l-lysine-coated 12-mm coverslips at a density of 3 10 4 cells/coverslip. ultures were maintained at 37 under 95% humidity and 5% O 2 in medium (Neurobasal supplemented with B27 (1%) and penicillin/streptomycin (1%). he medium was changed by replacing 50% with fresh medium every 3 days. Primer Design. All primers were designed by exhaustive computational analysis to minimize secondary structure and cross interactions and to maintain compatible melting temperatures within each set of PR primers. he nested set of long PR primers were designed by first generating three sets of primers that would create amplicons of approximately 5 Kb such that each set was offset from each other. We then designed short PR primers again with overlapping offsets such that the set of amplicons exhaustively covered the long PR amplicons (Fig. 1B). Mitotracker Loading and imaging of cells. One coverslip of cells was transferred to one well of a 4 well dish containing 25 nm Mitotracker Red MXRos (hermofisher Scientific, M-7512) in 500 µl 50% conditioned/50% fresh media. Mitotracker loading was performed at 37 with 5% O 2 for 20 minutes before transferring the coverslip to fresh 50% conditioned/50% fresh media and returning to the incubator for 10 minutes. he coverslip was then transferred to a low profile imaging chamber for imaging and sample collection. Samples were imaged with a Zeiss 710 LSM using the Alexa568 preset filter settings to image Mitotracker Red MXRos and transmitted light for cell morphology. Samples were continuously perfused with normal saline (140 mm Nal, 5.4 mm Kl, 1 mm Mgl 2, 2 mm al 2, 16 mm glucose, 10 mm HEPES, ph 7.3) in between isolations. Mitochondrial DNA Library construction and Sequencing. Library construction was performed using the Ilumina ruseq DNA nano library construction kit according to manufacturer s instructions. Successful short PR reactions were pooled together and cleaned with a 0.5x AMPURE XP bead clean up. Based on concentrations of the pooled short PR reactions obtained from running samples on an Agilent DNA 7500 chip on the Agilent Bioanalyzer, 247 ng of each pool was made up in a final volume of 130 µl of resuspension buffer, transferred to an AFA Fiber Pre-Slit Snap-ap 6x16 mm microube (ovaris 520045) and sheared on a ovaris S2 focusedultrasonicator with the following settings to achieve an average insert size of 350 bp: Duty ycle:10%, Intensity:5, ycles/burst:200, ime: 45s). Sheared DNA was then quantified on an Agilent High Sensitivity DNA kit on the Agilent Bioanalyzer or the Agilent High Sensitivity D5000 kit on the Agilent apestation 2200 before inputting into the 350 bp insert workstream of the library kit protocol. Single mitochondrion libraries were pooled 24 samples per lane and run in rapid mode on a HiSeq 2500 with 2x101bp paired end reads. Variant calling from RNASeq. he SNPiR pipeline (Piskol et al., 2013) was implemented as follows: he burrows-wheeler aligner (BWA) was used to map RNA-Seq reads derived from sequencing of mouse or human single cells using the commands bwa aln fastqfile and bwa samse n4. We only considered positions covered by the RNA-Seq for variant calling if the following criteria were met: (A) read depth coverage > 50, (B) basequality >

0, () mappingquality > 0, (D) read frequency of minor allele 0.04%. SAMtools rmdup (Li et al., 2009) was used to remove identical reads derived from PR duplicates that mapped to the same location. Effect of variants on trna and protein structure. For position 3816, located in the gene encoding the mitochondrial trna for glutamine, the secondary structure was obtained from the mitotrnadb maintained by the Universities of Leipzig and Strasbourg and the output structure used to indicate the location of the variable position in this trna (Jühling et al., 2009). For position 9027, located in the gene encoding mitochondrial cytochrome c oxidase subunit 3, the secondary and tertiary structure of this subunit was modeled by inputting UnitProt entry number P00416 (OX3_MOUSE) into the protein model portal for structural analysis (Schwede et al., 2009). he structure was inferred from the 85% sequence homology with bovine (Bos aurus) heart cytochrome c oxidase protein in the fully oxidized state (1V54), whose structure was determined by x-ray crystallography at a resolution of 1.8 angstroms. At this level of structure resolution and sequence identity, the structural predications for the target (here the mouse protein) are typically correct. his particular subunit of the OX enzyme is highly conserved across Eukaryotes based on protein sequence alignments. PR error model. Here we use a model-based calculation to derive the probability of observing variant allele whose read frequency is greater than some threshold value, θ. Assume N cycles of PR reactions resulting in 2 N total molecules. If a mutation occurs in the 1 st PR cycle, then 50% of the final molecules will have this mutation. In general, if a mutation happens in the ith cycle,!!!! =!! 2!! proportion of the final molecules will have this mutation. Let n be the average sequencing depth, then we assume that out of 2 N total final molecules (including the library prep PR cycles), we have sampled n total molecules. Let X be the number of sequenced reads carrying a non-reference mutation. We wish to compute P X > nθ n, i. Let P(i) denote the probability of a PR mutation in the ith cycle. hen P X > nθ n =!!!! P X > nθ n, i P(i) (1) he probability P(i) of at least one mutation in the ith cycle is P(i) = 1 (1 ε)!!!! (2) where ε is the PR polymerase mutation rate. We note that (1) ignores the O(ε! ) probability of convergent multiple mutations in the ith cycle. he term!! =! 2!!! denotes the number of lineages in the ith cycle where mutations may occur, where ½ is due to semi-conservative replication. aq polymerase error rates are estimated to be between 1 10!! to 1 10!!. herefore, using (2) for aq, P(i)\ycle 1 2 3 4 Min 1 10!! 2 10!! 4 10!! 8 10!! Max 1 10!! 2 10!! 4 10!! 8 10!! he probability of sampling a molecule carrying a mutation from the ith cycle is 2!! and we use the normal approximation to the binomial sampling process and assume the number X of sequenced molecules with variant mutation is X~N(n 2!!, n 2!! 1 2!! ). In our experiments, our average sequencing depth was ~17,500; or, the number of molecules sampled was n = 17,500. Under these parameters P X > nθ n, i is approximately 0 for values of i > 3. For values of i 3, P X > nθ n, i is either approximately 1 or 0 for various values of θ. able below shows the values of P X > nθ n, i for θ = 0.1, 0.2, 0.3, 0.4, and 0.5. ycle i\θ 0.1 0.2 0.3 0.4 0.5 1 ~1 ~1 ~1 ~1 0.5

2 ~1 ~1 ~0 ~0 ~0 3 ~1 ~0 ~0 ~0 ~0 4 ~0 ~0 ~0 ~0 ~0 ombining the two tables in Equation (1), we obtain the following table for P(X > nθ n = 17,500): able 1: aq-related probability of observing false positive variant at frequency greater than θ. P X > nθ n, i \θ 0.1 0.2 0.3 0.4 0.5 Min 7 10!! 3 10!! 1 10!! 1 10!! 0.5 10!! Max 7 10!! 3 10!! 1 10!! 1 10!! 0.5 10!! For the enzyme Q5, the estimates of error rates are more than 100-fold lower. If Q5 enzyme is used in the 1 st four PR cycles, able 1 can be simply scaled to able 2: Q5-related probability of observing false positive variant at frequency greater than θ. P X > nθ n, i \θ 0.1 0.2 0.3 0.4 0.5 Min 7 10!! 3 10!! 1 10!! 1 10!! 0.5 10!! Max 7 10!! 3 10!! 1 10!! 1 10!! 0.5 10!! Shared SNV sites for multiple mitochondria. Based on the PR-derived false positive rate per site, the probability of observing k or more samples with a variant call at a particular site by misincorporation error is approximately n, k 5.13 10!!!, where n =118 is the number of samples in our mouse dataset. Using conservative Bonferroni correction over the ~16,000 sites of the mitochondrial genome, the probability, p, of observing k = 2, 3, 4 samples with a variant call are, p > 1, 0.5, 0.0085, respectively. For human samples with n =22, the Bonferroni corrected probability of k = 2, 3, 4 samples are p = 0.1896, 0.000126, and > 10-7, respectively. Supplemental References: Kaech, S., Banker,. (2006). ulturing hippocampal neurons. Nat. Protoc. 1, 2406-2415. Li, H., Durbin, R. (2009). Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754-1760. Piskol, R., Ramaswami,., Li, J.B. (2013). Reliable identification of genomic variants from RNA-Seq data. Am. J. Hum. enet. 93, 641-651. Jühling, F., Mörl, M., Hartmann, R.K., Sprinzl, M., Stadler, P.F., Pütz, J. (2009). trnadb 2009: compilation of trna sequences and trna genes. Nucleic Acids Res. 37, D159-D162. Schwede,., Sali, A., Honig, B., Levitt, M., Berman, H.M., Jones, D., Brenner, S.E., Burley, S.K., Das, R., Dokholyan, N.V. et al. (2009). Outcome of a Workshop on Applications of Protein Models in Biomedical Research. Structure 17, 151-159.