Detecting errors in mtDNA data by phylogenetic analysis

H. J. Bandelt, P. Lahermo, M. Richards, V. Macaulay

Research output: Contribution to journalArticle

160 Citations (Scopus)

Abstract

Sequencing and documenting a sample of homologous DNA stretches is prone to copying errors in a way rather analogous to the biological replication process. Previous attempts at obtaining representative mtDNA sequences, typically of the control region, for evolutionary studies or forensic purposes have yielded rather unsatisfactory results in many cases. The key ingredient in pinpointing problems with given data is the phylogenetic analysis of closely related mtDNAs within the framework of an established worldwide phylogeny that is supported by coding region information. We develop some general rules by which likely errors in data tables can readily be detected without rereading whole sequences repeatedly. Following these guidelines, one can expect to lower the error rate by at least an order of magnitude, although it will still be hard to beat the mitochondrial gamma polymerase in precision.

LanguageEnglish
Pages64-69
Number of pages6
JournalInternational Journal of Legal Medicine
Volume115
Issue number2
DOIs
Publication statusPublished - Oct 2001

Fingerprint

Biological Phenomena
Phylogeny
Mitochondrial DNA
Guidelines
DNA
phylogeny

Cite this

Bandelt, H. J. ; Lahermo, P. ; Richards, M. ; Macaulay, V. / Detecting errors in mtDNA data by phylogenetic analysis. In: International Journal of Legal Medicine. 2001 ; Vol. 115, No. 2. pp. 64-69.
@article{298dec061f274efbbeecb27f9c3b6f0d,
title = "Detecting errors in mtDNA data by phylogenetic analysis",
abstract = "Sequencing and documenting a sample of homologous DNA stretches is prone to copying errors in a way rather analogous to the biological replication process. Previous attempts at obtaining representative mtDNA sequences, typically of the control region, for evolutionary studies or forensic purposes have yielded rather unsatisfactory results in many cases. The key ingredient in pinpointing problems with given data is the phylogenetic analysis of closely related mtDNAs within the framework of an established worldwide phylogeny that is supported by coding region information. We develop some general rules by which likely errors in data tables can readily be detected without rereading whole sequences repeatedly. Following these guidelines, one can expect to lower the error rate by at least an order of magnitude, although it will still be hard to beat the mitochondrial gamma polymerase in precision.",
keywords = "Error detection, Mitochondrial DNA, Phylogenetic analysis, Sequence data, Types of error",
author = "Bandelt, {H. J.} and P. Lahermo and M. Richards and V. Macaulay",
year = "2001",
month = "10",
doi = "10.1007/s004140100228",
language = "English",
volume = "115",
pages = "64--69",
journal = "International Journal of Legal Medicine",
issn = "0937-9827",
publisher = "Springer Verlag",
number = "2",

}

Detecting errors in mtDNA data by phylogenetic analysis. / Bandelt, H. J.; Lahermo, P.; Richards, M.; Macaulay, V.

In: International Journal of Legal Medicine, Vol. 115, No. 2, 10.2001, p. 64-69.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Detecting errors in mtDNA data by phylogenetic analysis

AU - Bandelt, H. J.

AU - Lahermo, P.

AU - Richards, M.

AU - Macaulay, V.

PY - 2001/10

Y1 - 2001/10

N2 - Sequencing and documenting a sample of homologous DNA stretches is prone to copying errors in a way rather analogous to the biological replication process. Previous attempts at obtaining representative mtDNA sequences, typically of the control region, for evolutionary studies or forensic purposes have yielded rather unsatisfactory results in many cases. The key ingredient in pinpointing problems with given data is the phylogenetic analysis of closely related mtDNAs within the framework of an established worldwide phylogeny that is supported by coding region information. We develop some general rules by which likely errors in data tables can readily be detected without rereading whole sequences repeatedly. Following these guidelines, one can expect to lower the error rate by at least an order of magnitude, although it will still be hard to beat the mitochondrial gamma polymerase in precision.

AB - Sequencing and documenting a sample of homologous DNA stretches is prone to copying errors in a way rather analogous to the biological replication process. Previous attempts at obtaining representative mtDNA sequences, typically of the control region, for evolutionary studies or forensic purposes have yielded rather unsatisfactory results in many cases. The key ingredient in pinpointing problems with given data is the phylogenetic analysis of closely related mtDNAs within the framework of an established worldwide phylogeny that is supported by coding region information. We develop some general rules by which likely errors in data tables can readily be detected without rereading whole sequences repeatedly. Following these guidelines, one can expect to lower the error rate by at least an order of magnitude, although it will still be hard to beat the mitochondrial gamma polymerase in precision.

KW - Error detection

KW - Mitochondrial DNA

KW - Phylogenetic analysis

KW - Sequence data

KW - Types of error

UR - http://www.scopus.com/inward/record.url?scp=0034778633&partnerID=8YFLogxK

U2 - 10.1007/s004140100228

DO - 10.1007/s004140100228

M3 - Article

VL - 115

SP - 64

EP - 69

JO - International Journal of Legal Medicine

T2 - International Journal of Legal Medicine

JF - International Journal of Legal Medicine

SN - 0937-9827

IS - 2

ER -