Barcoding's next top model: an evaluation of nucleotide substitution models for specimen identification

dc.contributor.authorCollins, R
dc.contributor.authorBoykin, L
dc.contributor.authorCruickshank, R
dc.contributor.authorArmstrong, Karen
dc.date.accessioned2016-09-12T02:49:53Z
dc.date.available2012-01-23
dc.date.issued2012-06
dc.date.submitted2011-11-15
dc.description.abstract1. DNA barcoding studies use Kimura's two-parameter substitution model (K2P) as the de facto standard for constructing genetic distance matrices. Distances generated under this model then provide the basis for most downstream analyses, but uncertainty in model choice is rarely explored and could potentially affect how reliably DNA barcodes discriminate species. 2. Using information-theoretic approaches for a data set comprising 14 472 DNA barcodes from 14 published studies, we tested whether the K2P model was a good fit at the species level and whether applying a better fitting model biased error rates or changed overall identification success. 3. We report that the K2P was a poorly fitting model at the species level; it was never selected as the best model and very rarely selected as a credible alternative model. Despite the lack of support for the K2P model, differences in distance between best model and K2P model estimates were usually minimal, and importantly, identification success rates were largely unaffected by model choice even when interspecific threshold values were reassessed. 4. Although these conclusions may justify using the K2P model for specimen identification purposes, we found simpler metrics such as p distance performed equally well, perhaps obviating the requirement for model correction in DNA barcoding. Conversely, when incorporating genetic distance data into taxonomic studies, we advocate a more thorough examination of model uncertainty.
dc.format.extentpp.457-465, 9 pages
dc.identifierhttps://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=elements_prod&SrcAuth=WosAPI&KeyUT=WOS:000304902500001&DestLinkType=FullRecord&DestApp=WOS_CPL
dc.identifier.citationCollins, R. A., Boykin, L. M., Cruickshank, R. H. and Armstrong, K. F. (2012). Barcoding's next top model: an evaluation of nucleotide substitution models for specimen identification. Methods in Ecology and Evolution, 3: 457–465.
dc.identifier.doi10.1111/j.2041-210X.2011.00176.x
dc.identifier.eissn2041-210X
dc.identifier.issn2041-210X
dc.identifier.other953VX (isidoc)
dc.identifier.urihttps://hdl.handle.net/10182/7336
dc.languageen
dc.language.isoen
dc.publisherBritish Ecological Society
dc.relationThe original publication is available from British Ecological Society - https://doi.org/10.1111/j.2041-210X.2011.00176.x - http://onlinelibrary.wiley.com/doi/10.1111/j.2041-210X.2011.00176.x/full
dc.relation.isPartOfMethods in Ecology and Evolution
dc.relation.urihttps://doi.org/10.1111/j.2041-210X.2011.00176.x
dc.rights© 2012 The Authors. Methods in Ecology and Evolution © 2012 British Ecological Society
dc.subjectAkaike information criterion
dc.subjectDNA barcoding
dc.subjectKimura-two-parameter
dc.subjectmodel selection
dc.subjectpairwise distances
dc.subjecttaxonomy
dc.subject.anzsrc2020ANZSRC::3103 Ecology
dc.subject.anzsrc2020ANZSRC::3109 Zoology
dc.subject.anzsrc2020ANZSRC::4104 Environmental management
dc.titleBarcoding's next top model: an evaluation of nucleotide substitution models for specimen identification
dc.typeJournal Article
lu.contributor.unitLU
lu.contributor.unitLU|Agriculture and Life Sciences
lu.contributor.unitLU|Agriculture and Life Sciences|ECOL
lu.contributor.unitLU|OLD BPRC
lu.contributor.unitLU|Research Management Office
lu.contributor.unitLU|Research Management Office|OLD QE18
lu.identifier.orcid0000-0003-4973-2651
pubs.issue3
pubs.publication-statusPublished
pubs.publisher-urlhttp://onlinelibrary.wiley.com/doi/10.1111/j.2041-210X.2011.00176.x/full
pubs.volume3
Files