DNA identification by pedigree likelihood ratio accommodating population substructure and mutations
- Jianye Ge^{1, 2}Email author,
- Bruce Budowle^{1, 2} and
- Ranajit Chakraborty^{1, 2}
DOI: 10.1186/2041-2223-1-8
© Ge et al; licensee BioMed Central Ltd. 2010
Received: 29 September 2009
Accepted: 4 October 2010
Published: 4 October 2010
Abstract
DNA typing is an important tool in missing-person identification, especially in mass-fatality disasters. Identification methods comparing a DNA profile from unidentified human remains with that of a direct (from the person) or indirect (for example, from a biological relative) reference sample and ranking the pairwise likelihood ratios (LR) is straightforward and well defined. However, for indirect comparison cases in which several members from a family can serve as reference samples, the full power of kinship analysis is not entirely exploited. Because biologically related family members are not genetically independent, more information and thus greater power can be attained by simultaneous use of all pedigree members in most cases, although distant relationships may reduce the power. In this study, an improvement was made on the method for missing-person identification for autosomal and lineage-based markers, by considering jointly the DNA profile data of all available family reference samples. The missing person is evaluated by a pedigree LR of the probability of DNA evidence under alternative hypotheses (for example, the missing person is unrelated or if they belong to this pedigree with a specified biological relationship) and can be ranked for all pedigrees within a database. Pedigree LRs are adjusted for population substructure according to the recommendations of the second National Research Council (NRCII) Report. A realistic mutation model was also incorporated to accommodate the possibility of false exclusion. The results show that the effect of mutation on the pedigree LR is moderate, but LRs can be significantly decreased by the effect of population substructure. Finally, Y chromosome and mitochondrial DNA were integrated into the analysis to increase the power of identification. A program titled MPKin was developed, combining the aforementioned features to facilitate genetic analysis for identifying missing persons. The computational complexity of the algorithms is explained, and several ways to reduce the complexity are introduced.
Background
Over the past two decades, forensic DNA typing has become widely accepted as a powerful tool in criminal and civil investigations. This technology has become invaluable in many missing-person identifications. There are a number of scenarios in which person identification is required: these include cases of war victims found in mass graves, missing soldiers or military personnel from past wars, people missing due to dynamic social reasons (for example, murder), remains from mass disasters due to natural catastrophes or terrorism attacks (for example, airplane crashes, the World Trade Center tragedy and the southeast Asia tsunami) and basic paternity testing. In attempts to identify these individuals, DNA profiles from unidentified people may be compared with direct reference samples of the missing person (antemortem samples), such as buccal swabs collected before their disappearance, or items they have used, such as toothbrushes, hairbrushes or preserved dental casts. In some cases, direct comparisons are not possible because an antemortem sample is not available, or the chain of custody may not be established reliably, reducing the confidence in an association. Alternatively, a missing person may be identified by kinship analysis using family reference samples (biological relatives such as parents, offspring, siblings or cousins) of the person to be identified.
Traditionally, statistical inference was based on pairwise comparison of the DNA profiles of the unknown sample and a single family reference sample, and then ranking the likelihood ratios (LRs) for specified biological relationships. Numerous statistical methods are available for evaluation of kinship between individuals. Li and Sacks [1] first provided a general method to obtain the conditional probability for any pair of relatives. Jacquard [2] described the most general method for a pairwise relationship using nine condensed identity states. Thompson [3] pioneered the maximum likelihood method by summarizing the k coefficients for major pairwise relationships, which were the probabilities that two individuals might have 0, 1 or 2 genes identical by descent. However, a pairwise comparison does not exploit the potential full power for identification, because it does not take into account all genetic information jointly when multiple family reference samples are available. Substantial progress in the past few years has been made in the determination of missing-person identity by pedigree kinship analysis [4–8]. Lau et al.[8] used standard parentage analysis, which includes both parents of a missing person, for the identification of victims of the Indian Ocean tsunami disaster of 2004. Buckleton et al.[9] discussed pedigree LR calculations with adjustments for population substructure effects with a few simple examples; however, no detailed algorithm for pedigree LR was given. Drabek [10] reviewed current software used for kinship analysis and reported that a number of software programs can provide the function to calculate pedigree likelihoods but they do not all offer comprehensive approaches. Dawid et al.[11] used a Bayesian network for identification using pedigree information, which incorporated the possibility of mutation, but with no adjustment for population substructure. DNAView [12] can calculate pedigree LRs without population substructure correction. For simple paternity cases, a short tandem repeat (STR) mutation model was implemented, which requires users to specify how rare it is for a mutational event of ≥ 2 steps to occur. For complex pedigrees (kinship), DNAView implemented an 'AABB' model, which simply assigns 'PI = μ', where μ is the locus-specific mutation rate. As stated in the DNAView manual [13], this model is 'a very crude way' (page 92) and could lead to 'a gross underestimate' (page 110). Hepler et al.[14] did incorporate population substructure into HUGIN (Handling Uncertainty In General Inference) but did not address mutation. Familias [15] does address both population substructure and mutation, but the mutation models are not appropriate for human STR loci. The 'equal probability model' and 'proportional model' used in Familias are not necessarily the best for STR loci [16, 17], and the 'decreasing model' includes a parameter (number of 'possible' alleles) that cannot be determined, because mutation probability is not related to allele frequency and the number of possible alleles [16, 17].
In all the above approaches, the details of genotype inference for the untyped family members in the reference pedigree were not disclosed, especially when both population substructure and mutation were incorporated. The computational complexity of the pedigree LR was not presented, and only autosomal loci were considered in the identification calculation. In addition, it has not been recognized in these studies that the mutation rates for generating integer and fractional STR alleles are different.
In this study, we combined pedigree analysis, population substructure and mutation analysis, and developed a method to calculate pedigree LR based on the classic Elston-Stewart (ES) algorithm [18]. To facilitate the use of the described pedigree analysis, a software program (MPkin) was developed. Population substructure was incorporated to comply with recommendation 4.1 in the NRCII Report [19]. A realistic mutation model is also embedded to address potential mismatches between true biological relatives, so that the method will yield a LR for any pedigree, although the number could be very small for pedigrees with multiple large-step mutations. Ge et al.[20] previously described the basic idea of the method to calculate pedigree LR with examples in absence of population substructure and mutation. Because reference family member(s) may not be available to type, the details of the methods used to infer the genotypes of untyped references were discussed. The theoretical computation complexity for pedigrees with inferred genotypes for untyped family members was analyzed. Several approaches were introduced to reduce the exponential computation complexity caused by untyped individuals in a family pedigree. The computational complexity of pedigree likelihood ratio (PLR) calculations with population substructure and/or mutations was compared. In addition, calculation of LRs for Y chromosome haplotypes and mitochondrial (mt)DNA haplotypes was performed, which can be directly combined with LR of autosomal STRs under the assumption of independence.
Method
General principle
To evaluate whether a missing person (MP) belongs to a family pedigree (P), one or more reference family members from the putative pedigree are typed. Identification is assessed by comparing two alternative hypotheses: (i) H_{p}: MP is the specific member of the putative pedigree and (ii) H_{d}: MP is unrelated to the known reference members of the putative pedigree.
where G_{ MP } refers to the DNA profile of the missing-person (from remains) and G_{ P } is the joint DNA profile of all typed family members in the pedigree, computed conditions imposed by the hypotheses H_{p} and H_{d}, respectively. H_{p} is favored if the LR is > 1; when the LR is < 1, H_{d} is better supported. For H_{p}, the position of MP in P is usually fixed. However, several scenarios could apply to H_{d}; for example, the biological mother but not biological father of MP is already in P, MP is a half sibling but not a full sibling of someone in P, or MP is not related to anyone in P. Multiple LRs can be compared in terms of different H_{d}. If no prior information of MP is provided to specify H_{d}, MP may be regarded as not related to anyone in P.
Pedigree likelihood algorithm
in which G_{ i } represents the genotype (at a specific locus) of the i-th person of a pedigree, and each member is classified as either a founder (that is, a person without antecedent relatives in the pedigree, with their genotype represented as G_{ founder } ), or an offspring (G_{ o } ) from a given mother (G_{ m } ) and father (G_{ f } ). The locus-specific likelihood (L) of a pedigree is the summation over all possible genotype combinations, G_{ i } , for each member (of course, for the typed members in the pedigree, the observed genotypes are considered as the only possibility). Within the summation, the probability of each possible genotype combination of a pedigree is computed as the product of two factors: (i) joint probability of all founder genotypes, Pr(∏G_{ founder }) and the product of each of the probabilities of offspring genotypes conditional on parental genotypes for trios, Pr(G_{ o } |G_{ f } , G_{ m } ) or (ii) the probability of allele transmission in the pedigree. Computed in this fashion, values for L across all loci are multiplied to get a combined L value, denoted by Pr(G_{ MP }, G_{ P }), which is in turn used in the final LR calculation (see equation 1). The computational complexity of the ES algorithm increases linearly with the number of trios in a complete pedigree (that is, a pedigree with all family members typed).
Using this algorithm as the general rule of pedigree likelihood evaluation, under the hypothesis H_{ p } , evaluation of Pr(G_{ MP }, G_{ P } |H_{ p }) in equation (1) is performed with the genotype, G_{ MP } , of the missing person (from remains) as the genotype of their presumed position in the pedigree. By contrast, under the hypothesis H_{ d } , MP is simply an unrelated individual to any other reference family members.
Genotype inference of untyped persons
In some situations, not all family members of a reference pedigree may be typed. Genotypes of these untyped individuals can only be inferred from those of the typed relatives, such as parents (one or both of them typed), offspring and spouse, under the assumption that the untyped family members are truly the designated biological relative specified. For example, for two parents with the genotypes {11, 12} and {13, 14}, the possible genotypes of their offspring, barring mutations, are {11, 13}, {11, 14}, {12, 13} and {12, 14}. Likewise, without any mutation, given a mother that has genotype {11, 12} with two of her offspring being {11,13} and {11,14}, the biological father of the children is inferred as {13,14}.
If a mutation is a possible consideration even for individuals with typed parents or offspring, the genotype of untyped individuals theoretically can be all possible genotypes at that locus. However, not all genotypes need to be inferred for each untyped individual in the pedigree. The computational complexity can be reduced by reducing the number of individuals with inferred genotypes. For nuclear families with a single offspring and a single typed parent, the genotypes of the untyped parent are not needed. For nuclear families with several offspring, the genotypes of both parents should be inferred, because the probabilities of allele transmissions from the untyped parent to multiple offspring are not independent. For example, for a family with a single typed parent {10, 11} and two offspring {10, 12} and {10, 13}, the transmission probability is not equal to 1/2*Pr(12)*1/2*Pr(13). The preferred approach is to infer the genotype of the untyped parent, {12, 13} and then calculate the transmission probability (1/16) based on the genotypes of both parents.
To reduce computational complexity, an untyped individual, defined as one whose genotypes has to be inferred, is termed an untyped connector (UC), which includes (1) untyped founders with > 1 offspring, (ii) untyped founders with a single offspring but an untyped spouse, (iii) untyped non-founders who are not leaf or bottom nodes in the pedigree tree.
Population substructure correction
where p(A) is the allele frequency of allele A[21, 22]. According to the NRCII recommendation, θ is set at 0.01 for large populations and 0.03 for small, isolated populations, but can be set to population- and even locus-specific θ values.
where O_{ i } is the i- th OSTP. Each O_{ i } has two possibilities: the first or the second allele is a founder allele. In this situation, the likelihood of founders and the likelihood of transmission cannot be clearly separated, because they are not independent.
Mutation correction
where α is the probability of being a one step mutation and μ is the mutation rate of the locus. Equal probabilities for gaining or losing repeats are assumed.
According to the AABB annual report [29], > 95% of mutations result in one-step differences, hence α was set at 0.95; mutations of > 2 steps are unlikely, but several mutation steps are allowed in this model. The mutation rates of the forensically used STR loci are on the order of 10^{-3} to 10^{-4} per locus per generation [29, 30]. Moreover, as the number of members in a pedigree and the number of STR loci used for analysis increase, the chance of detecting a mutation increases. Hence, the potential for mutation must be accommodated. Moreover, because males have higher mutation rates than females [29], different locus-specific mutation rates must be used for the father and mother within a pedigree.
The mechanism of mutations between integer (for example, 9) and fractional (for example, 10.2) STR alleles is different from slippage-based mutation. The probability of a partial repeat mutation should be lower than the average STR mutation rates and higher than the SNP mutation rates (for example, 10^{-8}). Because there are no data on partial repeat mutations, we arbitrarily set the probability at 10^{-5}, but further investigations are needed to establish a more meaningful probability.
Y chromosome and mtDNA
Autosomal STRs, Y chromosome haplotypes and mtDNA haplotypes do not display departures from expectations of independence, except when notable levels of substructure were detected [31, 32]. Hence, LRs of Y and mtDNA haplotypes can be directly combined together and with LRs for autosomal STRs. The general principle of calculating LRs of Y and mtDNA haplotypes is the same as that for autosomal markers, which compares the likelihoods of missing-person and putative pedigree haplotypes given H_{p} (that is, the product of multiple haplotype transmission probabilities) or H_{d} (that is, haplotype frequency in a population).
There may be multiple Y or mtDNA references in a putative pedigree. Only the closest available relatives are considered, because minimum transmission reduces the probability of mutations. The number of haplotype transmissions is used to determine the closest relatives. For example, if Y haplotypes are available for the father and uncles of a putative missing person, only the haplotype of the father is considered. Father-offspring is the closest relationship for Y chromosome markers with only one haplotype transmission; followed by full siblings, grandchildren or grandparents. For mtDNA haplotypes, the same logic applies to the maternal lineage. The transmission probability between Y haplotypes is the product of the transmission probability of alleles at each locus under the Two Phase Mutation Model. For mtDNA haplotypes, the transmission probability of two mtDNA haplotypes with > 2 nucleotide differences is 0; otherwise, it is 1 [33].
Discussion
Computational complexity analysis
If a pedigree can be separated into several independent subpedigrees, the complexity will be further reduced by using one multiplication between likelihoods of two subpedigrees instead of several multiplications for several NGUC values. For many cases of likelihood calculations for a given H_{d}, this partition could significantly reduce the complexity.
Effect of population substructure and mutation
Running time (seconds) for 20,000 simulations
Co-ancestry coefficient | ||
---|---|---|
θ = 0 | θ = 0.01 | |
No mutation | 1,140 | 1,344 |
With mutation | 6,893 | 12,946 |
The likelihood ratios are also compared in the absence and presence of population substructure, with and without mutation by the same simulated pedigrees used above (Figure 3). The differences of logarithms of pedigree LRs [Log_{10}(LR)] were plotted. Differences in Log_{10}(LR)s in the absence and presence of mutation are usually < 0.1. Clearly, mutation has a limited effect on the LR ratio, because of the relatively low probability of mutation in a single case scenario. Often LRs with the mutation model are slightly lower than those that do not consider a mutation model, because the transmission probabilities between two identical alleles with the mutation model are < 1. However, incorporation of mutation is necessary to avoid possible false exclusions if mutations do exist in pedigrees. Population substructure substantially decreases the LRs, mostly from 2 to 100 times less than a scenario in which the absence of population substructure is assumed, as in this example. In other words, population substructure has a notable effect on the LR, and should be included in most identification cases.
Mutation model
The probabilities of gaining or losing steps depend on the number of steps between alleles [37]; for now equal mutation rates are assumed for gaining or losing steps in our model, but this can be adjusted if desired, which may be necessary as additional data on STR mutation patterns are developed. Currently, silent/null alleles are not considered because it is difficult to determine if a null allele exists in a complex pedigree with multiple untyped individuals.
Allele frequency
A minimum allele frequency rule was adopted to accommodate identity-testing requirements. If the frequency of an allele is < 5/2n (a threshold value supported by Budowle et al.[38], with n being the sample size of the locus), then the frequency of the allele will be automatically raised to 5/2n. Invoking a minimum allele frequency threshold will result in the sum of the allele frequencies being > 1, which in turn will make the LR more conservative than other approaches. For example, Familias [15] normalized the allele frequencies so that the sum of the allele frequencies is 1, which may change the allele frequencies slightly and lead to different LR results compared with a minimal allele frequency correction approach (see Additional file 1). If the same allele frequencies are used and the sum of allele frequencies is 1 for a locus, Familias produces the same LR as MPKin in the absence of mutations.
Validation
The software MPKin was validated in part with assistance from the International Commission on Missing Persons (ICMP). LRs of each locus of three pedigrees calculated by DNAView, Familias and MPKin were identical. Familias and MPKin can further calculate LRs accommodating population substructure and mutation. Familias may also give LR with mutations, but the mutation models used in Familias are not applicable to human STRs (see Additional file 1).
Conclusion
In summary, this study provides a descriptive approach to assist the forensic DNA community in person identification for complex forensic identity and paternity testing cases. This process evaluates the putative biological relationships of individuals by calculating and ranking pedigree LRs for multiple putative pedigrees. Currently, this approach does not address linked markers because the linkage adjustments currently are unnecessary for forensic autosomal STRs. Adjustments for population substructure and mutation are incorporated, and LR values can be provided for any pedigree even with multiple large-step mutations. There is no limit on pedigree structure (for instance, incest can be addressed) and number of family members. However, because of the complexity of computation, incorporating multiple UCs can take time; for instance, MPkin can accommodate up to 4-5 UCs with population substructure and mutation in a reasonable time (several hours for a 13-STR pedigree with 4-5 UCs). The computation time can be decreased by ignoring large-step mutations. Fortunately, most forensic cases do not exceed this limitation. Lastly, the process described here can accommodate autosomal loci, Y chromosome haplotypes and mtDNA haplotypes. Independence across these three genetic marker systems is assumed.
Declarations
Acknowledgements
This study was supported by the US Public Health Service research grant GM 41399 from the US National Institutes of Health. We thank Dr Tsewei Wang and Dr Douglas Birdwell from University of Tennessee for their comments on the manuscript.
Authors’ Affiliations
References
- Li CC, Sacks L: The derivation of joint distribution and correlation between relatives by the use of stochastic matrices. Biometrics. 1954, 10: 347-360. 10.2307/3001590.View ArticleGoogle Scholar
- Jacquard A: The Genetic Structure of Populations. 1974, New York: SpringerView ArticleGoogle Scholar
- Thompson EA: The estimation of pairwise relationships. Ann Hum Genet. 1975, 39: 173-188. 10.1111/j.1469-1809.1975.tb00120.x.View ArticlePubMedGoogle Scholar
- Brenner C, Weir BS: Issues and strategies in the DNA identification of World Trade Center victims. Theor Pop Biol. 2003, 63: 173-178. 10.1016/S0040-5809(03)00008-X.View ArticleGoogle Scholar
- Cash HD, Hoyle JW, Sutton AJ: Development under extreme conditions: forensic bioinformatics in the wake of the World Trade Center disaster. Pac Symp Biocomput. 2003, 638-53.Google Scholar
- LeClair B, Niezgoda S, Carmody GR, Shaler RC: Kinship analysis and human identification in mass disasters: The use of MDKAP for the World Trade Center tragedy. 13th International Symposium on Human Identification. 2002Google Scholar
- Leclair B, Fregean CJ, Bowen KL, Fourney RM: Enhanced kinship analysis and STR-based DNA typing for human identification in mass fatality incidents: The Swissair flight 111 disaster. J Forensic Sci. 2004, 49: 939-953.PubMedGoogle Scholar
- Lau G, Tan WF, Tan PH: After the Indian Ocean Tsunami: Singapore's contribution to the International Disaster Victim Identification effort in Thailand. Ann Acad Med Singapore. 2005, 34: 341-351.PubMedGoogle Scholar
- Buckleton J, Triggs C, Clayton T: Disaster victim identification, identification of missing persons, and immigration cases. Forensic DNA Evidence Interpretation. Edited by: Buckleton J, Triggs CM, Walsh SJ. 2005, Boca Raton: CRC Press, 395-437.Google Scholar
- Drabek J: Validation of software for calculating the likelihood ratio for parentage and kinship. Forensic Sci Int Genet. 2009, 3: 112-118. 10.1016/j.fsigen.2008.11.005.View ArticlePubMedGoogle Scholar
- Dawid AP, Mortera J, Vicard P: Objected-oriented Bayesian networks for complex forensic DNA profiling problems. Forensic Sci Int. 2007, 169: 195-205. 10.1016/j.forsciint.2006.08.028.View ArticlePubMedGoogle Scholar
- Brenner CH: Symbolic kinship program. Genetics. 1997, 145: 535-542.PubMed CentralPubMedGoogle Scholar
- Brenner CH: DNAView manual, version 27.49.Google Scholar
- Hepler AB, Weir BS: Object-oriented Bayesian networks for paternity cases with allelic dependencies. Forensic Sci Int Genet. 2008, 2: 166-175. 10.1016/j.fsigen.2007.12.003.PubMed CentralView ArticlePubMedGoogle Scholar
- Egeland T, Mostad PF, Mevag B, Stenersen M: Beyond traditional paternity and identification cases: selecting the most probable pedigree. Forensic Sci Int. 2000, 110: 47-59. 10.1016/S0379-0738(00)00147-X.View ArticlePubMedGoogle Scholar
- Ellegren H: Microsatellite mutations in the germline: implications for evolutionary inference. Trends Genet. 2000, 16: 551-558. 10.1016/S0168-9525(00)02139-9.View ArticlePubMedGoogle Scholar
- Schlotterer C: Evolutionary dynamics of microsatellite DNA. Chromosoma. 2000, 109: 365-371. 10.1007/s004120000089.View ArticlePubMedGoogle Scholar
- Elston RC, Stewart J: A general model for the genetic analysis of pedigree data. Hum Hered. 1971, 21: 523-542. 10.1159/000152448.View ArticlePubMedGoogle Scholar
- National Research Council Committee on DNA Forensic Science: An Update: the Evaluation of Forensic DNA Evidence. 1996, Washington (DC): National Academy PressGoogle Scholar
- Ge J, Wang T, Birdwell JD, Chakraborty R: Further remarks on: 'Paternity analysis in special fatherless cases without direct testing of alleged father' [FSI 146 S (2004) S159-S161] and remarks on it [FSI 163 (2006) 158-160]. Forensic Sci Int. 2007, 172: e6-8. 10.1016/j.forsciint.2007.07.008.View ArticlePubMedGoogle Scholar
- Balding DJ, Nichols RA: DNA profile match probability calculation - How to allow for population stratification, relatedness, database selection and single bands. Forensic Sci Int. 1994, 64: 125-140. 10.1016/0379-0738(94)90222-4.View ArticlePubMedGoogle Scholar
- Evett IW, Weir BS: Interpreting DNA Evidence: Statistical Genetics for Forensic Scientists. 1998, Sunderland, MA: Sinauer, 140-141.Google Scholar
- Chakraborty R, Stivers DN, Zhong Y: Estimation of mutation rates from parentage exclusion data: applications to STR and VNTR loci. Mut Res. 1996, 354: 41-48.View ArticleGoogle Scholar
- DiRienzo A, Peterson AC, Garza JC, Valdes AM, Slatkin M, Freimer NB: Mutational processes of simple-sequence repeat loci in human populations. Proc Natl Acad Sci USA. 1994, 91: 3166-3170. 10.1073/pnas.91.8.3166.View ArticleGoogle Scholar
- Estoup A, Jarne P, Cornuet JM: Homoplasy and mutation model at microsatellite loci and their consequences for population genetic analysis. Mol Ecol. 2002, 11: 1591-1604. 10.1046/j.1365-294X.2002.01576.x.View ArticlePubMedGoogle Scholar
- Kimura M, Crow JF: The number of alleles that can be maintained in a finite population. Genetics. 1964, 49: 725-738.PubMed CentralPubMedGoogle Scholar
- Kimura M, Ohta T: Stepwise mutation model and distribution of allelic frequencies in a finite population. Proc Natl Acad Sci USA. 1978, 75: 2868-2872. 10.1073/pnas.75.6.2868.PubMed CentralView ArticlePubMedGoogle Scholar
- Crow JF, Kimura M: An Introduction to Population Genetics Theory. 1970, New York: Harper, Row, 591-Google Scholar
- AABB: Annual report summary for testing in 2006. [http://www.aabb.org/sa/facilities/Documents/rtannrpt06.pdf]
- Brinkmann B, Klintschar M, Neuhuber F, Hühne J, Rolf B: Mutation rate in human microsatellites: influence of the structure and length of the tandem repeat. Am J Hum Genet. 1998, 62: 1408-1415. 10.1086/301869.PubMed CentralView ArticlePubMedGoogle Scholar
- Walsh B, Redd A, Hammer M: Joint match probabilities for Y chromosomal and autosomal markers. Forensic Sci Int. 2008, 174: 234-238. 10.1016/j.forsciint.2007.03.014.View ArticlePubMedGoogle Scholar
- Budowle B, Ge J, Aranda X, Planz J, Eisenberg A, Chakraborty R: Texas population substructure and its impact on estimating the rarity of Y STR haplotypes from DNA evidence. J Forensic Sci. 2009, 54: 1016-1021. 10.1111/j.1556-4029.2009.01105.x.View ArticlePubMedGoogle Scholar
- Scientific Working Group on DNA Analysis Methods (SWGDAM): Guidelines for mitochondrial DNA (mtDNA) nucleotide sequence interpretation. Forensic Sci Comm. 2003, 5:Google Scholar
- Ruitberg CM, Reeder DJ, Butler JM: STRBase: a short tandem repeat DNA database for the human identity testing community. Nucleic Acids Res. 2001, 29: 320-322. 10.1093/nar/29.1.320.PubMed CentralView ArticlePubMedGoogle Scholar
- Dawid AP, Mortera J, Pascali VL: Non-fatherhood or mutation? A probabilistic approach to parental exclusion in paternity testing. Forensic Sci Int. 2001, 124: 55-61. 10.1016/S0379-0738(01)00564-3.View ArticlePubMedGoogle Scholar
- Egeland T, Mostad PF: Statistical genetics and genetical statistics: a forensic perspective. Scand J Stat. 2002, 29: 297-307. 10.1111/1467-9469.00284.View ArticleGoogle Scholar
- Ge J, Budowle B, Aranda XG, Planz JV, Eisenberg AJ, Chakraborty R: Mutation rates at Y chromosome short tandem repeats in Texas populations. Forensic Sci Int Genet. 2009, 3: 179-184. 10.1016/j.fsigen.2009.01.007.View ArticlePubMedGoogle Scholar
- Budowle B, Monson KL, Chakraborty R: Estimating minimum allele frequencies for DNA profile frequency estimates for PCR-based loci. Int J Legal Med. 1996, 108: 173-6. 10.1007/BF01369786.View ArticlePubMedGoogle Scholar
Copyright
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Comments
View archived comments (1)