The relatively recent emergence of FDP and its debated implementation [304–308] complicate the harmonization of its methodology, as is clear from inspecting all the SNP panels included in this review. It can be concluded that the factors influencing prediction accuracy are the genetic heritability of the trait, the method of SNP selection and genotyping, the informativeness of the SNPs, the reference dataset, and the mathematical approach [7,9,12,103]. Thus, before FDP methods can be used in forensic investigations, they need to be standardized and forensically validated according to the Scientific Working Group on DNA Analysis Methods (SWGDAM) guidelines [309], so that they provide reliable and reproducible results. To do so, all the technical advantages and limitations of FDP must be considered. In addition, a consensus between researchers and field experts is needed to prepare protocols and directives that meet all ethical, social, and legal requirements (reviewed in [310]).
4.1 Terminology and reporting
The first and most important issue is the terminology employed to identify FDP research. Although the term ‘FDP’ was introduced as early as 2008 [10,47], not all articles on BGA or EVC inference identify it as such, some simply referring to it as an intelligence tool. For instance, only 78 articles included in this review explicitly identify FDP (two of them as molecular photofitting). Correct use of the term, as a keyword and in the text, would allow a more consistent literature search.
Similarly, the second issue is the definition, categorization, and measurement of traits. On the one hand, considering the nature of the traits (i.e., quantitative, like height and BMI; or qualitative, such as pigmentation traits and BGA), forcing the latter into categories may lead to oversimplification [24], irreproducible results, and incomparable studies [287]. This becomes especially challenging when analysing data from multiple sources. Moreover, these categories are often conflated with stereotypes or a sense of nationality [33,287]. Although categorization is preferred in forensics [39,205,214,311], since application in casework implies human interpretation (i.e., by investigators), some researchers recommend using a continuous, quantitative spectrum instead [9,188,195]. On the other hand, measurements tend to be quite subjective, with most studies based on EVC data self-reported via questionnaires or recorded through simple observation by a medical or non-medical observer. For example, even when pigmentation traits are recorded via digital photographs, they are later interpreted and sorted into categories by researchers. To avoid errors due to differing perceptions of a trait [24,197], several studies suggested applying specialized equipment and reflectance, bioimaging, and biochemical technologies [194,210] to find stronger genotype-phenotype associations [214]. In the case of BGA, familial ancestry information up to the third degree is usually reported, accompanied by a family pedigree.
The same issue arises when FDP results are reported. For instance, Atwood et al. [34] compared different service providers in terms of prediction accuracy, clarity of reporting and consistency of terminology, limitations, cost, and time. The authors concluded that guidelines are imperative for a shared methodology, clear reporting, and easy interpretation of the analysis by non-experts. Interestingly, results were presented in many ways: from a simple verbal “likely/not likely”, to highlighting (or not) the highest probability for each trait variation or ancestry, to a visual map representing where the individual falls among the represented population clusters.
4.2 Development of panels
Before developing a panel for a certain trait or combination of traits, researchers concentrate on finding the most informative set of markers for each trait. Usually, discovery is performed via GWAS and later confirmed by association studies [9,13,287]. This helps avoid false positives and uncovers genes with weaker effects that may otherwise have been ignored [10,14]. Even so, these studies are usually carried out with small sample sizes and are not extensively replicated, creating some scepticism about the validity of the reported associations [14]. Ideally, a worldwide population scan would be key to finding candidate genes [312], considering that sub-populations are normally under-represented in exploratory panels [266]. Other studies find SNPs by comparing allele frequencies from genetic population databases (e.g., HapMap, 1000 Genomes, CEPH Human Genome Diversity Panel, Complete Genomics) with specialized tools (e.g., SPSmart, FROG-kb [313,314]).
In the case of BGA inference, it is important to select variants with extreme allele frequency differences between populations [65,69,89,102,315] and to combine markers so that levels of differentiation are equivalent across populations [95]. In contrast, the genetic complexity of EVCs, due to pleiotropy (i.e., a single SNP influencing multiple traits) [11,14], epistasis (i.e., interacting SNPs influencing a single trait) [11,197,245], allelic heterogeneity [151,232,257], phenotypic variability, and gene-environment interactions, needs to be assessed before selecting candidate markers. However, these genetic mechanisms are still not fully understood [9], and it is possible that many other implicated and more informative genes are being overlooked [15].
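As a minimal illustration of this selection criterion, candidate AISNPs can be ranked by the absolute allele frequency difference (δ) between two reference populations. The SNP identifiers, frequencies, and threshold below are hypothetical and not taken from any of the cited panels.

```python
# Hypothetical reference allele frequencies per SNP: (population 1, population 2)
candidate_snps = {
    "rs0001": (0.95, 0.10),
    "rs0002": (0.50, 0.45),
    "rs0003": (0.80, 0.15),
}

def delta(snp):
    """Absolute allele frequency difference between the two populations."""
    p1, p2 = candidate_snps[snp]
    return abs(p1 - p2)

# Rank markers and keep only those above a differentiation threshold
ranked = sorted(candidate_snps, key=delta, reverse=True)
informative = [snp for snp in ranked if delta(snp) >= 0.5]
# informative -> ['rs0001', 'rs0003']; rs0002 barely differentiates the populations
```

In practice, multi-population measures of informativeness (rather than a pairwise δ) are used to balance differentiation across all population pairs, but the ranking logic is analogous.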
One of the first debates centres on the number of SNPs needed in a panel to obtain reliable predictions. On one hand, small SNP panels must contain the most informative and differentiating markers; they are ideal for currently available SNaPshot™ technologies and yield fewer partial profiles when typing low-template DNA samples [90,112,291]. On the other hand, increasing the number of SNPs improves accuracy, especially with missing data [80,136,287,315]. However, the number of SNPs will also depend on the purpose of the analysis and the genetic complexity of the trait. For instance, the four or five main continental populations can be distinguished with ease using fewer than 40 markers [39,291], and eye colour can be inferred with only six SNPs [151]. Conversely, even though the heritability of height and eye pigmentation is similar, the number of SNPs needed to infer stature is growing by the hundreds as its molecular mechanisms are discovered [262,263]. In this sense, several authors believe it is better to use markers with a strong influence [110,312], given the scarce amount of DNA in forensic samples, while others suggest finding genes with weak effects to complement the inference [254,258,316]. Also, in the case of BGA, researchers recommend a two-tier approach: first, a panel with a maximum of 100 markers to infer at least 12 global populations, and later other panels to refine sub-population inference [39,58,121,131]. That is the case of the SNPforID 34-plex [90] and its EurasiaPlex [105], Pacifiplex [125] and PIMA [122] sub-panels.
Even though some researchers have evaluated the capacity of EVC-associated variants to be used as AISNPs [100,104,110,165–167,315,317,318], making indirect inferences of EVC from BGA, or vice versa, is a highly debated practice. Indeed, some authors made assumptions about individuals’ appearance using only BGA data [34,129], or the reverse [16,36,67,148,319]. Nonetheless, most researchers discourage this practice, especially given increasing population admixture [9,33,43,312] and the fact that some shared alleles may be related not to ancestry but to environmental exposures common to different populations [33]. Despite this, it is still important to infer BGA, as well as biological age and sex, together with EVCs, especially if a trait is restricted to a population, sex, or age group [10,165,320], to avoid misleading interpretations.
Extensive lists of markers associated with EVCs are available [8,15,54] and have been combined in multiple ways, yet the overlap of unique markers between panels is minimal. Soundararajan et al. [39] reported the same for BGA panels and emphasised the need for collaboration among researchers to find the “best” markers and test them on a large dataset representative of all global populations. Therefore, validation and inter-laboratory testing of panels is important to meet the specific quality requirements of forensic DNA analysis. Only a few systems have been validated for forensic use [6,7,9,11,27]. Furthermore, panels are commonly developed using homogeneous, European reference data and then validated in other populations; and they are replicated and validated with different methodologies, complicating comparisons [39]. The best outcome would be to adapt the panel to each individual population [321] or to obtain complete allele frequency data for all existing populations and subpopulations [131].
Another factor influencing marker choice is whether SNPs lie in ‘coding’ or ‘non-coding’ regions, and whether they are informative of other health-related phenotypes. The first distinction follows the legal regulations applied to STR identification and, although scientists have argued that these categories do not reflect reality, it is still used as a reason to include or discard markers. However, FDP implies the use of ‘associative’ markers that can be found in both coding and non-coding regions. The reluctance to include coding markers stems from their higher potential to provide health information [9,148,197], although non-coding variants can provide similar information if they are in linkage disequilibrium with the implicated coding genes [10] or regulatory regions [322]. Moreover, many disease- or trait-related candidate genes are first discovered when researching pathological or extreme variation, and other mutations within them are later found to be associated with normal variation instead [15,188]. For example, OCA2 gene mutations are associated with both eye colour and oculocutaneous albinism [15,24,164]. Regarding these off-target phenotypes, Bradbury et al. [323] studied the possibility of revealing health information while predicting EVCs: only 27 out of 1766 FDP-related markers were associated with cancer risk, induced asthma, or risk of alcoholism. However, these associations do not mean that an individual suffers from these diseases, and a single marker cannot be used to predict or confirm such risks.
Finally, there is a continuous debate on using commercial versus non-commercial panels. While commercial suppliers’ strongest point is their constant supply of ready-to-use kits, they claim the kits’ technical information (e.g., markers, accuracy, statistical model, etc.) as intellectual property. Hence, researchers cannot truly validate the kits or ensure their reproducibility. Consequently, some companies have been discontinued, like DNAPrint [33], while others, such as Parabon Nanolabs, have been criticised by many FDP experts [53].
4.3 Genotyping technology
All available SNP typing methodologies have already been evaluated for forensic application [18–20,22,27]. These techniques are known to be very versatile, allowing the combination of different chemical reactions, assay formats, and detection methods [19,20]. Then again, not all techniques are suitable, as FDP faces problems similar to those of STR identification when analysing forensic samples (i.e., low-quantity and degraded DNA, and often mixtures). The selection of a methodology will be based on its accuracy, multiplexing and automation capacity, throughput, cost, and time, as well as the purpose of the analysis (e.g., the number of traits and markers to be included).
A great number of genetic techniques have been used to infer BGA or EVCs [13,40,196,242,324]: PCR assays (e.g., PCR-RFLP [171], PCR-REBA [82], and, most commonly, the TaqMan® SNP genotyping assay), microarrays (e.g., GeneChip™ [102,287]), minisequencing (e.g., SNPlex™), MALDI-TOF (matrix-assisted laser desorption/ionization – time-of-flight) with mass spectrometry (MS) detection (e.g., Sequenom® MassARRAY®) [107,116,120], and high-resolution melting (HRM) [65,196,324]. While some techniques, like Sequenom® MassARRAY® or HRM, do not reach the sensitivity required for forensic samples [107,115,325], others have been developed but discontinued, such as Genomelab™ SNPstream® [66,156,159,214] and Genplex®. Nonetheless, the gold standard is still SNaPshot™ (an SBE-CE assay) due to its robustness, simplicity, and efficiency, but more precisely because the instrument is already present in forensic laboratories and great efforts have been invested in its standardization [2,13,40,196,325].
Despite this, SNaPshot™-CE has a higher risk of contamination and error and, more importantly, is limited to analysing a single trait, inferred with 30 to 40 markers, at a time; hence, it cannot keep up with the increasing number of markers needed for FDP [98,99,238,287]. For this reason, researchers are shifting to NGS techniques, in particular Ion Torrent™ (Thermofisher Scientific) and Illumina® [41,98,326]. These allow higher throughput, multiplexing capacity, and sequencing accuracy [15], as well as the possibility of automating and sequencing different markers in the same run (e.g., STRs, SNPs, InDels, microhaplotypes) [23,143]. However, this implies longer preparation, sequencing, and analysis times [271]. As a result, the current focus is on testing SNaPshot™ panels on NGS instruments [98,99,237,238,243,320], applying single-cell sequencing and NGS to analyse mixtures and touch DNA samples [136,142,241,297], and automating analysis and result interpretation to reduce analysis time [23]. The latter would allow better sample handling, increased sample sizes, and reduced costs and time.
All these techniques have their advantages and limitations, making it harder to choose one for standardization. Moreover, the methodology will be chosen depending on the requirements and purpose of the investigation [8,14,19,98], and any new one, such as MPS, needs to be extensively validated on larger datasets and optimized before being incorporated [35,112,271]. Other factors restraining technological advancement in the field are the costs of renovating workspaces, training staff, and increasing bioinformatic support and storage capacity [13,44,45,291].
4.4 Prediction models and algorithms
Prediction models are created to support and understand the relationship between genotype and phenotype [14,15]. There are two types of algorithms that can be used to predict BGA or EVC outcomes: statistical and machine-learning (ML). Statistical algorithms, such as MLR, work better when the predictors are dependent on each other, while ML algorithms usually assume independence among predictors [15] and detect, in linear or more complex ways, the dependencies between variables and attributes [217]. Both methods may provide similar accuracy when the same SNP panel is used [15], although ML methods require higher computational costs and more expertise. Indeed, several articles have compared and introduced different classifiers for FDP analysis [12,48,67,68,70,117,205,206,217,227,252,299,327].
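As an illustration of the statistical approach, the sketch below mimics the structure of a multinomial logistic regression (MLR) phenotype predictor: per-SNP effect sizes are summed for each category and converted into probabilities via a softmax over the linear predictors. All intercepts, effect sizes, SNP identifiers, and the genotype are invented for illustration; they are not the published parameters of any FDP model.

```python
import math

# Hypothetical per-category intercepts and per-minor-allele effect sizes;
# 'brown' is the baseline category, whose linear predictor is fixed at 0.
intercepts = {"blue": 1.0, "intermediate": 0.2}
betas = {
    "rs0001": {"blue": -2.0, "intermediate": -0.8},
    "rs0002": {"blue": 0.9, "intermediate": 0.3},
}

def predict(genotype):
    """genotype maps SNP id -> minor-allele count (0, 1, or 2)."""
    linear = {cat: intercepts[cat] + sum(betas[snp][cat] * n
                                         for snp, n in genotype.items())
              for cat in intercepts}
    exps = {cat: math.exp(v) for cat, v in linear.items()}
    exps["brown"] = 1.0  # baseline category
    total = sum(exps.values())
    return {cat: v / total for cat, v in exps.items()}

probs = predict({"rs0001": 0, "rs0002": 2})
# With these illustrative parameters, 'blue' receives the highest probability
```

Published MLR-based systems follow this same scheme, with intercepts and betas estimated from large genotype-phenotype reference datasets.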
Two of the most used programs, STRUCTURE and Snipper, are based on the NB algorithm. This algorithm calculates how likely a profile is to belong to a class by comparing it with the allele frequencies observed in each cluster, and can thereby classify unknown profiles [68,90]. It is also capable of handling missing data [68]. The gold standard for BGA inference is the STRUCTURE software (and its updated version, ADMIXTURE), because of its “efficient clustering based on similarities or dissimilarities with the other samples” [48,49,95] and thus its good inference of admixture proportions, but only if the populations are well differentiated in the reference data [90,117]. Its main disadvantages are the assumption of HWE, which is compatible with neither BGA nor EVC inference [68], and its long, computationally intensive run times when classifying single profiles against large datasets, since the parental data and the unknown profile must be analysed simultaneously and missing data must be imputed. Snipper solves some of the issues STRUCTURE presents, providing a faster analysis [89,90], allowing the incorporation of one’s own reference dataset [105], and classifying single profiles in real time [105]. The latter has been used for both BGA and EVC inference.
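The NB principle behind a Snipper-style classification can be sketched as follows: per-SNP genotype likelihoods, derived from reference allele frequencies under HWE, are multiplied (summed in log space) for each candidate population and normalized into posterior probabilities, assuming equal priors. All population labels, frequencies, and the test profile are illustrative, not real reference data.

```python
import math

# Hypothetical reference frequencies of allele 'A' per SNP, per population
ref_freqs = {
    "POP1": {"rs0001": 0.90, "rs0002": 0.20},
    "POP2": {"rs0001": 0.10, "rs0002": 0.80},
}

def genotype_likelihood(p, n_a):
    """HWE likelihood of carrying n_a copies of allele 'A' (p = its frequency)."""
    return {0: (1 - p) ** 2, 1: 2 * p * (1 - p), 2: p ** 2}[n_a]

def classify(profile):
    """Posterior probabilities assuming equal priors and independent SNPs."""
    log_lik = {pop: sum(math.log(genotype_likelihood(freqs[snp], n_a))
                        for snp, n_a in profile.items())
               for pop, freqs in ref_freqs.items()}
    total = sum(math.exp(v) for v in log_lik.values())
    return {pop: math.exp(v) / total for pop, v in log_lik.items()}

# A profile homozygous for 'A' at rs0001 and lacking 'A' at rs0002
# is strongly POP1-like under these illustrative frequencies
post = classify({"rs0001": 2, "rs0002": 0})
```

Note that this toy version assumes HWE within each population; the naive-independence assumption across SNPs is what makes the calculation fast enough for real-time classification of single profiles.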
Other alternative methods have also been tested. For example, GDA provides continuous clustering by evaluating the informative proportions of each component; it does not assume HWE, and its output can be used as input for hierarchical clustering, such as neighbour-joining trees. Although it is highly sensitive to noise [48], it has proven better for admixture classification [67,117]. On another note, visual representations of individual and population structure, like principal component analysis (PCA), discriminant analysis of principal components (DAPC) [290], or multidimensional scaling (MDS), are helpful for interpreting the outcome [68]. However, since they reduce the data to the two or three most important components, they may lead to misclassification [48]. In addition, logistic regression (bi- or multinomial LR) is well suited to categorical outcomes, even though it tends to misclassify partial profiles [68]; it has traditionally been applied to infer pigmentation colours [151,219]. Also, multifactor dimensionality reduction (MDR) is used in small-sample studies to better detect epistatic effects [233,245,328]. Other available and tested ML methods are linear discriminant analysis (LDA), support vector machines (SVM) [110,217,316], partial least squares regression (PLSR) [156], extreme gradient boosting (XGB) [217,246], classification and regression trees (CRT) [204,217,218,254], multivariate adaptive regression splines (MARS) [217], bootstrapped response-based imputation modelling (BRIM), ordinal and stepwise regressions (OR and SR) [209,246], neural networks (NN), and random forest (RF) [67,117,155,183,217,246,252,316]. NNs have been proposed as an alternative to LR, as they recognise the patterns of complex data typical of EVC inference [156,254].
Hence, not all algorithms are appropriate, and they will need to be selected based on several aspects. First, the amount and type of data [217], as well as the impact of missing/partial profiles on classification performance [68,89]. Second, the reference populations, which affect not only the selection of SNPs but also the training of the classifiers; these must be representative of all variations and ancestries, especially when estimating admixed individuals [35,67,68,117,329]. Third, given the inability to incorporate environmental factors into the prediction, only sex and age can be included as covariates. In the same way, the accuracy of the model will increase when considering both BGA and EVC if there is population dependency [188,189]. Some researchers hold that “when all the causing factors of a trait will be accounted for in the model, then the accuracy will be the same in all populations” [330].
Lastly, there are many options for interpreting the results obtained from the prediction model. It is key that field and legal experts easily understand and apply the findings. Logically, one may recommend continuing to use likelihood ratios (LR), since they are already used in STR identification [133,188,195,272]. Nonetheless, as Caliebe et al. observed [321], FDP does not apply the same principle of comparing two hypotheses (i.e., the sample belonging to a random individual vs. the suspect), and the highest value may not represent the correct category [35]. Hence, it may be more appropriate to use statistical probability, represented as posterior odds (PO), although statistics are often harder for a lay audience to understand. Other ways to represent accuracy have been adopted: the area under the curve (AUC) for categorical predictions, which varies from 0.5 (random phenotype) to 1 (exact phenotype) [7,11,12,15,17]; and correlation (R or R2) or mean squared error (MSE) for quantitative measurements [15].
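For the AUC in particular, its rank-based definition (the probability that a randomly chosen individual showing the phenotype receives a higher predicted probability than a randomly chosen individual who does not) can be computed directly from a set of predictions. The scores and labels below are illustrative only.

```python
def auc(scores, labels):
    """Rank-based AUC: fraction of positive/negative pairs ranked correctly."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    # Ties between a positive and a negative score count as half a win
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Illustrative predicted probabilities of a phenotype vs. observed state (1/0)
scores = [0.95, 0.80, 0.60, 0.40, 0.20]
labels = [1, 1, 0, 1, 0]
print(auc(scores, labels))  # 5 of 6 pairs ranked correctly -> 0.8333...
```

An AUC of 0.5 corresponds to random ranking and 1 to perfect separation, matching the range quoted above for categorical FDP predictions.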