Moscow University Anthropology Bulletin

Comparability of results from canonical discriminant analysis based on different input data

Fedorchuk O.A. (1, 2, 4), Goncharova N.N. (1, 3)

1) Lomonosov Moscow State University, Faculty of Biology, Department of Anthropology, Leninskie Gory, 1(12), Moscow, 119234, Russia; 2) Lomonosov Moscow State University, Anuchin Research Institute and Museum of Anthropology, Mokhovaya st., 11, Moscow, 125009, Russia; 3) FSBI «Research Centre for Medical Genetics», Moskvorechie st., 1, Moscow, 115522, Russia; 4) Paleoethnology Research Center, Novaja plochad, 12, 5, Moscow, 109012, Russia

Fedorchuk Olga A., PhD; ORCID ID: 0000-0002-9645-2014; lela.fed@yandex.ru; Goncharova Natalia N., PhD; ORCID ID: 0000-0001-8504-1175; 1455008@gmail.com

Abstract

Introduction. Canonical discriminant analysis based on the mean values of traits is widely used by anthropologists. Such analyses rely on averaged standard deviations and on standard correlation coefficients. Whether their results are comparable with results obtained from individual values remains an open question. Moreover, inter-group variability in correlation coefficients can alter the results when the correlation matrix is calculated for the specific groups under analysis. This study compares three variants of canonical discriminant analysis: one based on individual data, one based on mean values and a generalized (species-specific) correlation matrix, and one based on mean values and a regional correlation matrix (calculated for a particular region).

Materials and methods. Data from 48 ethno-territorial groups from the Old World were used. The series date close to modern times, from the 16th to the 20th century. Twenty-five linear craniometric traits were measured. Canonical analysis of individual data was performed with a package for the R language, and the analysis of mean data was performed in the MultiCan software.

Results. The analyses performed on individual data and on mean data gave quite similar results. Comparing a series of discriminant analyses carried out on samples of the three major races with different correlation matrices reveals some small differences in the relative positions of groups. In general, the distribution of samples in the scatter plots and the standardized coefficients of the discriminant functions coincide regardless of the type of input data.

Conclusion. Individual values and sample means lead to the same results in most cases. When individual values are used, the results may be distorted by a strong reduction in the number of samples. Sample differentiation in this case is also strongly influenced by the higher real intra-group variability.
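The comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' workflow (they used R and MultiCan); it only shows, under stated assumptions, how a canonical discriminant analysis can be computed from group means, group sizes, within-group standard deviations and a correlation matrix. The function names `summarize` and `cda_from_summary`, the synthetic data, and the identity matrix standing in for a "generalized" correlation matrix are all hypothetical.

```python
import numpy as np


def summarize(x, labels):
    """Reduce individual data to group means, sizes, pooled SDs and correlations."""
    x = np.asarray(x, dtype=float)
    labels = np.asarray(labels)
    groups = np.unique(labels)
    means = np.array([x[labels == k].mean(axis=0) for k in groups])
    sizes = np.array([(labels == k).sum() for k in groups])
    # Residuals around each individual's own group mean.
    resid = x - means[np.searchsorted(groups, labels)]
    pooled = resid.T @ resid / (len(x) - len(groups))
    sds = np.sqrt(np.diag(pooled))
    corr = pooled / np.outer(sds, sds)
    return means, sizes, sds, corr


def cda_from_summary(means, sizes, sds, corr):
    """Canonical discriminant analysis from summary statistics only."""
    means = np.asarray(means, dtype=float)
    sizes = np.asarray(sizes, dtype=float)
    sds = np.asarray(sds, dtype=float)
    corr = np.asarray(corr, dtype=float)
    g, p = means.shape

    # Within-group covariance rebuilt from standard deviations and correlations.
    within = corr * np.outer(sds, sds)

    # Between-group covariance of the size-weighted group means.
    grand = sizes @ means / sizes.sum()
    centered = means - grand
    between = (centered * sizes[:, None]).T @ centered / (g - 1)

    # Solve between @ v = lambda * within @ v through a Cholesky whitening step.
    inv_chol = np.linalg.inv(np.linalg.cholesky(within))
    eigvals, eigvecs = np.linalg.eigh(inv_chol @ between @ inv_chol.T)
    order = np.argsort(eigvals)[::-1][: min(p, g - 1)]
    coefs = inv_chol.T @ eigvecs[:, order]  # raw canonical coefficients

    return {
        "eigenvalues": eigvals[order],
        "raw_coefficients": coefs,
        "standardized_coefficients": coefs * sds[:, None],
        "group_scores": centered @ coefs,
    }


# Synthetic example: 4 groups of 30 individuals, 5 traits (illustrative only).
rng = np.random.default_rng(0)
labels = np.repeat(np.arange(4), 30)
shift = np.array([1.0, 0.5, 0.0, -0.5, 0.2])
x = rng.normal(size=(120, 5)) + labels[:, None] * shift

means, sizes, sds, corr = summarize(x, labels)
own = cda_from_summary(means, sizes, sds, corr)           # sample-specific matrix
generic = cda_from_summary(means, sizes, sds, np.eye(5))  # stand-in "standard" matrix
print(own["eigenvalues"], generic["eigenvalues"])
```

Swapping the `corr` argument is the only change needed to move between a sample-specific and a generalized correlation matrix, so the group scores and standardized coefficients of the two variants can be compared directly.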

Keywords

biological anthropology; archaeological material; craniology; biometry; canonical discriminant analysis (CDA); correlations

DOI: 10.32521/2074-8132.2023.1.062-077

Cite as: Fedorchuk O.A., Goncharova N.N. Comparability of results from canonical discriminant analysis based on different input data // Moscow University Anthropology Bulletin (Vestnik Moskovskogo Universiteta. Seria XXIII. Antropologia), 2023; 1/2023; pp. 62-77 (Published: February 28, 2023)

Registration certificate PI No. FS77-35672, dated March 19, 2009.