Symptom-based classification of 16p11.2 copy number variations underlying the multidimensional autism spectrum disorder phenotype using machine learning methods

dc.contributor.authorBolat, Hilmi
dc.contributor.authorUnsel-bolat, Gul
dc.contributor.authorBulut, Edanur
dc.contributor.authorOzgul, Semiha
dc.contributor.authorTuran, Duygu Selin
dc.contributor.authorCelik, Samet
dc.contributor.authorKosova, Buket
dc.date.accessioned2026-06-21T16:20:59Z
dc.date.created2026
dc.date.issued2026
dc.departmentBartın Üniversitesi
dc.description.abstractPurpose Copy number variations (CNVs) in the 16p11.2 region are well-established contributors to neurodevelopmental disorders, yet phenotype variability across this locus remains insufficiently characterized. This study investigates clinical features and ASD-related symptoms among carriers of rare pathogenic and common CNVs, and evaluates symptom-level discriminability using machine learning (ML) methods. Methods Genetic data from 7568 individuals were retrospectively screened, identifying 147 carriers of 16p11.2 CNVs. Detailed clinical assessments were completed for 50 participants. ASD-related symptoms were evaluated using a structured 25-item instrument. Group comparisons applied nonparametric statistics with effect sizes, confidence intervals, and FDR correction. ML analyses used PCA and k-means for feature selection, oversampling methods (SMOTE, Borderline-SMOTE, ADASYN), and five classifiers, evaluated through cross-validation. Results Across pathogenic and common CNV groups, no significant differences were observed in social communication, restricted/repetitive behaviors, sensory symptoms, regression, or total autism scores (FDR-adjusted p > 0.05). Aggression was more frequently endorsed in pathogenic CNV carriers (raw p = 0.030; FDR p = 0.098). BMI was higher in pathogenic CNVs, though nonsignificant after correction (raw p = 0.027; FDR p = 0.152). ML analyses identified three recurrent discriminative symptoms across multiple datasets: delayed response to name, unusual object play, and aggression. Dataset 3 (16 symptoms) provided the most balanced classification performance but, given the very small pathogenic CNV sample, results remain exploratory. Conclusion Findings suggest that, while most autism-related symptoms do not differ between groups, aggression and increased BMI may represent preliminary phenotypic signals associated with pathogenic CNVs. Integrating clinical data from 147 CNV carriers further supports a potential widespread effect across the broader 16p11.2 locus rather than a single breakpoint-specific mechanism. However, all results should be interpreted cautiously due to limited sample size, and larger, systematically phenotyped cohorts are required to establish robust genotype-phenotype relationships.
dc.description.sponsorshipScientific and Technological Research Council of Turkiye (TUBITAK) [321S239]
dc.description.sponsorshipThis study was supported by the Scientific and Technological Research Council of Turkiye (TUBITAK) under project number 321S239.
dc.identifier.doi10.1016/j.reia.2026.202865
dc.identifier.issn3050-6573
dc.identifier.issn3050-6565
dc.identifier.orcid0000-0003-4870-6945
dc.identifier.orcid0000-0003-2654-3698
dc.identifier.scopus2-s2.0-105030284433
dc.identifier.scopusqualityN/A
dc.identifier.urihttp://doi.org/10.1016/j.reia.2026.202865
dc.identifier.urihttps://hdl.handle.net/11772/27405
dc.identifier.volume132
dc.identifier.wosWOS:001700449100001
dc.identifier.wosqualityN/A
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherElsevier Sci Ltd
dc.relation.ispartofResearch in Autism
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.snmzKA_WoS_20260621
dc.subject16P11.2
dc.subjectCnvs
dc.subjectMachine Learning
dc.subjectNeurodevelopmental Disorders
dc.titleSymptom-based classification of 16p11.2 copy number variations underlying the multidimensional autism spectrum disorder phenotype using machine learning methods
dc.typeArticle
dspace.entity.typePublication

Dosyalar