Featured Application: The presented proof-of-concept machine learning pipeline enables the reconstruction of original cusp morphology at the enamel–dentine junction (EDJ) in worn lower molars using standard morphometric data. The pre-trained models and automated prediction tools can be applied to new fossils, archaeological, or clinical datasets that provide compatible EDJ measurements, without requiring programming expertise. In its current form, the pipeline offers a ready-to-use framework for EDJ-based reconstruction in research settings and establishes the foundation for future automated systems capable of predicting enamel thickness and full crown morphology in worn or damaged teeth. Reconstructing original cusp dimensions in worn molars represents a fundamental challenge across dentistry, anthropology, and paleontology, as dental wear obscures critical morphological information. In this proof-of-concept study, we present a standardized machine learning pipeline for predicting original cusp height, specifically the horn tips of the enamel–dentine junction (EDJ), in worn lower molars using three-dimensional morphometric data from micro-computed tomography (micro-CT). We analyzed 40 permanent lower first (M1) and second (M2) molars from four hominin groups, systematically evaluated across three wear stages: original, moderately worn (worn1), and severely worn (worn2). Morphometric variables including height, area, and volume were quantified for each cusp, with Random Forest and multiple linear regression models developed individually and combined through ensemble methods. To mimic realistic reconstruction scenarios while preserving a known ground truth, models were trained on unworn specimens (original EDJ morphology) and tested on other teeth after digitally simulated wear (worn1 and worn2). Predictive performance was evaluated using root mean square error (RMSE) and coefficient of determination (R2). Our results demonstrate that under moderate wear (worn1), the ensemble models achieved normalized RMSE values between 11% and 17%. Absolute errors typically below 0.25 mm for most cusps, with R2 values up to ~0.69. Performance deteriorated under severe wear (worn2), particularly for morphologically variable cusps such as the hypoconid and entoconid, but generally remained within sub-millimetric error ranges for several structures. Random Forests and linear models showed complementary strengths, and the ensemble generally offered the most stable performance across cusps and wear states. To enhance transparency and accessibility, we provide a comprehensive, user-friendly software pipeline including pre-trained models, automated prediction scripts, standardized data templates, and detailed documentation. This implementation allows researchers without advanced machine learning expertise to explore EDJ-based reconstruction from standard morphometric measurements in new datasets, while explicitly acknowledging the limitations imposed by our modest and taxonomically unbalanced sample. More broadly, the framework represents an initial step toward predicting complete crown morphology, including enamel thickness, in worn or damaged teeth. As such, it offers a validated methodological foundation for future developments in cusp and crown reconstruction in both clinical and evolutionary dental research.

A Machine Learning Pipeline for Cusp Height Prediction in Worn Lower Molars: Methodological Proof-of-Concept and Validation Across Homo

Napolitano R.
Primo
;
Martini P.
;
Di Domenico G.;
2026

Abstract

Featured Application: The presented proof-of-concept machine learning pipeline enables the reconstruction of original cusp morphology at the enamel–dentine junction (EDJ) in worn lower molars using standard morphometric data. The pre-trained models and automated prediction tools can be applied to new fossils, archaeological, or clinical datasets that provide compatible EDJ measurements, without requiring programming expertise. In its current form, the pipeline offers a ready-to-use framework for EDJ-based reconstruction in research settings and establishes the foundation for future automated systems capable of predicting enamel thickness and full crown morphology in worn or damaged teeth. Reconstructing original cusp dimensions in worn molars represents a fundamental challenge across dentistry, anthropology, and paleontology, as dental wear obscures critical morphological information. In this proof-of-concept study, we present a standardized machine learning pipeline for predicting original cusp height, specifically the horn tips of the enamel–dentine junction (EDJ), in worn lower molars using three-dimensional morphometric data from micro-computed tomography (micro-CT). We analyzed 40 permanent lower first (M1) and second (M2) molars from four hominin groups, systematically evaluated across three wear stages: original, moderately worn (worn1), and severely worn (worn2). Morphometric variables including height, area, and volume were quantified for each cusp, with Random Forest and multiple linear regression models developed individually and combined through ensemble methods. To mimic realistic reconstruction scenarios while preserving a known ground truth, models were trained on unworn specimens (original EDJ morphology) and tested on other teeth after digitally simulated wear (worn1 and worn2). Predictive performance was evaluated using root mean square error (RMSE) and coefficient of determination (R2). Our results demonstrate that under moderate wear (worn1), the ensemble models achieved normalized RMSE values between 11% and 17%. Absolute errors typically below 0.25 mm for most cusps, with R2 values up to ~0.69. Performance deteriorated under severe wear (worn2), particularly for morphologically variable cusps such as the hypoconid and entoconid, but generally remained within sub-millimetric error ranges for several structures. Random Forests and linear models showed complementary strengths, and the ensemble generally offered the most stable performance across cusps and wear states. To enhance transparency and accessibility, we provide a comprehensive, user-friendly software pipeline including pre-trained models, automated prediction scripts, standardized data templates, and detailed documentation. This implementation allows researchers without advanced machine learning expertise to explore EDJ-based reconstruction from standard morphometric measurements in new datasets, while explicitly acknowledging the limitations imposed by our modest and taxonomically unbalanced sample. More broadly, the framework represents an initial step toward predicting complete crown morphology, including enamel thickness, in worn or damaged teeth. As such, it offers a validated methodological foundation for future developments in cusp and crown reconstruction in both clinical and evolutionary dental research.
2026
Napolitano, R.; Alichane, H.; Martini, P.; Di Domenico, G.; Martin, R. M. G.; Hublin, J. -J.; Oxilia, G.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in SFERA sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11392/2620910
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 1
social impact