Descriptor engineering in machine learning regression of electronic structure properties for 2D materials
- Others:
- Université Côte d'Azur (UCA)
- Centre de recherche sur l'hétéroepitaxie et ses applications (CRHEA) ; Université Nice Sophia Antipolis (1965 - 2019) (UNS) ; COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-Centre National de la Recherche Scientifique (CNRS)-Université Côte d'Azur (UCA)
- UCA-CSI 2021, Doeblin Federation (FR 2800), INP-CNRS Tremplin 2022 and Dialog 2022
Description
We build new material descriptors to predict the band gap and the work function of 2D materials by tree-based machine-learning models. The descriptor's construction is based on vectorizing property matrices and on empirical property function, leading to mixing features that require low-resource computations. Combined with database-based features, the mixing features significantly improve the training and prediction of the models. We find R 2 greater than 0.9 and mean absolute errors (MAE) smaller than 0.23 eV both for the training and prediction. The highest R 2 of 0.95, 0.98 and the smallest MAE of 0.16 eV and 0.10 eV were obtained by using extreme gradient boosting for the bandgap and work-function predictions, respectively. These metrics were greatly improved as compared to those of database features-based predictions. We also find that the hybrid features slightly reduce the overfitting despite a small scale of the dataset. The relevance of the descriptor-based method was assessed by predicting and comparing the electronic properties of several 2D materials belonging to new classes (oxides, nitrides, carbides) with those of conventional computations. Our work provides a guideline to efficiently engineer descriptors by using vectorized property matrices and hybrid features for predicting 2D materials properties via ensemble models.
Abstract
International audience
Additional details
- URL
- https://hal.science/hal-04058161
- URN
- urn:oai:HAL:hal-04058161v1
- Origin repository
- UNICA