Machine Learning Methods to Predict Density Functional Theory B3LYP Energies of HOMO and LUMO Orbitals

Journal of Chemical Information and Modeling
Florbela Pereira, Joao Aires-de-Sousa


Machine learning algorithms were explored for the fast estimation of HOMO and LUMO orbital energies calculated by DFT B3LYP, using molecular descriptors based exclusively on connectivity. The project involved the retrieval and generation of molecular structures, quantum chemical calculations for a database of >111 000 structures, development of new molecular descriptors, and training/validation of machine learning models. Several machine learning algorithms were screened, and an applicability domain was defined based on Euclidean distances to the training set. Random forest models predicted an external test set of 9989 compounds, achieving mean absolute error (MAE) up to 0.15 and 0.16 eV for the HOMO and LUMO orbitals, respectively. The impact of the quantum chemical calculation protocol was assessed with a subset of compounds. Inclusion of the orbital energy calculated by PM7 as an additional descriptor significantly improved the quality of estimations (reducing the MAE by >30%).
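As an illustration of two ideas named in the abstract, the sketch below shows one possible way to define an applicability domain from Euclidean distances to the training set, and how an MAE is computed. It is not the authors' implementation. The abstract does not say how the distances are aggregated or what cutoff was used, so the nearest-neighbour rule, the `threshold` value, the function names, and the toy descriptor vectors are all assumptions made for the example.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two descriptor vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def nearest_training_distance(query, training_set):
    """Distance from a query vector to its closest training-set vector."""
    return min(euclidean(query, t) for t in training_set)

def in_applicability_domain(query, training_set, threshold):
    """Assumed rule: a query is inside the domain if its nearest
    training-set neighbour lies within `threshold` (hypothetical cutoff)."""
    return nearest_training_distance(query, training_set) <= threshold

def mean_absolute_error(y_true, y_pred):
    """Mean absolute error between reference and predicted values."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

# Toy descriptor vectors (illustrative only, not real molecular descriptors).
training = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
threshold = 0.5  # hypothetical cutoff, not taken from the paper

inside = in_applicability_domain((0.1, 0.1), training, threshold)
outside = in_applicability_domain((3.0, 4.0), training, threshold)

# Toy orbital energies in eV (illustrative only).
mae = mean_absolute_error([-6.0, -5.5], [-6.1, -5.3])
```

In a workflow of this kind, predictions for compounds that fall outside the domain would typically be flagged as less reliable rather than discarded; whether the paper does so is not stated in the abstract.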



