News & Analysis
/
Article

Exploring solar cell materials with text-based machine learning

FEB 11, 2022
Natural language process techniques read and predict solar cell data from literature.
Ashley Piccone headshot
Press Officer American Institute of Physics
Exploring solar cell materials with text-based machine learning internal name

Exploring solar cell materials with text-based machine learning lead image

Because solar cells provide clean energy, optimizing and studying their constituent materials is crucial. Big data, coupled with machine learning, can expedite this material design process.

However, much of the literature on solar cell materials exists in text form and is difficult to compare to other types of data. To address this, Zhang and Mu created a text-based model to read papers, extract important information, and predict solar cell materials and their properties.

“Reading these papers can be extremely troublesome for many researchers,” said author Lei Zhang. “As a result, automating the literature reading process in an unsupervised manner can be important. The computers do not get tired easily and can read the papers 24 hours a day, 7 days a week!”

The researchers use the natural language process technique to read the text data, which is vastly different than traditional machine learning data types.

As the model reads in abstracts from an online database, it summarizes the major materials and their functionalities. It predicts solar cell materials, most of which are well known, but some of which are unexpected possibilities. These new candidates are further examined with first principles calculations to reveal their optoelectronic properties.

The team believes the model can act as a platform for future machine learning studies of materials and plans to extend the model to read non-English data.

“Reading papers can be extremely difficult for many foreign researchers and students in non-English-speaking countries,” said Zhang. “On the other hand, a lot of important data are reported in the non-English-written journals, patents and reports, which are relevant for the data-driven studies but are frequently overlooked.”

Source: “Unsupervised machine learning for solar cell materials from literature,” by Lei Zhang and Mu He, Journal of Applied Physics (2022). The article can be accessed at https://doi.org/10.1063/5.0064875 .

Related Topics
More Science
/
Article
First known mountain ridge impact found at Zhangguang Cailing
/
Article
Using magnetic resonance velocimetry facilitates precise structural modifications based on flow characteristics and turbulent kinetic energy
AAS
/
Article
The Astronomical Journal turns 175 this month — join us in celebrating the journal’s impact and looking back on its history. The post Celebrating 175 Years of The Astronomical Journal appeared first on AAS Nova.
APS
/
Article
The experimental confirmation of supersolid vortices opens the prospect of making and studying laboratory analogues of rotating neutron stars.