TY - JOUR
T1 - Why big data and compute are not necessarily the path to big materials science
AU - Fujinuma, Naohiro
AU - DeCost, Brian
AU - Hattrick-Simpers, Jason
AU - Lofland, Samuel E.
N1 - Publisher Copyright:
© 2022, This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply.
PY - 2022/12
Y1 - 2022/12
N2 - Applied machine learning has rapidly spread throughout the physical sciences. In fact, machine learning-based data analysis and experimental decision-making have become commonplace. Here, we reflect on the ongoing shift in the conversation from proving that machine learning can be used, to how to effectively implement it for advancing materials science. In particular, we advocate a shift from a big data and large-scale computations mentality to a model-oriented approach that prioritizes the use of machine learning to support the ecosystem of computational models and experimental measurements. We also recommend an open conversation about dataset bias to stabilize productive research through careful model interrogation and deliberate exploitation of known biases. Further, we encourage the community to develop machine learning methods that connect experiments with theoretical models to increase scientific understanding rather than incrementally optimizing materials. Moreover, we envision a future of radical materials innovations enabled by computational creativity tools combined with online visualization and analysis tools that support active outside-the-box thinking within the scientific knowledge feedback loop.
AB - Applied machine learning has rapidly spread throughout the physical sciences. In fact, machine learning-based data analysis and experimental decision-making have become commonplace. Here, we reflect on the ongoing shift in the conversation from proving that machine learning can be used, to how to effectively implement it for advancing materials science. In particular, we advocate a shift from a big data and large-scale computations mentality to a model-oriented approach that prioritizes the use of machine learning to support the ecosystem of computational models and experimental measurements. We also recommend an open conversation about dataset bias to stabilize productive research through careful model interrogation and deliberate exploitation of known biases. Further, we encourage the community to develop machine learning methods that connect experiments with theoretical models to increase scientific understanding rather than incrementally optimizing materials. Moreover, we envision a future of radical materials innovations enabled by computational creativity tools combined with online visualization and analysis tools that support active outside-the-box thinking within the scientific knowledge feedback loop.
UR - http://www.scopus.com/inward/record.url?scp=85137712002&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85137712002&partnerID=8YFLogxK
U2 - 10.1038/s43246-022-00283-x
DO - 10.1038/s43246-022-00283-x
M3 - Article
AN - SCOPUS:85137712002
SN - 2662-4443
VL - 3
JO - Communications Materials
JF - Communications Materials
IS - 1
M1 - 59
ER -