LLM-Driven Knowledge Extraction for Noble Metal Nanomaterial Discovery and Applications

Jessica Millen; Henry Gauder; Nathan Langford; Mohammed Khan; Eugenia Valsami-Jones

Previous Article in event

Loading of fluorescent anticancer drug coralyne into micelles, liposomes and peptide self-assemblies

Previous Article in session

Sculpting Chirality at the Nanoscale: Size-Dependent Enantioselectivity in Nanoparticles from MD and DFT

Next Article in event

IN SITU FORMING GELS FOR SUBCUTANEOUS DELIVERY OF CURCUMIN AND PIPERINE

Next Article in session

FAIR data principles and AI ethics: Exploring convergence and gaps

LLM-Driven Knowledge Extraction for Noble Metal Nanomaterial Discovery and Applications

Jessica Millen

Henry Gauder

Nathan Langford

Mohammed Khan

^*,

Eugenia Valsami-Jones

¹ School of Geography, Earth and Environmental Sciences, University of Birmingham, Edgbaston, Birmingham B15 2TT, UK

Academic Editor: Eugenia Valsami-Jones

Published: 16 March 2026 by MDPI in Nanomaterials 2026: Innovations and Future Perspectives session Computational Nanoscience

Abstract:

Advances in the discovery, synthesis, characterisation and deployment of noble metal nanomaterials, including gold, silver, platinum and palladium, are central to transformative progress across a wide range of sectors, notably catalysis, biomedicine, chemical sensing and energy conversion and storage.

Despite rapid growth in the field, the pace of innovation is increasingly limited by the fragmentation of knowledge across disparate sources, including peer-reviewed literature, experimental datasets, patents and computational materials databases. Valuable insights are often buried within unstructured text or siloed resources, hindering systematic comparison, reuse, and translation across disciplines.

This presentation discusses the development of a masked-language-model-based data retrieval and analysis pipeline capable of automatically extracting, structuring and synthesising information from the global corpus of noble metal nanomaterials research. By leveraging recent advances in artificial intelligence, natural language understanding and data-driven materials science, the approach aims to accelerate materials discovery by identifying underexplored compositions, morphologies, structure–property relationships and emerging application spaces. Ultimately, this work will provide a scalable and extensible foundation for targeted experimental validation and cross-domain innovation within the advanced material ecosystem, supporting both fundamental scientific discovery and applications of global relevance.

Keywords: Large Languge Models; Masked Language Models; nanomaterials; datasets; AI; structure-property relationships; noble metals

23 Reads
0 Recommendations

Jessica Millen

Henry Gauder

Nathan Langford

Mohammed Khan

Eugenia Valsami-Jones