Retrieval augmented generation using engineering design knowledge
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Article number | 112410 |
Journal / Publication | Knowledge-Based Systems |
Volume | 303 |
Online published | 24 Aug 2024 |
Publication status | Published - 4 Nov 2024 |
Link(s)
Abstract
Aiming to support Retrieval Augmented Generation (RAG) in the design process, we present a method to identify explicit, engineering design facts – {head entity:: relationship:: tail entity} from patented artefact descriptions. Given a sentence with a pair of entities (selected from noun phrases) marked in a unique manner, our method extracts their relationship that is explicitly communicated in the sentence. For this task, we create a dataset of 375,084 examples and fine-tune language models for relation identification (token classification task) and relation elicitation (sequence-to-sequence task). The token classification approach achieves up to 99.7% accuracy. Upon applying the method to a domain of 4,870 fan system patents, we populate a knowledge base of over 2.93 million facts. Using this knowledge base, we demonstrate how Large Language Models (LLMs) are guided by explicit facts to synthesise knowledge and generate technical and cohesive responses when sought out for knowledge retrieval tasks in the design process. © 2024 Elsevier B.V.
Research Area(s)
- Engineering design knowledge, Graph neural networks, Knowledge graphs, Large-language models, Patent documents, Retrieval-augmented generation
Citation Format(s)
Retrieval augmented generation using engineering design knowledge. / Siddharth, L.; Luo, Jianxi.
In: Knowledge-Based Systems, Vol. 303, 112410, 04.11.2024.
In: Knowledge-Based Systems, Vol. 303, 112410, 04.11.2024.
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review