Automating Systematic Literature Reviews with Retrieval-Augmented Generation: A Comprehensive Overview
dc.citation.issue | 19 | |
dc.citation.volume | 14 | |
dc.contributor.author | Han B | |
dc.contributor.author | Susnjak T | |
dc.contributor.author | Mathrani A | |
dc.contributor.editor | Garcia Villalba LJ | |
dc.date.accessioned | 2024-10-29T01:02:26Z | |
dc.date.available | 2024-10-29T01:02:26Z | |
dc.date.issued | 2024-10-09 | |
dc.description.abstract | This study examines Retrieval-Augmented Generation (RAG) in large language models (LLMs) and their significant application for undertaking systematic literature reviews (SLRs). RAG-based LLMs can potentially automate tasks like data extraction, summarization, and trend identification. However, while LLMs are exceptionally proficient in generating human-like text and interpreting complex linguistic nuances, their dependence on static, pre-trained knowledge can result in inaccuracies and hallucinations. RAG mitigates these limitations by integrating LLMs’ generative capabilities with the precision of real-time information retrieval. We review in detail the three key processes of the RAG framework—retrieval, augmentation, and generation. We then discuss applications of RAG-based LLMs to SLR automation and highlight future research topics, including integration of domain-specific LLMs, multimodal data processing and generation, and utilization of multiple retrieval sources. We propose a framework of RAG-based LLMs for automating SRLs, which covers four stages of SLR process: literature search, literature screening, data extraction, and information synthesis. Future research aims to optimize the interaction between LLM selection, training strategies, RAG techniques, and prompt engineering to implement the proposed framework, with particular emphasis on the retrieval of information from individual scientific papers and the integration of these data to produce outputs addressing various aspects such as current status, existing gaps, and emerging trends. | |
dc.description.confidential | false | |
dc.edition.edition | October-1 2024 | |
dc.identifier.citation | Han B, Susnjak T, Mathrani A. (2024). Automating Systematic Literature Reviews with Retrieval-Augmented Generation: A Comprehensive Overview. Applied Sciences (Switzerland). 14. 19. | |
dc.identifier.doi | 10.3390/app14199103 | |
dc.identifier.eissn | 2076-3417 | |
dc.identifier.elements-type | journal-article | |
dc.identifier.number | 9103 | |
dc.identifier.uri | https://mro.massey.ac.nz/handle/10179/71855 | |
dc.language | English | |
dc.publisher | MDPI (Basel, Switzerland) | |
dc.publisher.uri | https://www.mdpi.com/2076-3417/14/19/9103 | |
dc.relation.isPartOf | Applied Sciences (Switzerland) | |
dc.rights | (c) 2024 The Author/s | |
dc.rights | CC BY 4.0 | |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
dc.subject | retrieval-augmented generation | |
dc.subject | large language models | |
dc.subject | systematic literature review | |
dc.title | Automating Systematic Literature Reviews with Retrieval-Augmented Generation: A Comprehensive Overview | |
dc.type | Journal article | |
pubs.elements-id | 492028 | |
pubs.organisational-group | College of Health |