Document Retrieval Augmented Fine-Tuning (DRAFT) for safety-critical software assessment

Regan Bolton, Mohammadreza Sheikhfathollahi, Simon Parkinson, Vanessa Vulovic, Gary Bamford, Dan Basher, Howard Parkinson

Research output: Contribution to journalArticlepeer-review

Abstract

The evaluation of safety critical software requires a robust evaluation against complex regulatory frameworks, a process traditionally limited by manual evaluation. This paper presents Document Retrieval-Augmented Fine-Tuning (DRAFT), a novel approach that enhances the capabilities of a large language model (LLM) for safety-critical compliance assessment. DRAFT builds upon existing Retrieval-Augmented Generation (RAG) techniques by introducing a novel fine-tuning framework that accommodates our dual-retrieval architecture, which simultaneously accesses both software documentation and applicable reference standards. To fine-tune DRAFT, we develop a semi-automated dataset generation methodology that incorporates variable numbers of relevant documents with meaningful distractors, closely mirroring real-world assessment scenarios. Experiments with GPT-4o-mini demonstrate an improvement in correctness over the baseline model, with qualitative improvements in evidence handling, response structure, and domain-specific reasoning. DRAFT represents a practical approach to improving compliance assessment systems while maintaining the transparency and evidence-based reasoning essential in regulatory domains.
Original languageEnglish
Article number11338754
Pages (from-to)7152-7163
Number of pages12
JournalIEEE Access
Volume14
Early online date12 Jan 2026
DOIs
Publication statusPublished - 15 Jan 2026

Fingerprint

Dive into the research topics of 'Document Retrieval Augmented Fine-Tuning (DRAFT) for safety-critical software assessment'. Together they form a unique fingerprint.

Cite this