Abstract
The integration of Large Language Models (LLMs) into software systems is transforming regulated sectors like insurance, where precision, compliance, and efficiency are essential. While proprietary LLMs like GPT-4 offer state-of-the-art performance, their closed-source nature and high computational demands constrain adoption in privacy-sensitive and cost-restricted InsurTech environments. In response, this paper investigates how lightweight, open-source LLMs can be effectively deployed for domain-specific question answering in insurance, emphasizing software engineering considerations such as modularity, inference stability, and prompt orchestration. We propose a software-engineered evaluation framework tailored to insurance-related tasks, featuring modular prompt management, automated rubricbased evaluation, and backend support for reproducibility and compliance tracking. A curated benchmark dataset derived from the Hong Kong Insurance Intermediaries Qualifying Examination (IIQE) is constructed to reflect real-world regulatory and operational challenges. Ten open-source models are systematically evaluated across four question types using both standard and Chain-of-Thought (CoT) prompting strategies. Our findings show that compact models such as DeepSeek-R1-1.5B achieve strong accuracy with minimal resource consumption, making them suitable for practical deployment. CoT prompting further enhances reasoning performance, particularly for models with 3B parameters or more. With proper prompt design and modular deployment, lightweight LLMs can support secure, efficient, and interpretable InsurTech applications, enabling trustworthy AI-driven software systems in regulated domains. © 2025 IEEE.
| Original language | English |
|---|---|
| Title of host publication | 2025 32nd Asia-Pacific Software Engineering Conference (APSEC) |
| Publisher | IEEE |
| Pages | 785-795 |
| Number of pages | 11 |
| ISBN (Electronic) | 979-8-3315-6653-1 |
| ISBN (Print) | 979-8-3315-6654-8 |
| DOIs | |
| Publication status | Published - Dec 2025 |
| Event | 32nd Asia-Pacific Software Engineering Conference (APSEC 2025) - Wynn Palace, Macao, China Duration: 2 Dec 2025 → 5 Dec 2025 https://conf.researchr.org/home/apsec-2025 |
Publication series
| Name | Proceedings - Asia-Pacific Software Engineering Conference, APSEC |
|---|---|
| ISSN (Print) | 1530-1362 |
| ISSN (Electronic) | 2640-0715 |
Conference
| Conference | 32nd Asia-Pacific Software Engineering Conference (APSEC 2025) |
|---|---|
| Abbreviated title | APSEC 2025 |
| Place | Macao, China |
| Period | 2/12/25 → 5/12/25 |
| Internet address |
Research Keywords
- Chain-of-Thought Reasoning
- InsurTech
- Large Language Models
- Prompt Engineering
- Software System Integration
Fingerprint
Dive into the research topics of 'Towards Lightweight LLM Software Solutions for InsurTech: A Framework for Scalable Question Answering Systems'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver