面向工程监理的高质量数据集构建与检索增强生成应用研究

面向工程监理的高质量数据集构建与检索增强生成应用研究
DOI:
                        
CSTR:
                        
作者:
                        
作者单位:1.思立博（上海）工程咨询有限公司;2.上海建科工程咨询有限公司
作者简介:
通讯作者:
中图分类号:
基金项目:

Research on High-Quality Dataset Construction and Retrieval-Augmented Generation Applications for Engineering Supervision

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

面向工程监理行业的数智化转型需求，针对基础模型在专业语义理解与工程场景适配方面的局限性，本文围绕高质量数据集构建与检索增强生成（Retrieval-Augmented Generation，RAG）技术开展系统研究。在《高质量数据集建设指引》的框架下，本文探讨了面向监理业务场景的高质量数据集构建与应用路径。基于JKinco 筑衍·工程监理大模型测评集，采用覆盖忠实性、答案正确性、答案相关性、上下文精度、上下文召回率及噪声敏感性的六维评测体系，分别对基础模型与引入检索增强生成架构的基础模型开展性能对比研究。结果表明，基础模型综合性能得分为0.52，各项指标整体表现偏弱。引入 RAG 架构后，模型综合性能得分提升至0.73，相对提升约 40.3%，各维度性能均得到明显优化。研究表明，通过引入外部高质量数据集，RAG 架构能够有效解决基础模型在监理专业知识不足及复杂场景推理能力有限方面的问题。本研究为工程监理领域垂直大模型的构建提供了可行技术路径，并对推动行业在“十五五”期间实现数智化转型具有参考意义。

Abstract:

In response to the digital and intelligent transformation demands of the engineering supervision industry, and addressing the limitations of foundation models in professional semantic understanding and engineering-scene adaptation, this study systematically investigates high-quality dataset construction and Retrieval-Augmented Generation (RAG) techniques. Within the framework of the Guidelines for High-Quality Dataset Development, a methodological pathway for constructing and applying high-quality datasets tailored to engineering supervision scenarios is proposed.Based on the JKinco Zhuyan Engineering Supervision Large Model Evaluation Benchmark, a six-dimensional evaluation framework encompassing faithfulness, answer correctness, answer relevance, context precision, context recall, and noise sensitivity was established to comparatively assess the performance of a baseline foundation model and its RAG-enhanced counterpart. The results indicate that the baseline model achieved an overall performance score of 0.52, with relatively weak performance across all evaluation dimensions. After integrating the RAG architecture, the overall score increased to 0.73, representing an improvement of approximately 40.3%, with substantial enhancements observed across all evaluation metrics.The findings demonstrate that the incorporation of external high-quality datasets through the RAG framework effectively mitigates the deficiencies of foundation models in engineering supervision knowledge and complex scenario reasoning. This study provides a feasible technical pathway for developing domain-specific large models in the engineering supervision sector and offers practical insights for advancing the industry’s digital and intelligent transformation during China’s Fifteenth Five-Year Plan period.

参考文献

相似文献

引证文献

引用本文

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2026-04-20
最后修改日期:2026-05-27
录用日期:2026-06-01
在线发布日期:
出版日期:

首页

期刊简介

编委会

投稿指南

期刊订阅

广告合作

过刊浏览

联系我们

引用本文

分享

相关视频

文章指标

历史

文章二维码