基于DeepSeek-LoRA微调的列车运行环境风险目标识别方法
作者:
作者单位:

1.北京市地铁运营有限公司,北京 100044;2.北京交通大学 交通运输学院,北京 100044

作者简介:

崔广炎(1994-),男,博士,工程师,主要研究方向为智能列车运行系统.email:subwaybj2025@126.com.
李宇杰(1983-),男,博士,正高级工程师,主要研究方向为智能列车运行系统.
李熙(1981-),男,博士,正高级工程师,主要研究方向为智能列车运行系统.
沈忱(1973-),男,学士,高级工程师,主要研究方向为智能列车运行系统.
李琨 (1996-),男,硕士,工程师,主要研究方向为智能列车运行系统.
刘可(1994-),男,学士,工程师,主要研究方向为智能列车运行系统.

通讯作者:

基金项目:

北京市地铁运营有限公司科研资助项目

伦理声明:



A DeepSeek-LoRA fine-tuned approach for risk object detection in railway operational environments
Author:
Ethical statement:

Affiliation:

1.Beijing Mass Transit Railway Operation Corp. Ltd. ,Beijing 100044,China;2.School of Traffic and Transportation,Beijing Jiaotong University,Beijing 100044,China

Funding:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
    摘要:

    由于地铁列车运行环境复杂,光照变化、遮挡频繁、目标尺度差异大等问题,现有模型难以有效识别列车运行中的风险目标,如对向列车、行人、落石以及异常侵入物等。针对上述问题,本文探索了视觉语言大模型在地铁列车运行风险目标识别中的可行性。基于列车运行数据,首先构建了基于低秩自适应微调(LoRA)的风险目标数据集,将标注数据重构为结构化自然语言描述。通过低秩自适应微调DeepSeek-VL2多模态大模型参数,获得了针对列车运行环境风险目标识别的优化权重,并采用自然语言问答评估模型识别精确度。实验表明,仅通过自然语言指令即可实现列车运行环境风险目标识别,F1分数达到80.5%,满足地铁列车运行场景中对对向列车和行人等风险目标的检测精确度需求,可有效降低列车碰撞风险,且模型具有极强泛化能力,可应对多种地铁列车运行场景。

    Abstract:

    Due to the complexities of train operation environments—such as frequent lighting variations, occlusions, and significant scale differences among targets—existing models struggle to effectively identify risk objects during train operations, including oncoming trains, pedestrians, falling rocks, and abnormal intrusions. To address these challenges, this study explores the feasibility of applying large visual-language models to risk object identification in train operation scenarios. Based on train operation data, this paper first constructs a risk object dataset using Low-Rank Adaptation(LoRA) fine-tuning, reformatting annotated data into structured natural language descriptions. By applying LoRA fine-tuning to the parameters of the DeepSeek-VL2 multimodal large model, optimized weights for risk object identification in train operation environments are obtained. The model's recognition accuracy is evaluated through natural language question-answering tasks. Experiments show that risk objects in train operation environments can be identified using only natural language instructions, achieving an F1-score of 80.5%. This meets the accuracy requirements for detecting risk objects such as oncoming trains and pedestrians in subway operation scenarios, effectively reducing train collision risks. Moreover, the model exhibits strong generalization capability to adapt to diverse subway train operation scenarios.

    参考文献
    相似文献
    引证文献
引用本文

崔广炎,李宇杰,李熙,沈忱,李琨,刘可.基于DeepSeek-LoRA微调的列车运行环境风险目标识别方法[J].太赫兹科学与电子信息学报,2025,23(12):1269~1277

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
历史
  • 收稿日期:2025-08-12
  • 最后修改日期:2025-11-01
  • 录用日期:
  • 在线发布日期: 2026-02-13
  • 出版日期:
关闭