Volume 43 Issue 1
Feb.  2025
Turn off MathJax
Article Contents
CHEN Chen, WEI Yuenan, MA Feng, HU Songtao, WANG Tengfei. A Novel Ship Driver Behavior Recognition Approach Based on Improved TSM[J]. Journal of Transport Information and Safety, 2025, 43(1): 120-129. doi: 10.3963/j.jssn.1674-4861.2025.01.011
Citation: CHEN Chen, WEI Yuenan, MA Feng, HU Songtao, WANG Tengfei. A Novel Ship Driver Behavior Recognition Approach Based on Improved TSM[J]. Journal of Transport Information and Safety, 2025, 43(1): 120-129. doi: 10.3963/j.jssn.1674-4861.2025.01.011

A Novel Ship Driver Behavior Recognition Approach Based on Improved TSM

doi: 10.3963/j.jssn.1674-4861.2025.01.011
  • Received Date: 2024-07-24
    Available Online: 2025-06-27
  • In maritime transportation, irregular operations by crew onboard represent a significant factor causing maritime accidents. The design of a real-time detection method for monitoring ship driver behavior holds substantial importance. Compared to automobilism driving and security surveillance, the ship's bridge environment is more complex, posing challenges such as the inability to simultaneously monitor multiple crew members, inefficiency and lower accuracy rates. To solve this problem, a two-step multi-person behavior recognition approach combining multi-target tracking and behavior recognition is proposed. Firstly, a multi-target tracker uses the YoloV7 and ByteTracker to generate continuous feature maps of crew. Based on the temporal shift module (TSM) algorithm for single-target behavior recognition, this approach utilizes techniques such as oversampling and cross-frame stitching to process continuous feature maps. Meanwhile, it leverages EfficientNet-B3 alongside the co-ordinate attention (CA) module to produce highly accurate recognition outcomes. The research establishes a ship's bridge behavior dataset "SC-Action", with data from different ship's bridge surveillance videos, including 2,000 behavior samples of both regular and irregular behaviors. Transfer learning and ablation experiments conducted on this dataset demonstrate that the proposed method achieves real-time behavior recognition of three crew at 24 frames per second, with both recognition speed and accuracy superior to mainstream algorithms. In tests targeting single-person behavior recognition, the method's accuracy improved by 1.3% compared to the baseline TSM model after applying the image enhancement module. Incorporating attention mechanism, the accuracy further increased by 1.78%, reaching 82.1%, with only a 0.1% increase in computational load. During multi-target testing, the method also surpasses leading approaches such as SlowFast in practical inference speed and performance, affirming its efficacy.

     

  • loading
  • [1]
    王晓, 余永华, 董旭, 等. 智能机舱验证平台设计与开发[J]. 船海工程, 2024, 53(4): 24-28, 35.

    WANG X, YU Y H, DONG X, et al. Design and development of the intelligent engine cabin verification platform[J]. Ship & Ocean Engineering, 2024, 53(4): 24-28, 35. (in Chinese)
    [2]
    黄亮, 张治豪, 文元桥, 等. 基于轨迹特征的船舶停留行为识别与分类[J]. 交通运输工程学报, 2021, 21(5): 189-198.

    HUANG L, ZHANG Z H, WEN Y Q, et al. Stopping behavior recognition and classification of ship based on trajectory characteristics[J]. Journal of Traffic and Transportation Engineering, 2021, 21(5): 189-198. (in Chinese)
    [3]
    CHEN J H, DI Z J, SHI J, et al. Marine oil spill pollution causes and governance: a case study of Sanchi tanker collision and explosion[J]. Journal of Cleaner Production, 2020, 273: 122978. doi: 10.1016/j.jclepro.2020.122978
    [4]
    ZHANG J, WU Z, LI F, et al. Attention-based convolutional and recurrent neural networks for driving behavior recognition using smartphone sensor data[J]. IEEE Access, 2019, 7: 148031-148046. doi: 10.1109/ACCESS.2019.2932434
    [5]
    苏晨阳, 武文红, 牛恒茂, 等. 深度学习的工人多种不安全行为识别方法综述[J]. 计算机工程与应用, 2024, 60(5): 30-46.

    SU C Y, WU W H, NIU H M, et al. Review of deep learning approaches for recognizing multiple unsafe behaviors in workers[J]. Computer Engineering and Applications, 2024, 60(5): 30-46. (in Chinese)
    [6]
    张平, 迟志诚, 陈一凡, 等. 用于自动驾驶车辆的融合注意力机制多目标跟踪算法[J]. 汽车安全与节能学报, 2021, 12 (4): 516-521. doi: 10.3969/j.issn.1674-8484.2021.04.010

    ZHANG P, CHI Z C, CHEN Y F, et al. Multiple object tracking algorithm integrated with attention mechanism for autonomous vehicles[J]. Journal of Automotive Safety and Energy, 2021, 12(04): 516-521. (in Chinese) doi: 10.3969/j.issn.1674-8484.2021.04.010
    [7]
    ZHANG Y, SUN P, JIANG Y, et al. ByteTrack: multi-object tracking by associating every detection box[C]. Computer Vision-ECCV 2022, Israel: ECCV, 2022.
    [8]
    姜杰, 张立民, 刘凯, 等. 基于改进PP-YOLOE和ByteTrack算法的红外船舶目标检测跟踪方法[J]. 兵器装备工程学报, 2024, 45(11): 291-297. doi: 10.11809/bqzbgcxb2024.11.037

    JIANG J, ZHANG L, LIU K, et al. Research on infrared ship target detection and tracking method based on improved pp-yoloe and bytetrack algorithms[J]. Journal of Ordnance Equipment Engineering, 2024, 45(11): 291-297. (in Chinese) doi: 10.11809/bqzbgcxb2024.11.037
    [9]
    陈信强, 王美琳, 李朝锋, 等. 基于深度学习与多级匹配机制的港区人员轨迹提取[J]. 交通运输系统工程与信息, 2023, 23(4): 70-79.

    CHEN X Q, WANG M L, LI C F, et al. Port staff trajectory extraction based on deep learning and multi-level matching mechanism[J]. Journal of Transportation Systems Engineering and Information Technology, 2023, 23(4): 70-79. (in Chinese)
    [10]
    高庆吉, 徐达, 罗其俊, 等. 基于深层动态特征双流网络的高效行为识别算法[J]. 计算机应用与软件, 2024, 41(9): 175-181, 189.

    GAO Q J, XU D, LUO Q J, et al. An efficient action recognition algorithm based on deep dynamic feature dual-stream cnn[J]. Computer Applications and Software, 2024, 41(9): 175-181, 189. (in Chinese)
    [11]
    LIN J, GAN C, HAN S. TSM: temporal shift module for efficient video understanding[C]. International Conference on Computer Vision(ICCV), Seoul, Korea: ICCV, 2019.
    [12]
    胡宏宇, 黎烨宸, 张争光, 等. 基于多尺度骨架图和局部视觉上下文融合的驾驶员行为识别方法[J]. 汽车工程, 2024, 46(1): 1-8, 28.

    HU H Y, LI Y C, ZHANG Z G, et al. Driver behavior recognition based on multi-scale skeleton graph and local visual context method[J]. Automotive Engineering, 2024, 46(1): 1-8, 28. (in Chinese)
    [13]
    吴建清, 张子毅, 王钰博, 等. 考虑多模态数据的重载货车危险驾驶行为识别方法[J]. 交通运输系统工程与信息, 2024, 24(2): 63-75.

    WU J Q, ZHANG Z Y, WANG Y B, et al. Method for identifying dangerous driving behaviors in heavy-duty trucks based on multi-modal data[J]. Journal of Transportation Systems Engineering and Information Technology, 2024, 24(2): 63-75. (in Chinese)
    [14]
    WANG S, CHEN M, RATNAVELU K, et al. Online classroom student engagement analysis based on facial expression recognition using enhanced yolov5 for mitigating cyber-bullying[J]. Measurement Science and Technology, 2024, 36(1): 015419.
    [15]
    章宇翔, 李先旺, 贺德强, 等. 基于改进的多算法融合地铁站内乘客行为识别[J]. 铁道科学与工程学报, 2023, 20 (11): 4096-4106.

    ZHANG Y X, LI X W, HE D Q. et al. Passenger action recognition in subway stations based on improved multi-algorithm fusion[J]. Journal of Railway Science and Engineering, 2023, 20(11): 4096-4106. (in Chinese)
    [16]
    张孝杰, 张艳伟, 邹鹰, 等. 基于改进YOLOv7的码头作业人员检测算法[J]. 交通信息与安全, 2024, 42(2): 67-75. doi: 10.3963/j.jssn.1674-4861.2024.02.007

    ZHANG X J, ZHANG Y W, ZOU Y, et al. An improved yolov7 algorithm for workers detection in port terminals[J]. Journal of Transport Information and Safety, 2024, 42(2): 67-75. (in Chinese) doi: 10.3963/j.jssn.1674-4861.2024.02.007
    [17]
    FEICHTENHOFER C, FAN H, MALIK J, et al. SlowFast networks for video recognition[C]. International Conference on Computer Vision(ICCV), Seoul, Korea: IEEE, 2019.
    [18]
    SREELAKSHMY I J, KOVOOR B C. Generative inpainting of high-resolution images: redefined with Real-ESRGAN[J]. International Journal of Artificial Intelligence Tools, 2022, 31(5): 2250035.
    [19]
    WANG X, YU K, WU S, et al. ESRGAN: enhanced super-resolution generative adversarial networks[C]. Computer Vision-ECCV 2018 Workshops, Munich, Germany: ECCV, 2019.
    [20]
    ZHOU A, MA Y, JI W, et al. Multi-head attention-based two-stream EfficientNet for action recognition[J]. Multimedia Systems, 2023, 29(2): 487-498.
    [21]
    LI W D, LI Z Y, WANG C S, et al. An improved SSD light-weight network with coordinate attention for aircraft target recognition in scene videos[J]. Journal of Intelligent & Fuzzy Systems, 2024, 46(1): 355-368.
    [22]
    RUSSAKOVSKY O, DENG J, SU H, et al. ImageNet large scale visual recognition challenge[J]. International Journal of Computer Vision, 2015, 115(3): 211-252.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Figures(9)  / Tables(4)

    Article Metrics

    Article views (13) PDF downloads(0) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return