A Novel Ship Driver Behavior Recognition Approach Based on Improved TSM

CHEN Chen; WEI Yuenan; MA Feng; HU Songtao; WANG Tengfei

doi:10.3963/j.jssn.1674-4861.2025.01.011

Volume 43 Issue 1

Feb. 2025

Turn off MathJax

Article Contents

Article Navigation > Journal of Transport Information and Safety > 2025 > 43(1): 120-129

CHEN Chen, WEI Yuenan, MA Feng, HU Songtao, WANG Tengfei. A Novel Ship Driver Behavior Recognition Approach Based on Improved TSM[J]. Journal of Transport Information and Safety, 2025, 43(1): 120-129. doi: 10.3963/j.jssn.1674-4861.2025.01.011

Citation:

CHEN Chen, WEI Yuenan, MA Feng, HU Songtao, WANG Tengfei. A Novel Ship Driver Behavior Recognition Approach Based on Improved TSM[J]. Journal of Transport Information and Safety, 2025, 43(1): 120-129. doi: 10.3963/j.jssn.1674-4861.2025.01.011

Citation:

PDF( 5615 KB)

A Novel Ship Driver Behavior Recognition Approach Based on Improved TSM

doi: 10.3963/j.jssn.1674-4861.2025.01.011

CHEN Chen^1
,,
WEI Yuenan¹,
MA Feng^{2, 3
,
,},
HU Songtao¹,
WANG Tengfei⁴

1.
Computer Science & Engineering Artificial Intelligence, Wuhan Institute of Technology, Wuhan 430205, China
2.
Intelligent Transportation Systems Research Center, Wuhan University of Technology, Wuhan 430063, China
3.
State Key Laboratory of Maritime Technology and Safety, Wuhan 430063, China
4.
School of Transportation and Logistics Engineering, Wuhan University of Technology, Wuhan 430063, China

Received Date: 2024-07-24
Available Online: 2025-06-27

Abstract

Abstract

In maritime transportation, irregular operations by crew onboard represent a significant factor causing maritime accidents. The design of a real-time detection method for monitoring ship driver behavior holds substantial importance. Compared to automobilism driving and security surveillance, the ship's bridge environment is more complex, posing challenges such as the inability to simultaneously monitor multiple crew members, inefficiency and lower accuracy rates. To solve this problem, a two-step multi-person behavior recognition approach combining multi-target tracking and behavior recognition is proposed. Firstly, a multi-target tracker uses the YoloV7 and ByteTracker to generate continuous feature maps of crew. Based on the temporal shift module (TSM) algorithm for single-target behavior recognition, this approach utilizes techniques such as oversampling and cross-frame stitching to process continuous feature maps. Meanwhile, it leverages EfficientNet-B3 alongside the co-ordinate attention (CA) module to produce highly accurate recognition outcomes. The research establishes a ship's bridge behavior dataset "SC-Action", with data from different ship's bridge surveillance videos, including 2,000 behavior samples of both regular and irregular behaviors. Transfer learning and ablation experiments conducted on this dataset demonstrate that the proposed method achieves real-time behavior recognition of three crew at 24 frames per second, with both recognition speed and accuracy superior to mainstream algorithms. In tests targeting single-person behavior recognition, the method's accuracy improved by 1.3% compared to the baseline TSM model after applying the image enhancement module. Incorporating attention mechanism, the accuracy further increased by 1.78%, reaching 82.1%, with only a 0.1% increase in computational load. During multi-target testing, the method also surpasses leading approaches such as SlowFast in practical inference speed and performance, affirming its efficacy.
- navigation safety,
- behavior recognition,
- target tracking,
- attention mechanism,
- temporal shift module

FullText(HTML)

References(22)

References

[1]	王晓, 余永华, 董旭, 等. 智能机舱验证平台设计与开发[J]. 船海工程, 2024, 53(4): 24-28, 35. WANG X, YU Y H, DONG X, et al. Design and development of the intelligent engine cabin verification platform[J]. Ship & Ocean Engineering, 2024, 53(4): 24-28, 35. (in Chinese)
[2]	黄亮, 张治豪, 文元桥, 等. 基于轨迹特征的船舶停留行为识别与分类[J]. 交通运输工程学报, 2021, 21(5): 189-198. HUANG L, ZHANG Z H, WEN Y Q, et al. Stopping behavior recognition and classification of ship based on trajectory characteristics[J]. Journal of Traffic and Transportation Engineering, 2021, 21(5): 189-198. (in Chinese)
[3]	CHEN J H, DI Z J, SHI J, et al. Marine oil spill pollution causes and governance: a case study of Sanchi tanker collision and explosion[J]. Journal of Cleaner Production, 2020, 273: 122978. doi: 10.1016/j.jclepro.2020.122978
[4]	ZHANG J, WU Z, LI F, et al. Attention-based convolutional and recurrent neural networks for driving behavior recognition using smartphone sensor data[J]. IEEE Access, 2019, 7: 148031-148046. doi: 10.1109/ACCESS.2019.2932434
[5]	苏晨阳, 武文红, 牛恒茂, 等. 深度学习的工人多种不安全行为识别方法综述[J]. 计算机工程与应用, 2024, 60(5): 30-46. SU C Y, WU W H, NIU H M, et al. Review of deep learning approaches for recognizing multiple unsafe behaviors in workers[J]. Computer Engineering and Applications, 2024, 60(5): 30-46. (in Chinese)
[6]	张平, 迟志诚, 陈一凡, 等. 用于自动驾驶车辆的融合注意力机制多目标跟踪算法[J]. 汽车安全与节能学报, 2021, 12 (4): 516-521. doi: 10.3969/j.issn.1674-8484.2021.04.010 ZHANG P, CHI Z C, CHEN Y F, et al. Multiple object tracking algorithm integrated with attention mechanism for autonomous vehicles[J]. Journal of Automotive Safety and Energy, 2021, 12(04): 516-521. (in Chinese) doi: 10.3969/j.issn.1674-8484.2021.04.010
[7]	ZHANG Y, SUN P, JIANG Y, et al. ByteTrack: multi-object tracking by associating every detection box[C]. Computer Vision-ECCV 2022, Israel: ECCV, 2022.
[8]	姜杰, 张立民, 刘凯, 等. 基于改进PP-YOLOE和ByteTrack算法的红外船舶目标检测跟踪方法[J]. 兵器装备工程学报, 2024, 45(11): 291-297. doi: 10.11809/bqzbgcxb2024.11.037 JIANG J, ZHANG L, LIU K, et al. Research on infrared ship target detection and tracking method based on improved pp-yoloe and bytetrack algorithms[J]. Journal of Ordnance Equipment Engineering, 2024, 45(11): 291-297. (in Chinese) doi: 10.11809/bqzbgcxb2024.11.037
[9]	陈信强, 王美琳, 李朝锋, 等. 基于深度学习与多级匹配机制的港区人员轨迹提取[J]. 交通运输系统工程与信息, 2023, 23(4): 70-79. CHEN X Q, WANG M L, LI C F, et al. Port staff trajectory extraction based on deep learning and multi-level matching mechanism[J]. Journal of Transportation Systems Engineering and Information Technology, 2023, 23(4): 70-79. (in Chinese)
[10]	高庆吉, 徐达, 罗其俊, 等. 基于深层动态特征双流网络的高效行为识别算法[J]. 计算机应用与软件, 2024, 41(9): 175-181, 189. GAO Q J, XU D, LUO Q J, et al. An efficient action recognition algorithm based on deep dynamic feature dual-stream cnn[J]. Computer Applications and Software, 2024, 41(9): 175-181, 189. (in Chinese)
[11]	LIN J, GAN C, HAN S. TSM: temporal shift module for efficient video understanding[C]. International Conference on Computer Vision(ICCV), Seoul, Korea: ICCV, 2019.
[12]	胡宏宇, 黎烨宸, 张争光, 等. 基于多尺度骨架图和局部视觉上下文融合的驾驶员行为识别方法[J]. 汽车工程, 2024, 46(1): 1-8, 28. HU H Y, LI Y C, ZHANG Z G, et al. Driver behavior recognition based on multi-scale skeleton graph and local visual context method[J]. Automotive Engineering, 2024, 46(1): 1-8, 28. (in Chinese)
[13]	吴建清, 张子毅, 王钰博, 等. 考虑多模态数据的重载货车危险驾驶行为识别方法[J]. 交通运输系统工程与信息, 2024, 24(2): 63-75. WU J Q, ZHANG Z Y, WANG Y B, et al. Method for identifying dangerous driving behaviors in heavy-duty trucks based on multi-modal data[J]. Journal of Transportation Systems Engineering and Information Technology, 2024, 24(2): 63-75. (in Chinese)
[14]	WANG S, CHEN M, RATNAVELU K, et al. Online classroom student engagement analysis based on facial expression recognition using enhanced yolov5 for mitigating cyber-bullying[J]. Measurement Science and Technology, 2024, 36(1): 015419.
[15]	章宇翔, 李先旺, 贺德强, 等. 基于改进的多算法融合地铁站内乘客行为识别[J]. 铁道科学与工程学报, 2023, 20 (11): 4096-4106. ZHANG Y X, LI X W, HE D Q. et al. Passenger action recognition in subway stations based on improved multi-algorithm fusion[J]. Journal of Railway Science and Engineering, 2023, 20(11): 4096-4106. (in Chinese)
[16]	张孝杰, 张艳伟, 邹鹰, 等. 基于改进YOLOv7的码头作业人员检测算法[J]. 交通信息与安全, 2024, 42(2): 67-75. doi: 10.3963/j.jssn.1674-4861.2024.02.007 ZHANG X J, ZHANG Y W, ZOU Y, et al. An improved yolov7 algorithm for workers detection in port terminals[J]. Journal of Transport Information and Safety, 2024, 42(2): 67-75. (in Chinese) doi: 10.3963/j.jssn.1674-4861.2024.02.007
[17]	FEICHTENHOFER C, FAN H, MALIK J, et al. SlowFast networks for video recognition[C]. International Conference on Computer Vision(ICCV), Seoul, Korea: IEEE, 2019.
[18]	SREELAKSHMY I J, KOVOOR B C. Generative inpainting of high-resolution images: redefined with Real-ESRGAN[J]. International Journal of Artificial Intelligence Tools, 2022, 31(5): 2250035.
[19]	WANG X, YU K, WU S, et al. ESRGAN: enhanced super-resolution generative adversarial networks[C]. Computer Vision-ECCV 2018 Workshops, Munich, Germany: ECCV, 2019.
[20]	ZHOU A, MA Y, JI W, et al. Multi-head attention-based two-stream EfficientNet for action recognition[J]. Multimedia Systems, 2023, 29(2): 487-498.
[21]	LI W D, LI Z Y, WANG C S, et al. An improved SSD light-weight network with coordinate attention for aircraft target recognition in scene videos[J]. Journal of Intelligent & Fuzzy Systems, 2024, 46(1): 355-368.
[22]	RUSSAKOVSKY O, DENG J, SU H, et al. ImageNet large scale visual recognition challenge[J]. International Journal of Computer Vision, 2015, 115(3): 211-252.

Relative Articles

Supplements(0)

Cited By

Proportional views

Proportional views

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Figures(9) / Tables(4)

Get Citation

PDF

XML

Article Metrics

Article views (13) PDF downloads(0)

A Novel Ship Driver Behavior Recognition Approach Based on Improved TSM

doi: 10.3963/j.jssn.1674-4861.2025.01.011

Abstract

References

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

A Novel Ship Driver Behavior Recognition Approach Based on Improved TSM

doi: 10.3963/j.jssn.1674-4861.2025.01.011

Abstract

References

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Export File

Citation

Format

Content