spelling |
oapen-20.500.12657-890712024-04-03T02:23:10Z Chapter Identifying Hazards in Construction Sites Using Deep Learning-Based Multimodal with CCTV Data Son, Seongwoo Quoc Tran, Dai Jeon, Yuntae Park, Minsoo Park, Seunghee deep learning multimodal multiCCTV synthetic data pointcloud thema EDItEUR::U Computing and Information Technology The use of closed-circuit television (CCTV) for safety monitoring is crucial for reducing accidents in construction sites. However, the majority of currently proposed approaches utilize single detection models without considering the context of CCTV video inputs. In this study, a multimodal detection, and depth map estimation algorithm utilizing deep learning is proposed. In addition, the point cloud of the test site is acquired using a terrestrial laser scanning scanner, and the detected object's coordinates are projected into global coordinates using a homography matrix. Consequently, the effectiveness of the proposed monitoring system is enhanced by the visualization of the entire monitored scene. In addition, to validate our proposed method, a synthetic dataset of construction site accidents is simulated with Twinmotion. These scenarios are then evaluated with the proposed method to determine its precision and speed of inference. Lastly, the actual construction site, equipped with multiple CCTV cameras, is utilized for system deployment and visualization. As a result, the proposed method demonstrated its robustness in detecting potential hazards on a construction site, as well as its real-time detection speed 2024-04-02T15:45:38Z 2024-04-02T15:45:38Z 2023 chapter ONIX_20240402_9791221502893_40 2704-5846 9791221502893 https://library.oapen.org/handle/20.500.12657/89071 eng Proceedings e report application/pdf n/a 9791221502893_61.pdf https://books.fupress.com/doi/capitoli/979-12-215-0289-3_61 Firenze University Press 10.36253/979-12-215-0289-3.61 10.36253/979-12-215-0289-3.61 bf65d21a-78e5-4ba2-983a-dbfa90962870 9791221502893 137 9 Florence open access
|
description |
The use of closed-circuit television (CCTV) for safety monitoring is crucial for reducing accidents in construction sites. However, the majority of currently proposed approaches utilize single detection models without considering the context of CCTV video inputs. In this study, a multimodal detection, and depth map estimation algorithm utilizing deep learning is proposed. In addition, the point cloud of the test site is acquired using a terrestrial laser scanning scanner, and the detected object's coordinates are projected into global coordinates using a homography matrix. Consequently, the effectiveness of the proposed monitoring system is enhanced by the visualization of the entire monitored scene. In addition, to validate our proposed method, a synthetic dataset of construction site accidents is simulated with Twinmotion. These scenarios are then evaluated with the proposed method to determine its precision and speed of inference. Lastly, the actual construction site, equipped with multiple CCTV cameras, is utilized for system deployment and visualization. As a result, the proposed method demonstrated its robustness in detecting potential hazards on a construction site, as well as its real-time detection speed
|