9791221502893_87.pdf

Symbols are a universal way to convey complex information in technical drawings since they can represent a wide range of elements, including components, materials, or relationships, in a concise and space-saving manner. Therefore, to enable a digital and automatic interpretation of pixel-based drawi...

Πλήρης περιγραφή

Λεπτομέρειες βιβλιογραφικής εγγραφής
Γλώσσα:English
Έκδοση: Firenze University Press 2024
Διαθέσιμο Online:https://books.fupress.com/doi/capitoli/979-12-215-0289-3_87
id oapen-20.500.12657-89045
record_format dspace
spelling oapen-20.500.12657-890452024-04-03T02:22:49Z Chapter A Comparative Study of Deep Learning Models for Symbol Detection in Technical Drawings Gann, Damaris Faltin, Benedikt König, Markus Computer Vision Technical Drawings Symbol Detection Comparative Study thema EDItEUR::U Computing and Information Technology::UT Computer networking and communications::UTV Virtualization Symbols are a universal way to convey complex information in technical drawings since they can represent a wide range of elements, including components, materials, or relationships, in a concise and space-saving manner. Therefore, to enable a digital and automatic interpretation of pixel-based drawings, accurate detection of symbols is a crucial step. To enhance the efficiency of the digitization process, current research focuses on automating this symbol detection using deep learning models. However, the ever-increasing repertoire of model architectures poses a challenge for researchers and practitioners alike in retaining an overview of the latest advancements and selecting the most suitable model architecture for their respective use cases. To provide guidance, this contribution conducts a comparative study of prevalent and state-of-the-art model architectures for the task of symbol detection in pixel-based construction drawings. Therefore, this study evaluates six different object detection model architectures, including YOLOv5, YOLOv7, YOLOv8, Swin-Transformer, ConvNeXt, and Faster-RCNN. These models are trained and tested on two distinct datasets from the bridge and residential building domains, both representing substantial sub-sectors of the construction industry. Furthermore, the models are evaluated based on five criteria, i.e., detection accuracy, robustness to data scarcity, training time, inference time, and model size. In summary, our comparative study highlights the performance and capabilities of different deep learning models for symbol detection in construction drawings. Through the comprehensive evaluation and practical insights, this research facilitates the advancement of automated symbol detection by showing the strengths and weaknesses of the model architectures, thus providing users with valuable guidance in choosing the most appropriate model for their real-world applications 2024-04-02T15:44:42Z 2024-04-02T15:44:42Z 2023 chapter ONIX_20240402_9791221502893_14 2704-5846 9791221502893 https://library.oapen.org/handle/20.500.12657/89045 eng Proceedings e report application/pdf n/a 9791221502893_87.pdf https://books.fupress.com/doi/capitoli/979-12-215-0289-3_87 Firenze University Press 10.36253/979-12-215-0289-3.87 10.36253/979-12-215-0289-3.87 bf65d21a-78e5-4ba2-983a-dbfa90962870 9791221502893 137 10 Florence open access
institution OAPEN
collection DSpace
language English
description Symbols are a universal way to convey complex information in technical drawings since they can represent a wide range of elements, including components, materials, or relationships, in a concise and space-saving manner. Therefore, to enable a digital and automatic interpretation of pixel-based drawings, accurate detection of symbols is a crucial step. To enhance the efficiency of the digitization process, current research focuses on automating this symbol detection using deep learning models. However, the ever-increasing repertoire of model architectures poses a challenge for researchers and practitioners alike in retaining an overview of the latest advancements and selecting the most suitable model architecture for their respective use cases. To provide guidance, this contribution conducts a comparative study of prevalent and state-of-the-art model architectures for the task of symbol detection in pixel-based construction drawings. Therefore, this study evaluates six different object detection model architectures, including YOLOv5, YOLOv7, YOLOv8, Swin-Transformer, ConvNeXt, and Faster-RCNN. These models are trained and tested on two distinct datasets from the bridge and residential building domains, both representing substantial sub-sectors of the construction industry. Furthermore, the models are evaluated based on five criteria, i.e., detection accuracy, robustness to data scarcity, training time, inference time, and model size. In summary, our comparative study highlights the performance and capabilities of different deep learning models for symbol detection in construction drawings. Through the comprehensive evaluation and practical insights, this research facilitates the advancement of automated symbol detection by showing the strengths and weaknesses of the model architectures, thus providing users with valuable guidance in choosing the most appropriate model for their real-world applications
title 9791221502893_87.pdf
spellingShingle 9791221502893_87.pdf
title_short 9791221502893_87.pdf
title_full 9791221502893_87.pdf
title_fullStr 9791221502893_87.pdf
title_full_unstemmed 9791221502893_87.pdf
title_sort 9791221502893_87.pdf
publisher Firenze University Press
publishDate 2024
url https://books.fupress.com/doi/capitoli/979-12-215-0289-3_87
_version_ 1799945299508068352