Creating 3D shape abstractions using superquadric surfaces

Abstracting complex 3d shapes with simple geometric primitives is an idea that has existed for a long time. Applications such as modeling, design, recognition, robotic grasping, and computer graphics can make the most of parsimonious, low-dimensional, interpretable shape representations. However, mu...

Full description

Bibliographic Details
Main Author:	Φώτης, Βλάσιος
Other Authors:	Fotis, Vlassis
Language:	English
Published:	2021
Subjects:	Computer vision Shape abstractions Geometric deep learning Point clouds Superquadric surfaces Convolutional neural networks Sparse convolution Υπολογιστική όραση Αφαιρετικές αναπαραστάσεις Γεωμετρική βαθιά μάθηση Νέφη σημείων Υπερτετραγωνικές επιφάνειες Συνελικτικά νευρωνικά δίκτυα Αραιή συνέλιξη
Online Access:	http://hdl.handle.net/10889/14977

id	nemertes-10889-14977
record_format	dspace
institution	UPatras
collection	Nemertes
language	English
topic	Computer vision Shape abstractions Geometric deep learning Point clouds Superquadric surfaces Convolutional neural networks Sparse convolution Υπολογιστική όραση Αφαιρετικές αναπαραστάσεις Γεωμετρική βαθιά μάθηση Νέφη σημείων Υπερτετραγωνικές επιφάνειες Συνελικτικά νευρωνικά δίκτυα Αραιή συνέλιξη
spellingShingle	Computer vision Shape abstractions Geometric deep learning Point clouds Superquadric surfaces Convolutional neural networks Sparse convolution Υπολογιστική όραση Αφαιρετικές αναπαραστάσεις Γεωμετρική βαθιά μάθηση Νέφη σημείων Υπερτετραγωνικές επιφάνειες Συνελικτικά νευρωνικά δίκτυα Αραιή συνέλιξη Φώτης, Βλάσιος Creating 3D shape abstractions using superquadric surfaces
description	Abstracting complex 3d shapes with simple geometric primitives is an idea that has existed for a long time. Applications such as modeling, design, recognition, robotic grasping, and computer graphics can make the most of parsimonious, low-dimensional, interpretable shape representations. However, multiple failures due to lack of computational power and data caused this idea to be abandoned, until advancements in hardware technology and computer vision allowed it to resurface. While multiple works are dedicated to this very problem, most of them require supervision, work with low descriptive capacity geometric primitives or operate on impractical data, such as depth images or multi-views, that are unable to properly and directly capture depth and shape details. In this thesis, we seek to tackle this problem using an unsupervised, learning-based approach to this problem, using superquadric surfaces as the defacto geometric primitive. We utilize point clouds, a very popular data format that can be easily captured by sensory devices and inherently incorporates depth information. In order to take advantage of the success of convolutional neural networks, we use a volumetric representation of the input point clouds. Contrary to most works, we introduce a model based on sparse convolution, eliminating the memory disadvantages of volumetric approaches and allowing a significantly deeper architecture. We first diagnose possible problems with the model and attempt to remedy them by employing popular, architectural, and optimization techniques. We then proceed to evaluate our models on 3 categories of the ShapeNet dataset and use common metrics to assess their quality. Finally, we make empirical observations about the model’s behaviour and test these observations by experimenting with missing and noisy data to prove the model’s robustness under harsh conditions.
author2	Fotis, Vlassis
author_facet	Fotis, Vlassis Φώτης, Βλάσιος
author	Φώτης, Βλάσιος
author_sort	Φώτης, Βλάσιος
title	Creating 3D shape abstractions using superquadric surfaces
title_short	Creating 3D shape abstractions using superquadric surfaces
title_full	Creating 3D shape abstractions using superquadric surfaces
title_fullStr	Creating 3D shape abstractions using superquadric surfaces
title_full_unstemmed	Creating 3D shape abstractions using superquadric surfaces
title_sort	creating 3d shape abstractions using superquadric surfaces
publishDate	2021
url	http://hdl.handle.net/10889/14977
work_keys_str_mv	AT phōtēsblasios creating3dshapeabstractionsusingsuperquadricsurfaces AT phōtēsblasios dēmiourgiaaphairetikōnanaparastaseōnse3dmontelamechrēsēypertetragōnikōnepiphaneiōn
_version_	1771297340422881280
spelling	nemertes-10889-149772022-09-05T20:52:05Z Creating 3D shape abstractions using superquadric surfaces Δημιουργία αφαιρετικών αναπαραστάσεων σε 3D μοντέλα, με χρήση υπερτετραγωνικών επιφανειών Φώτης, Βλάσιος Fotis, Vlassis Computer vision Shape abstractions Geometric deep learning Point clouds Superquadric surfaces Convolutional neural networks Sparse convolution Υπολογιστική όραση Αφαιρετικές αναπαραστάσεις Γεωμετρική βαθιά μάθηση Νέφη σημείων Υπερτετραγωνικές επιφάνειες Συνελικτικά νευρωνικά δίκτυα Αραιή συνέλιξη Abstracting complex 3d shapes with simple geometric primitives is an idea that has existed for a long time. Applications such as modeling, design, recognition, robotic grasping, and computer graphics can make the most of parsimonious, low-dimensional, interpretable shape representations. However, multiple failures due to lack of computational power and data caused this idea to be abandoned, until advancements in hardware technology and computer vision allowed it to resurface. While multiple works are dedicated to this very problem, most of them require supervision, work with low descriptive capacity geometric primitives or operate on impractical data, such as depth images or multi-views, that are unable to properly and directly capture depth and shape details. In this thesis, we seek to tackle this problem using an unsupervised, learning-based approach to this problem, using superquadric surfaces as the defacto geometric primitive. We utilize point clouds, a very popular data format that can be easily captured by sensory devices and inherently incorporates depth information. In order to take advantage of the success of convolutional neural networks, we use a volumetric representation of the input point clouds. Contrary to most works, we introduce a model based on sparse convolution, eliminating the memory disadvantages of volumetric approaches and allowing a significantly deeper architecture. We first diagnose possible problems with the model and attempt to remedy them by employing popular, architectural, and optimization techniques. We then proceed to evaluate our models on 3 categories of the ShapeNet dataset and use common metrics to assess their quality. Finally, we make empirical observations about the model’s behaviour and test these observations by experimenting with missing and noisy data to prove the model’s robustness under harsh conditions. Η δημιουργία αφαιρετικών αναπαραστάσεων σε τρισδιάστατα σχήματα με χρήση γεωμετρικών στοιχείων έχει προσελκύσει αρκετό ερευνητικό ενδιαφέρον ανά τα χρόνια. Εφαρμογές όπως η μοντελοποίηση, η σχεδίαση, η αναγνώριση, η ρομποτική και τα γραφικά υπολογιστών, μπορούν να εκμεταλλευτούν σε μεγάλο βαθμό μια φειδωλή, χαμηλοδιάστατη, ερμηνεύσιμη αναπαράσταση περίπλοκων σχημάτων. Ωστόσο, η έλλειψη επαρκούς υπολογιστικής ισχύος και μεγάλου όγκου δεδομένων δυσκόλευαν πολύ αυτό το εγχείρημα, έως ότου τεχνολογικά επιτεύγματα στους τομείς του υλικού και της υπολογιστικής όρασης επέτρεψαν να ξαναέρθει στην επιφάνεια. Παρότι πληθώρα εργασιών είναι αφιερωμένες σε αυτό ακριβώς το πρόβλημα, οι περισσότερες χρειάζονται επίβλεψη, δουλεύουν με γεωμετρικά στοιχεία χαμηλής περιγραφικής ικανότητας ή χρησιμοποιούν μη-πρακτικά δεδομένα, όπως εικόνες βάθους, χάνοντας έτσι αρκετή πληροφορία σχετικά με τις λεπτομέρειες του σχήματος. Σε αυτή την διπλωματική εργασία επιχειρούμε να αντιμετωπίσουμε αυτό το πρόβλημα, ακολουθώντας μια προσέγγιση μη επιβλεπόμενης μάθησης, και χρησιμοποιώντας υπερτετραγωνικές επιφάνειες ως το ντεφάκτο γεωμετρικό στοιχείο. Ως είσοδο στο μοντέλο μάθησης, χρησιμοποιούμε νέφη σημείων, μια ιδιαίτερα δημοφιλή δομή δεδομένων που συμπεριλαμβάνει εξ ορισμού πληροφορία βάθους, και είναι έυκολα διαθέσιμη μέσω αισθητήρων. Για να εκμεταλλευτούμε όμως την επιτυχία των συνελικτικών νευρωνικών δικτύων, υιοθετούμε μια ενδιάμεση ογκομετρική αναπαράσταση της εισόδου. Εν αντιθέσει με τις περισσότερες εργασίες, εισάγουμε ένα μοντέλο βασισμένο στην αραιή συνέλιξη, εξαλείφοντας τα μειονεκτήματα μνήμης που έρχονται με τις ογκομετρικές αναπαραστάσεις, και ανοίγοντας χώρο για μια εξαιρετικά βαθύτερη αρχιτεκτονική. Μελετάμε το πρόβλημα και πραγματοποιούμε διάγνωση των προβλημάτων του μοντέλου, τα οποία μετά αντιμετωπίζουμε χρησιμοποιώντας δημοφιλείς τεχνικές βελτιστοποίησης και σχεδιαστικές επιλογές. ΄Επειτα αξιολογούμε το μοντέλο επάνω σε 3 κατηγορίες του συνόλου δεδομένων ShapeNet και χρησιμοποιούμε μετρικές για να εκτιμήσουμε την ποιότητα των αποτελεσμάτων. Τέλος, κάνουμε ορισμένες εμπειρικές παρατηρήσεις επάνω στη συμπεριφορά του μοντέλου, και ελέγχουμε αν ισχύουν πραγματοποιώντας πειράματα με ελλιπή και θορυβώδη δεδομένα, αποδεικνύοντας την ευρωστία του μοντέλου υπό αντίξοες συνθήκες. 2021-07-12T06:37:13Z 2021-07-12T06:37:13Z 2021-07-07 http://hdl.handle.net/10889/14977 en application/pdf

Creating 3D shape abstractions using superquadric surfaces

Similar Items