Show simple item record

dc.contributor.advisor: Fan, Guoliang
dc.contributor.author: Guo, Lin
dc.date.accessioned: 2021-05-25T20:32:03Z
dc.date.available: 2021-05-25T20:32:03Z
dc.date.issued: 2020-12
dc.identifier.uri: https://hdl.handle.net/11244/329911
dc.description.abstract: Intelligent robots require advanced vision capabilities to perceive and interact with the real physical world. While computer vision has made great strides in recent years, its predominant paradigm still builds deep-learning networks or handcrafted features to perform semantic labeling and instance segmentation separately and independently. However, the two tasks should be synergistically unified in the recognition flow, since they are complementary in scene understanding.
dc.description.abstract: This dissertation presents instance detection at multiple levels of scene understanding, with representations that enable intelligent systems not only to recognize what is seen (e.g., does that pixel represent a chair?) but also to predict contextual information about the complete 3D scene as a whole (e.g., how big is the chair? Is the chair placed next to a table?). More specifically, it presents a flow of understanding from local information to global fitness. First, we investigate the 3D geometry of instances and present a new approach for generating tight cuboids for objects. Then, we take advantage of trained semantic labeling networks by using intermediate layer outputs as per-category local detectors; instance hypotheses are generated to help traditional optimization methods achieve higher instance segmentation accuracy. After that, to bring the local detection results to holistic scene understanding, our method optimizes object instance segmentation considering both spatial fitness and relational compatibility. The context information is encoded using graphical models that represent scene-level object placement through three relation types: horizontal, vertical, and non-placement (hanging) relations. Finally, the context information is incorporated into a network structure: a deep learning-based re-inferencing framework is proposed to boost any pixel-level labeling output using our local collaborative object presence (LoCOP) feature as global-to-local guidance.
dc.description.abstract: This dissertation demonstrates that uniting pixel-level detection and instance segmentation not only significantly improves the overall performance of localized and individualized analysis, but also paves the way for holistic scene understanding.
dc.format: application/pdf
dc.language: en_US
dc.rights: Copyright is held by the author, who has granted the Oklahoma State University Library the non-exclusive right to share this material in its institutional repository. Contact Digital Library Services at lib-dls@okstate.edu or 405-744-9161 for the permission policy on the use, reproduction, or distribution of this material.
dc.title: Holistic indoor scene understanding by context supported instance segmentation
dc.contributor.committeeMember: Hagan, Martin
dc.contributor.committeeMember: Sheng, Weihua
dc.contributor.committeeMember: Jacob, Jamey
osu.filename: Guo_okstate_0664D_17039.pdf
osu.accesstype: Open Access
dc.type.genre: Dissertation
dc.type.material: Text
dc.subject.keywords: context
dc.subject.keywords: deep learning
dc.subject.keywords: graphical model
dc.subject.keywords: instance segmentation
dc.subject.keywords: scene understanding
thesis.degree.discipline: Electrical Engineering
thesis.degree.grantor: Oklahoma State University

