IMECH-IR  > 流固耦合系统力学重点实验室
Instance-Aware Monocular 3D Semantic Scene Completion
Xiao, Haihong1; Xu, Hongbin1; Kang, Wenxiong1; Li YQ(李玉琼)2
通讯作者Kang, Wenxiong(auwxkang@scut.edu.cn)
发表期刊IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS
2024-01-02
页码12
ISSN1524-9050
摘要We study outdoor 3D scene understanding, a challenging task demanding the intelligent system to infer both geometry and semantics from a single-view image - a critical skill for autonomous vehicles to navigate in the real 3D world. Towards this end, we present an instance-aware monocular semantic scene completion framework. To the best of our knowledge, this is the first endeavor specifically targeting the challenge of instance perception in the camera-based semantic scene completion task. Our method consists of two stages. In stage I, we design a region-based VQ-VAE network, providing an effective solution for 3D occupancy prediction. In stage II, we first introduce an instance-aware attention module, explicitly incorporating instance-level cues captured from mask images to enhance the instance features in RGB images. Then we leverage the deformable cross-attention to aggregate image features corresponding to each voxel query and utilize the deformable self-attention to refine query proposals. We combine these key ingredients and evaluate our method on two challenging datasets, namely SemanticKITTI and SSCBench-KITTI-360. The results unequivocally demonstrate the superiority of our proposed method over the state-of-the-art VoxFormer-S. Specifically, our method surpasses VoxFormer-S by 0.22 IoU and 0.72 mIoU on the validation set and achieves an impressive improvement of 3.04 IoU and 1.06 mIoU on the SSCBench-KITTI-360 validation set. Meanwhile, our approach ensures accurate perception of critical instances, thereby exhibiting its exceptional performance and potential for practical deployment.
关键词3D scene understanding semantic scene completion 3D vision
DOI10.1109/TITS.2023.3344806
收录类别SCI ; EI
语种英语
WOS记录号WOS:001167317900001
WOS研究方向Engineering ; Transportation
WOS类目Engineering, Civil ; Engineering, Electrical & Electronic ; Transportation Science & Technology
资助项目National Natural Science Foundation of China
项目资助者National Natural Science Foundation of China
论文分区一类
力学所作者排名3+
RpAuthorKang, Wenxiong
引用统计
文献类型期刊论文
条目标识符http://dspace.imech.ac.cn/handle/311007/94550
专题流固耦合系统力学重点实验室
作者单位1.South China Univ Technol, Sch Automat Sci & Engn, Guangzhou 511442, Peoples R China;
2.Chinese Acad Sci, Inst Mech, Key Lab Mech Fluid Solid Coupling Syst, Beijing 100190, Peoples R China
推荐引用方式
GB/T 7714
Xiao, Haihong,Xu, Hongbin,Kang, Wenxiong,et al. Instance-Aware Monocular 3D Semantic Scene Completion[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS,2024:12.
APA Xiao, Haihong,Xu, Hongbin,Kang, Wenxiong,&李玉琼.(2024).Instance-Aware Monocular 3D Semantic Scene Completion.IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS,12.
MLA Xiao, Haihong,et al."Instance-Aware Monocular 3D Semantic Scene Completion".IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS (2024):12.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
Lanfanshu学术
Lanfanshu学术中相似的文章
[Xiao, Haihong]的文章
[Xu, Hongbin]的文章
[Kang, Wenxiong]的文章
百度学术
百度学术中相似的文章
[Xiao, Haihong]的文章
[Xu, Hongbin]的文章
[Kang, Wenxiong]的文章
必应学术
必应学术中相似的文章
[Xiao, Haihong]的文章
[Xu, Hongbin]的文章
[Kang, Wenxiong]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。