Publication detail
Estimating Extreme 3D Image Rotations using Cascaded Attention
DEKEL, S. KELLER, Y. ČADÍK, M.
Original Title
Estimating Extreme 3D Image Rotations using Cascaded Attention
Type
conference paper
Language
English
Original Abstract
Estimating large, extreme inter-image rotations is critical for numerous computer vision domains involving images related by limited or non-overlapping fields of view. In this work, we propose an attention-based approach with a pipeline of novel algorithmic components. First, as rotation estimation pertains to image pairs, we introduce an inter-image distillation scheme using Decoders to improve embeddings. Second, whereas contemporary methods compute a 4D correlation volume (4DCV) encoding inter-image relationships, we propose an Encoder-based cross-attention approach between activation maps to compute an enhanced equivalent of the 4DCV. Finally, we present a cascaded Decoder-based technique for alternately refining the cross-attention and the rotation query. Our approach outperforms current state-of-the-art methods on extreme rotation estimation. We make our code publicly available.
Keywords
camera orientation estimation, extreme rotation, 3D rotation, cascaded attention
Authors
DEKEL, S.; KELLER, Y.; ČADÍK, M.
Released
13. 3. 2024
Publisher
IEEE Computer Society
Location
Seattle
ISBN
979-8-3503-5301-3
Book
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Pages from
2588
Pages to
2598
Pages count
11
URL
BibTex
@inproceedings{BUT188275,
author="Shay {Dekel} and Yosi {Keller} and Martin {Čadík}",
title="Estimating Extreme 3D Image Rotations using Cascaded Attention",
booktitle="Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)",
year="2024",
pages="2588--2598",
publisher="IEEE Computer Society",
address="Seattle",
doi="10.1109/CVPR52733.2024.00250",
isbn="979-8-3503-5301-3",
url="https://cadik.posvete.cz/"
}