From 2D to 3D: AISG-SLA Visual Localization Challenge

From 2D to 3D: AISG-SLA Visual Localization Challenge

Jialin Gao, Bill Ong, Darld Lwi, Zhen Hao Ng, Xun Wei Yee, Mun-Thye Mak, Wee Siong Ng, See-Kiong Ng, Hui Ying Teo, Victor Khoo, Georg Bökman, Johan Edstedt, Kirill Brodt, Clémentin Boittiaux, Maxime Ferrera, Stepan Konev

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence
Demo Track. Pages 8661-8664. https://doi.org/10.24963/ijcai.2024/1003

Research in 3D mapping is crucial for smart city applications, yet the cost of acquiring 3D data often hinders progress. Visual localization, particularly monocular camera position estimation, offers a solution by determining the camera's pose solely through visual cues. However, this task is challenging due to limited data from a single camera. To tackle these challenges, we organized the AISG–SLA Visual Localization Challenge (VLC) at IJCAI 2023 to explore how AI can accurately extract camera pose data from 2D images in 3D space. The challenge attracted over 300 participants worldwide, forming 50+ teams. Winning teams achieved high accuracy in pose estimation using images from a car-mounted camera with low frame rates. The VLC dataset is available for research purposes upon request via vlc-dataset@aisingapore.org.
Keywords:
Computer Vision: CV: 3D computer vision
Computer Vision: CV: Applications
Computer Vision: CV: Machine learning for vision
Computer Vision: CV: Motion and tracking
Computer Vision: CV: Recognition (object detection, categorization)
Computer Vision: CV: Scene analysis and understanding   
Computer Vision: CV: Segmentation