LDIP: Real-time on-road object detection with depth estimation from a single image | 明躍 | 2025/3/26 |
Bridging Language, Vision and Action: Multimodal VAEs in Robotic Manipulation Tasks | 淳傑 | 2025/3/26 |
StratXplore: Strategic Novelty-seeking and Instruction-aligned Exploration for Vision and Language Navigation | 婷婷 | 2025/4/02 |
3D Object Visibility Prediction in Autonomous Driving | 文文 | 2025/4/02 |
DVT: Decoupled Dual-Branch View Transformation for Monocular Bird’s Eye View Semantic Segmentation | 士涵 | 2025/4/09 |
VANP: Learning Where to See for Navigation with Self-Supervised Vision-Action Pre-Training | 崇瑋 | 2025/4/09 |
Temporal Attention for Cross-View Sequential Image Localization | 柏勳 | 2025/4/16 |
Simultaneous Super-resolution and Depth Estimation for Satellite Images Based on Diffusion Model | 洺緯 | 2025/4/16 |
BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation | 欣玲 | 2025/4/23 |
OV-MAP : Open-Vocabulary Zero-Shot 3D Instance Segmentation Map for Robots | 依庭 | 2025/4/23 |
Semantics from Space: Satellite-Guided Thermal Semantic Segmentation Annotation for Aerial Field Robots | Munir | 2025/4/30 |
Fast and Communication-Efficient Multi-UAV Exploration Via Voronoi Partition on Dynamic Topological Graph
| Teresa | 2025/4/30 |