| SCAM-P: Spatial Channel Attention Module for Panoptic Driving Perception | 崇瑋 | 2026/3/16 |
| Segment Any Repeated Object | Jeslin | 2026/3/16 |
| One Map to Find Them All: Real-time Open-Vocabulary Mapping for Zero-shot Multi-Object Navigation | 洺緯 | 2026/3/23 |
| Adaptive Thresholding for Sequence-Based Place Recognition | Nhu | 2026/3/23 |
| MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation | 依庭 | 2026/3/30 |
| Run-time Observation Interventions Make Vision-Language-Action Models More Visually Robust | Teresa | 2026/3/30 |
| SpatialBot: Precise Spatial Understanding with Vision Language Models | Munir | 2026/4/06 |
| Dur360BEV: A Real-world 360-degree Single Camera Dataset and Benchmark for Bird-Eye View Mapping in Autonomous Driving | 桂茹 | 2026/4/06 |