Publications

You can also find my articles on my Google Scholar profile.
arxiv 2025
sym

ColaVLA: Leveraging Cognitive Latent Reasoning for Hierarchical Parallel Trajectory Planning in Autonomous Driving
Qihang Peng, Xuesong Chen, Chenye Yang, Shaoshuai Shi, Hongsheng Li

Project | Code

  • ColaVLA moves VLM reasoning into a compact latent space and decodes multi-scale causal trajectories in one pass.
  • State-of-the-art in both open-loop and closed-loop settings with favorable efficiency and robustnesson on the nuScenes benchmark.
CVPR 2025
sym

ProxyTransformation : Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding
Qihang Peng, Henry Zheng, Gao Huang

Project | Code

  • Make full use of multimodal information in ego-centric 3D visual grounding for point enhancement.
  • State-of-the-art on EmbodiedScan benchmark.
ICLR 2025
sym

DenseGrounding: Improving Dense Language-Vision Semantics for Ego-centric 3D Visual Grounding
Henry Zheng*, Shi Hao*, Qihang Peng, et al.

ICLR 2025 | AGC 2024

  • Use LLM and Ground Truth to enhance semantic details in prompt to reduce the ambiguity during training.
  • Extract individual view semantics and enriches visual representation with global scene-level semantic.