Publications

You can also find my articles on my Google Scholar profile.

CVPR 2026

ColaVLA: Leveraging Cognitive Latent Reasoning for Hierarchical Parallel Trajectory Planning in Autonomous Driving
Qihang Peng, Xuesong Chen, Chenye Yang, Shaoshuai Shi, Hongsheng Li

Project | Code

ColaVLA moves VLM reasoning into a compact latent space and decodes multi-scale causal trajectories in one pass.
State-of-the-art in both open-loop and closed-loop settings on the nuScenes benchmark, with favorable efficiency and robustness.

CVPR 2025

ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding
Qihang Peng, Henry Zheng, Gao Huang

Project | Code

Makes full use of multimodal information to enhance point clouds in ego-centric 3D visual grounding.
State-of-the-art on the EmbodiedScan benchmark.

ICLR 2025

DenseGrounding: Improving Dense Language-Vision Semantics for Ego-centric 3D Visual Grounding
Henry Zheng*, Shi Hao*, Qihang Peng, et al.

ICLR 2025 | AGC 2024

Uses an LLM and ground truth to enrich the semantic details of prompts, reducing ambiguity during training.
Extracts individual-view semantics and enriches the visual representation with global scene-level semantics.