MoST: Multi-modality Scene Tokenization for Motion Prediction
Norman Mu, Jingwei Ji, Zhenpei Yang, Rami Al-Rfou, Dragomir Anguelov, Yin Zhou
June 2024Abstract
MoST introduces scene tokenization for multimodal motion prediction, improving representation quality and prediction performance.
Publication
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

Member of Technical Staff - TLM
My research interests include language modeling, embodied AI, motion forecasting, and multilingual modeling.