1

MoST: Multi-modality Scene Tokenization for Motion Prediction

Multi-modality scene tokenization for motion prediction.

Scaling Motion Forecasting Models with Ensemble Distillation

Distillation methods for scaling motion forecasting models.

WOMD-LiDAR: Raw Sensor Dataset Benchmark for Motion Forecasting

A raw sensor benchmark for motion forecasting.

MotionLM: Multi-Agent Motion Forecasting as Language Modeling

Multi-agent motion forecasting as language modeling.

Wayformer: Motion Forecasting via Simple and Efficient Attention Networks

Efficient attention architecture for motion forecasting.

Narrowing the Coordinate-Frame Gap in Behavior Prediction Models

Distillation for efficient and accurate scene-centric motion forecasting.

SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer

Soft prompt transfer for better adaptation of frozen models.

The Power of Scale for Parameter-Efficient Prompt Tuning

Prompt tuning improves substantially with model scale.

nmT5: Is Parallel Data Still Relevant for Pre-training Massively Multilingual Language Models?

Parallel data and pre-training dynamics for massively multilingual LMs.

Machine Translation Aided Bilingual Data-to-Text Generation and Semantic Parsing

We present a system for bilingual Data-To-Text Generation and Semantic Parsing. We use a text-to-text generator to learn a single model that works for both languages on each of the tasks. The model is aided by machine translation during both …