AI & ML interests

multi-modal foundation models

Recent Activity

oliveryanzuolu  updated a collection about 1 hour ago
RAVEN
oliveryanzuolu  updated a model about 2 hours ago
mvp-lab/RAVEN
View all activity

oliveryanzuolu 
posted an update about 1 hour ago
view post
Post
3
Excited to share RAVEN, my first PhD project. Paper, code, and models are all released.

RAVEN is for real-time autoregressive video generation. Instead of simply appending future chunks, we train the model to better remember and use its own generated history, leading to more realistic and natural long-horizon videos.

Technically, RAVEN repacks self-rollouts into interleaved clean historical endpoints and noisy denoising states, aligning training-time attention with inference-time extrapolation.

We also introduce CM-GRPO: by reformulating consistency-model sampling as a conditional Gaussian transition kernel, online RL can directly optimize the sampler transition used at inference.

Project Page: https://yanzuo.lu/raven
Paper: https://arxiv.org/abs/2605.15190
Code: https://github.com/mvp-ai-lab/RAVEN
Model: mvp-lab/RAVEN