Toggle Poster Visibility
Oral
Fri Apr 24 06:30 AM -- 06:40 AM (PDT) None
Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation
[
OpenReview]
Oral
Fri Apr 24 06:42 AM -- 06:52 AM (PDT) None
SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer
[
OpenReview]
Oral
Fri Apr 24 06:54 AM -- 07:04 AM (PDT) None
Partition Generative Modeling: Masked Modeling Without Masks
[
OpenReview]
Oral
Fri Apr 24 07:06 AM -- 07:16 AM (PDT) None
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale
[
OpenReview]
Oral
Fri Apr 24 07:18 AM -- 07:28 AM (PDT) None
TTSDS2: Resources and Benchmark for Evaluating Human-Quality Text to Speech Systems
[
OpenReview]
Oral
Fri Apr 24 07:30 AM -- 07:40 AM (PDT) None
VibeVoice: Expressive Podcast Generation with Next-Token Diffusion
[
OpenReview]
Oral
Fri Apr 24 07:42 AM -- 07:52 AM (PDT) None
UALM: Unified Audio Language Model for Understanding, Generation and Reasoning
[
OpenReview]
Successful Page Load