🔄 In a Training Loop

ouasdg

20 17 56

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Causal-rCM: A Unified Teacher-Forcing and Self-Forcing Open Recipe for Autoregressive Diffusion Distillation in Streaming Video Generation and Interactive World Models

liked a Space 8 days ago

stabilityai/stable-audio-3

liked a model 9 days ago

GD-ML/DreamX-World-5B

Organizations

upvoted a paper 3 days ago

Causal-rCM: A Unified Teacher-Forcing and Self-Forcing Open Recipe for Autoregressive Diffusion Distillation in Streaming Video Generation and Interactive World Models

Paper • 2606.25473 • Published 9 days ago • 24

liked a Space 8 days ago

Stable Audio 3

🎵

117

Text-to-audio with SA3 Medium / Small Music / Small SFX.

liked 2 models 9 days ago

GD-ML/DreamX-World-5B

Image-to-Video • 5B • Updated 16 days ago • 462 • 35

krea/Krea-2-Raw

Text-to-Image • Updated 9 days ago • 46.3k • 269

upvoted a paper 19 days ago

Latent Spatial Memory for Video World Models

Paper • 2606.09828 • Published 25 days ago • 71

liked a model 20 days ago

google/diffusiongemma-26B-A4B-it

Image-Text-to-Text • 26B • Updated about 3 hours ago • 1.42M • 1.09k

liked a model 24 days ago

nvidia/Cosmos3-Super-Text2Image

Text-to-Image • 65B • Updated 17 days ago • 65.8k • 153

updated a model 27 days ago

ouasdg/dirt

Updated 27 days ago

updated a Space 27 days ago

nanoTTS

💻

text-to-speech demo

published a Space 27 days ago

nanoTTS

💻

text-to-speech demo

liked a model about 1 month ago

nvidia/PiD

Image-to-Image • Updated about 1 month ago • 594 • 354

upvoted 3 papers about 2 months ago

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Paper • 2605.15178 • Published May 14 • 91

CausalCine: Real-Time Autoregressive Generation for Multi-Shot Video Narratives

Paper • 2605.12496 • Published May 12 • 30

TextLDM: Language Modeling with Continuous Latent Diffusion

Paper • 2605.07748 • Published May 8 • 26

liked a model about 2 months ago

ResembleAI/Dramabox

Text-to-Speech • Updated May 13 • 235 • 296

liked a dataset 2 months ago

k2-fsa/OpenDialog

Viewer • Updated Apr 18 • 996k • 762 • 22

liked a Space 2 months ago

DialogueSidon Demo

🔥

Separate two speakers from an audio or video recording

updated a Space 3 months ago

Dirt TTS

💻

text-to-speech demo

liked a model 3 months ago

Skywork/Matrix-Game-3.0

Image-Text-to-Video • Updated Apr 28 • 221 • 124

liked a dataset 3 months ago

IVLLab/MultiDialog

Updated Aug 29, 2024 • 598 • 30
