BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data Paper • 2402.08093 • Published Feb 12, 2024 • 61
Draw To Search Art • 73 • Draw or upload an image and search WikiART using SigLIP
stabilityai/stable-video-diffusion-img2vid-xt Image-to-Video • Updated Jul 10, 2024 • 342k • 3.29k
Moe TTS • 667 • Generate and convert voice using text and audio inputs
MusicGen 🎵 • 5.07k • Generate music from text descriptions and optional melodies