https://arxiv.org/abs/1612.07837
[1612.07837] SampleRNN: An Unconditional End-to-End Neural Audio Generation Model
Abstract page for arXiv paper 1612.07837: SampleRNN: An Unconditional End-to-End Neural Audio Generation Model
audio generationsamplernnunconditional
https://varenyaz.com/stability-ai-launches-smartphone-friendly-audio-generation-model/
Stability AI's Portable Audio Generation Innovation
May 14, 2025 - Unlock your business's potential with VarenyaZ's bespoke AI, ML, and digital solutions. Specializing in industries like Real Estate, Jewellery, Political...
stability aiportable audiogenerationinnovation
https://www.aitoolzdir.com/tool/coqui
Coqui - Audio Generation AI Tool | AI Toolz Dir
Voice Cloning aka text-to-speech through generative AI Coqui is a Audio Generation AI tool listed on AI Toolz Dir.
audio generationai toolcoquitoolzdir
https://forum.generation-n.at/viewforum.php?f=17&sid=e0495b910ac1b04d40f6305618078b4a
Musik / Audio - generation-n.at/forum
audio generationmusikforum
https://ltx-2.run/blog/z-image-open-source-image-generation-benchmark-en/
LTX-2 - Open-Source 4K AI Video & Audio Generation Model
LTX-2 is a production-ready 19B parameter AI model for synchronized 4K video and audio generation. Text-to-video, image-to-video, LoRA fine-tuning support.
open sourceai videoaudio generationltxmodel
https://thinksoundai.com/zh/
ThinkSound AI - Revolutionary Audio Generation with Reasoning
Experience ThinkSound AI, the breakthrough audio generation model that thinks before it speaks. Generate high-quality audio with advanced reasoning...
audio generationthinksoundairevolutionaryreasoning
https://gptrealtime2.ai/
GPT Realtime 2 | AI Audio Generation & Text-to-Speech API
GPT Realtime 2 delivers instant, natural-sounding AI audio generation. Try GPT-Realtime-2 text-to-speech in your browser — no registration required....
text to speechgpt realtimeai audiogenerationapi
https://thinksoundai.com/ru/
ThinkSound AI - Revolutionary Audio Generation with Reasoning
Experience ThinkSound AI, the breakthrough audio generation model that thinks before it speaks. Generate high-quality audio with advanced reasoning...
audio generationthinksoundairevolutionaryreasoning
https://forum.generation-n.at/viewforum.php?f=17&sid=8a217f42ef86ab50ffbae931c030a8cc
Musik / Audio - generation-n.at/forum
audio generationmusikforum
https://open-launch.com/projects/seedance-art-cinematic-video-audio-generation
Seedance Art: Cinematic Video & Audio Generation | Open-Launch
Seedance Art is a professional AI video and audio generation platform powered by the Seedance core models from ByteDance. We empower creators, storytellers,...
seedance artcinematic videoaudio generationopenlaunch
https://aidreamhub.com/category/audio-generation
Best Audio Generation AI Tools 2026 | AI Dreamhub
AI tools for creating music and audio
audio generationai toolsbest
https://www.aitoolzdir.com/toolz/audio-generation/clone
Audio Generation - AI Toolz Dir
A list of Audio Generation AI Tools.
audio generationai toolzdir
https://forum.generation-n.at/viewforum.php?f=17&sid=057ee29c68a33971ebda9e8b8dd5b6ac
Musik / Audio - generation-n.at/forum
audio generationmusikforum
https://best100apps.com/ai-apps-for-audio-generation/
AI Apps for Audio Generation - Best 100 Apps
Dec 23, 2025 - - AI Apps for Audio Generation
ai appsfor audiogenerationbest
https://ulazai.com/fi/kling26/landing/
KLING 2.6 - AI Video with Simultaneous Audio Generation | UlazAI
Generate videos with speech, sound effects, and ambient sounds in one step using KLING 2.6. The world's first AI model with simultaneous audio-visual...
ai videoaudio generationklingsimultaneousulazai
https://vidofy.ai/en/models/ovi
Ovi AI Video Generator: Synchronized Audio-Video Generation from Character AI
Ovi AI by Character AI generates 5-10 second videos with native synchronized audio. Twin DiT architecture with physics-accurate motion, lip-sync, and...
ai video generatorsynchronized audioovigenerationcharacter
https://awesomeskill.ai/tag/audio-generation
audio-generation - Claude Skills - Awesome Skills - Agent Skills Marketplace for Claude, Codex &...
Browse skills tagged with audio-generation
audio generationclaude skillsagent marketplaceawesomecodex
https://laptopmindset.com/tag/native-audio-generation/
native audio generation - Laptop Mindset
audio generationnativelaptopmindset
https://www.aibase.com/tool/34123
SoundStorm-Efficient Parallel Audio Generation Technology
SoundStorm is an audio generation technology developed by Google Research that significantly reduces the time needed for audio synthesis by generating audio tok
audio generationsoundstormefficientparalleltechnology
https://ypforai.com/category/music-audio-generation
Popluar Music & Audio Generation in 2026
music audiopopluargeneration
https://ltx-2.run/
LTX-2 - Open-Source 4K AI Video & Audio Generation Model
LTX-2 is a production-ready 19B parameter AI model for synchronized 4K video and audio generation. Text-to-video, image-to-video, LoRA fine-tuning support.
open sourceai videoaudio generationltxmodel
https://forum.generation-n.at/viewforum.php?f=17&sid=ae99bba6bc8b5374fc43eed73dddf057
Musik / Audio - generation-n.at/forum
audio generationmusikforum
https://www.workingnomads.com/jobs/ai-research-lead-video-audio-generation-canva-1014967
AI Research Lead - Video & Audio Generation at Canva | Working Nomads
ai researchvideo audioleadgenerationcanva
https://www.yemenculturalnews.com/article/871977017-chamelo-introduces-next-generation-fashion-frames-blending-smart-tint-premium-audio-and-modern-style
Chamelo Introduces Next-Generation Fashion Frames Blending Smart Tint, Premium Audio, and Modern...
https://www.hajim.rochester.edu/ece/news-events/events/ms-phd_defenses/2025-08-26_yan_defense.html
Structured Analysis and Generation in Music, Audio, and Beyond : News & Events : Department of...
Music is fundamentally organized sound, yet its hierarchical, temporal, and relational structure remains challenging for data-driven models.
in music
https://en.xtones.net/walrusaudio-meraki-stereoanalog-dualdelay/
[A must-see for analog fans] What is the next-generation analog delay with WALRUS AUDIO MERAKI 8...
May 2, 2025 - The WALRUS AUDIO MERAKI Stereo Analog Dual Delay maintains the musical appeal of analog delay while also incorporating modern stereo functionality and MIDI...
https://aiartweekly.com/tools/tangoflux-super-fast-and-faithful-text-to-audio-generation-with-flow-matching-and-clap-ranked-preference-optimization
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked...
TangoFlux can generate 30 seconds of 44.1kHz audio in just 3.7 seconds on a single A40 GPU.
text to audio generation
https://www.dripo.ai/zh/model/wan25
Wan 2.5 AI Video Generator - Native Audio-Visual Generation | Dripo AI
Generate stunning videos with Wan 2.5, the first AI model to natively integrate professional audio generation. Create complete audiovisual experiences with...
ai video generatoraudio visualwan
https://arxiv.org/abs/2405.14598
[2405.14598] Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation
Abstract page for arXiv paper 2405.14598: Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation
visual echoesfor audio
https://mcpserver.space/category/audio-video-generation/
Audio & Video Generation - MCP Server Space
Services for generating and processing audio and video content, such as text-to-speech, music composition, and video synthesis.
audio videomcp servergenerationspace
https://viw.ai/veo3
Veo 3 - Cinematic AI Video Generation with Audio | Viw AI
Unlock next-gen AI video creation with Viw AI using Veo 3. Generate stunning, cinematic videos with native audio and realistic physics, effortlessly.
cinematic ai videowith audioveogenerationviw
https://www.dreamega.ai/models/kling-2-6-pro
Unlimited Kling 2.6 Video Generator - Native Audio-Visual Generation | Dreamega
Create stunning AI videos with Kling 2.6 Pro featuring native audio-visual generation, 1080p resolution, perfect lip-sync, and multi-language voice support....
video generatoraudio visualunlimitedkling
https://arxiv.org/abs/2510.02110
[2510.02110] SoundReactor: Frame-level Online Video-to-Audio Generation
Abstract page for arXiv paper 2510.02110: SoundReactor: Frame-level Online Video-to-Audio Generation
video to audioframelevelonlinegeneration
https://ai.sony/publications/savgbench-benchmarking-spatially-aligned-audio-video-generation?hsLang=en
SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation - Sony AI
This work addresses the lack of multimodal generative models capable of producing high-quality videos with spatially aligned audio. While recent advancements...
audio videobenchmarkingalignedgenerationsony
https://aiparabellum.com/category/audio-generation/
AI Tools for Audio Generation
Discover the Best AI Tools for Audio Generation to create high-quality, realistic audio, music, and voiceovers effortlessly.
ai toolsfor audiogeneration
https://diglib.eg.org/items/6c75b755-dbcf-498f-aab3-239f918b33fa
Conversational Gesture Model (CGM): Extending Speaker-Centric Audio-Driven Motion Generation to...
In this work we extend speaker-centric audio-driven gesture synthesis toward a unified conversational model that jointly captures both speaking and listening...
https://wan2.video/OmniAvatar
OmniAvatar: Efficient Audio-Driven Avatar Video Generation
OmniAvatar is an innovative model for generating realistic, controllable avatar videos driven by audio, featuring adaptive body animation and fine-grained...
avatar videoomniavatarefficientaudiodriven