https://arxiv.org/abs/2411.04996
[2411.04996] Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation...
Abstract page for arXiv paper 2411.04996: Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models
multi modalmixturetransformerssparsescalable
https://arxiv.org/abs/2303.06555
[2303.06555] One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale
Abstract page for arXiv paper 2303.06555: One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale
multi modalonetransformerfitsdistributions
https://dokeyai.com/item/seedvideo-io
SeedVideo: Seedance 3.0 AI Video Generator - Multi-Modal Cinematic Videos - DokeyAI
ai video generatormulti modalcinematic videosseedancedokeyai
https://seedancevideo.app/tools/seedance-2-prompt-guide
Seedance 2.0 Prompt Guide — Multi-Modal & Multi-Shot Prompting | Seedance
Master Seedance 2.0 prompts for multi-shot, multi-modal, and audio-inclusive video generation.
prompt guidemulti modalseedanceshotprompting
https://v12marketing.com/marketing/multi-modal-marketing-why-text-only-strategies-are-becoming-obsolete/
Multi-Modal Marketing: Why Text-Only Content Fails in 2026
May 7, 2026 - Discover why multi-modal marketing using video, audio, and visuals is replacing text-only strategies for stronger SEO and engagement.
multi modalmarketingtextcontentfails
https://seedance2.ai/generate
Seedance 2.0 Multi-Modal AI Video Generator | Seedance 2
The revolutionary AI video model that understands your creative vision. Use images, videos, audio, and text as inputs to create stunning videos with...
multi modal aiseedancevideogenerator
https://docs.vllm.ai/en/latest/design/mm_processing/
Multi-Modal Data Processing - vLLM
multi modaldata processingvllm
https://www.raumedic.com/application-areas/neuromonitoring
Multi-Modal Neuromonitoring Equipment – RAUMEDIC
Measuring Catheters and Equipment from RAUMEDIC to measure parameters such as pressure, temperature or oxygen partial pressure in tissue.
multi modalneuromonitoringequipmentraumedic
https://www.pattern-skin.at/
PATTERN-Skin – Modular Multi-Modal Proximity and Tactile Perception Skin
multi modalpatternskinmodularproximity
https://aiwith.me/tools/aiveo4-ai/
Veo 4: Veo 4 is a multi-modal AI video generator that creates cinematic content with native audio....
May 7, 2026 - Veo 4: Veo 4 is an advanced AI video creation platform designed to solve the problem of fragmented and inconsistent video generation. Unlike basic...
multi modal aivideo generatorveocreatescinematic
https://creati.ai/ai-tools/veo-4-ai/
Veo 4 Multi-Modal AI Video Creation with Native Audio | Creati.ai
Create cinematic videos from text, images, video, and audio with Veo 4’s multi-modal control, native audio, consistency, and seamless editing.
multi modal aivideo creationnative audioveo
https://www.meazurelearning.com/resources/in-person-remote-or-multi-modal-test-delivery-which-one-is-right-for-you
In-Person, Remote, or Multi-Modal Test Delivery—Which One Is Right for You? | Meazure Learning
May 30, 2023 - Learn how to evaluate test delivery modalities and choose the assessment solution that's right for your exam program!
multi modalpersonremotetestone
https://www.tooluck.org/tag/multi-modal-ai
Multi-modal AI - Tooluck
Platform supporting multiple modalities (text, image, audio)
multi modal aitooluck
https://arxiv.org/abs/2509.08519
[2509.08519] HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning
Abstract page for arXiv paper 2509.08519: HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning
human centricvideo generationmulti modalhumovia
https://aitoolex.com/tag/multi-modal-ai-platform
Multi-modal AI platform - AIToolex
AIToolex is the best AI tools directory to explore cutting-edge AI tools. Quickly find the perfect AI tools to supercharge your productivity and efficiency.
multi modal aiplatformaitoolex
https://xenoss.io/cases/unified-multi-modal-neural-network-for-improving-credit-scoring-accuracy
Multi-Modal Neural Network for Credit Scoring
Bank achieved a 2.6-point Gini uplift using a unified neural network architecture. How multi-modal AI improved loan default predictions.
multi modalneural networkcreditscoring
https://platform.kimi.ai/docs/pricing/chat-k25
Multi-modal Model Kimi K2.5 Pricing - Kimi API Platform
Kimi K2.6 Open Platform, providing trillion-parameter K2.5 large language model API, supporting 256K long context and Tool Calling. Professional code...
multi modalmodelkimipricingapi
https://auralumeai.com/model/kling-o1
Kling O1 - AI Video Editor & Multi-Modal Generator | Auralume AI
Create and edit videos with Kling O1, the first AI model for video-to-video editing. Generate up to 2-minute videos with text prompts, multi-modal inputs, and...
ai video editormulti modalklinggeneratorauralume
https://nav4ai.com/tool/grok-imagine
Grok Imagine: AI image and video generator with multi-modal inputs and unfiltere
AI image and video generator with multi-modal inputs and unfiltered creative freedom.
grok imagine aivideo generatormulti modalimageinputs
https://wan2-7ai.com/blog/wan-2-7-vs-seedance-2-0-ai-video-showdown
The Clash of Multi-Modal Titans: Wan 2.7 vs Seedance 2.0 in the New AI Video Era | Blog
Discover the ultimate showdown: Wan 2.7 vs Seedance 2.0. Explore how the Wan 2.7 ai video generator and Seedance2.0 Image and Video maker redefine AI video...
multi modalvs seedancenew aiclashtitans
https://selfreason.com/
SelfReason: Multi-modal Offline AI
multi modalofflineai
https://www.makesong.com/seedance-2
Seedance 2.0 - Multi-Modal AI Video Generator | MakeSong
Create cinematic AI videos from text, images, or reference materials. Supports 9 reference images, 3 videos, 3 audio clips with native audio-visual...
multi modal aivideo generatorseedance
https://seadanceai.app/seedance-2
Seedance 2.0 – Multi-Modal AI Video Generator | Seadance AI
Seedance 2.0 is ByteDance's next-generation AI video model. Combine images, videos, audio and text to generate cinematic videos with multi-shot storytelling,...
multi modal aivideo generatorseedanceseadance
https://seedancevideo.app/tools/seedance-2-image-to-video
Seedance 2.0 Image-to-Video — Multi-Modal Animation with Audio | Seedance
Seedance 2.0 image-to-video accepts up to 9 images as input and generates multi-shot video with native audio output.
multi modalseedanceimagevideoanimation
https://www.aimla.org/raising-the-standard-of-veterinary-care-a-multi-modal-approach-to-managing-pain-on-demand
Raising the Standard of Veterinary Care: A Multi-Modal Approach to Managing Pain - Las Vegas, NV -...
Our mission is to provide opportunities to raise the level of knowledge in the numerous disciplines of medical lasers
veterinary caremulti modallas vegasraisingstandard
https://bioconnect.com/
BioConnect | Multi-Modal Authentication Platform
Protect your data and assets with BioConnect's multi-modal authentication platform, ensuring seamless integration and compliance.
multi modalauthenticationplatform
https://www.inboundlogistics.com/web-cite-city/multi-modal/
Multi-Modal Archives - Inbound Logistics
multi modalarchives inboundlogistics
https://www.seedance2pro.com/
Seedance 2.0 – Multi-Modal AI Video Generator with Audio
Seedance 2.0 is a next-generation multimodal AI video generator that transforms text, images, and audio into cinematic video content for professional creators.
multi modal aivideo generatorseedanceaudio
https://seedancevideo.app/tools/seedance-2-text-to-video
Seedance 2.0 Text-to-Video — Multi-Modal Video with Stereo Audio | Seedance
Seedance 2.0 text-to-video generates multi-shot video with native audio output up to 15 seconds from text prompts.
multi modalseedancetextvideostereo
https://www.jxp.com/seedream/seedream-4
Seedream 4.0 AI Image Generator - Multi-Modal Creation | JXP
Seedream 4.0 by ByteDance: Generate stunning 2K-4K images in 1.8s. Multi-reference processing, batch generation, and natural language editing.
ai image generatormulti modalseedreamcreationjxp
https://huggingface.co/papers/2604.14268
Paper page - HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating...
Join the discussion on this paper page
multi modalpaperhyworldmodel
https://multimodalmoving.com/
Atlantic Multi-Modal Moving
multi modalatlanticmoving
https://platform.kimi.ai/docs/pricing/chat-k26
Multi-modal Model Kimi K2.6 Pricing - Kimi API Platform
Kimi K2.6 Open Platform, providing trillion-parameter K2.5 large language model API, supporting 256K long context and Tool Calling. Professional code...
multi modalmodelkimipricingapi
https://mindgard.ai/blog/openai-sora-system-prompts
Uncovering System Prompts Driving Multi-Modal LLMs - Mindgard
Dec 9, 2025 - We show how we revealed Sora 2's system prompt by experimenting across multiple modalities, including text-to-image, ASCII and glyph renderings, video, audio...
multi modaluncoveringsystempromptsdriving
https://arxiv.org/abs/2405.17842
[2405.17842] MMDisCo: Multi-Modal Discriminator-Guided Cooperative Diffusion for Joint Audio and...
Abstract page for arXiv paper 2405.17842: MMDisCo: Multi-Modal Discriminator-Guided Cooperative Diffusion for Joint Audio and Video Generation
multi modalguidedcooperativediffusionjoint
https://nanobananaimg.com/studio/reference-to-video
Reference to Video | Multi-Modal AI Video Generator | Nano Banana
Generate AI videos from image, video, and audio references. Upload photos for character consistency, videos for motion guidance, and audio for synchronized...
multi modal aigenerator nanoreferencevideobanana
https://www.gtec.at/product/g-trigbox-multimodal-trigger-box/
g.TRIGBOX Multi-Modal Trigger Generation | g.tec medical engineering
The g.TRIGbox is a device that generates trigger pulses from various sensors or input signals.
multi modaltec medicaltriggergenerationengineering
https://bloxmove.com/multi-modal-mobility
SaaS for multi-modal mobility | bloXmove
Dec 5, 2022 - bloXmove provides a decentralized software-as-a-service to operate the business transactions between mobility and energy companies.
multi modalsaasmobility
https://www.keolisna.com/
Multi-Modal Solutions. Imagine ways from here to better. Keolis NA
Keolis is creating the mobility solutions of the future and has been doing so for over 100 years. Learn how we help communities transportation needs now!
multi modalsolutionsimaginewaysbetter
https://ams-osram.com/de/innovation/technology/optical-force-sensing
Optical force sensing - Multi-modal control | ams OSRAM
Innovative Schalttechnologien revolutionieren die Steuerung elektrischer Geräte – sie machen sie sensibler, vielseitiger, zuverlässiger, hygienischer,...
multi modalopticalforcesensingcontrol
https://www.seedancepro.net/seedance/seedance-2-0
Seedance 2.0 — Multi-Modal AI Video Creation
Create cinematic AI videos using text, images, video references, and audio direction with Seedance 2.0. Direct scenes. Generate stories.
multi modal aiseedancevideocreation
https://preflmr.github.io/
PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers
` - General Purpose Multimodal Knowledge Retriever
fine grainedmulti modalscalinglateinteraction
https://registry.opendata.aws/open-robo-care/
OpenRoboCare Multi-Modal Expert Demonstration Dataset for Robot-Assisted Caregiving - Registry of...
multi modalrobot assistedexpertdemonstrationdataset
https://www.hunyuanvideo.org/en/hunyuan-custom-ai
Hunyuan Custom AI Video Generator Free Online: Multi-modal Creation
Use our free online Hunyuan Custom AI video generator. Create videos with consistent subjects and characters from text, images, audio, or video inputs....
ai video generatorfree onlinemulti modalhunyuancustom
https://aiseedance.pro/
Seedance 2.0 — Multi-Modal AI Video Generator by ByteDance
Seedance 2.0 creates cinematic AI videos with multi-modal input, native audio in 8 languages, and 2K export. Free Seedance AI video generator.
multi modal aivideo generatorseedancebytedance
https://arxiv.org/abs/2007.03634
[2007.03634] PinnerSage: Multi-Modal User Embedding Framework for Recommendations at Pinterest
Abstract page for arXiv paper 2007.03634: PinnerSage: Multi-Modal User Embedding Framework for Recommendations at Pinterest
multi modaluserembeddingframeworkrecommendations
https://sigport.org/documents/appendix-multimae-meets-earth-observation-pre-training-multi-modal-multi-task-masked-0
(Appendix) MultiMAE Meets Earth Observation: Pre-training Multi-modal Multi-task Masked...
earth observationpre trainingmulti modalappendixmeets
https://showmebest.ai/ai-tools/geminigen-ai
GeminiGen AI: Multi-Modal Content Creation Platform | ShowMeBest.ai
Transform ideas into AI-generated images, videos, and speech with cutting-edge technology and unlimited creativity.
content creation platformai multimodalshowmebest
https://ieeexplore.ieee.org/document/11007678/;jsessionid=BAC2A2967BD237BD6D46C078CB4375E8
Otter: A Multi-Modal Model With In-Context Instruction Tuning | IEEE Journals & Magazine | IEEE...
Recent advances in Large Multimodal Models (LMMs) have unveiled great potential as visual assistants. However, most existing works focus on responding to indivi
multi modalieee journalsottermodelcontext
https://conglobal.com/company/
ConGlobal: Experts in Multi-Modal, Industrial, Terminal Operations
Aug 8, 2025 - Multi-modal service, the largest depot terminal network, industrial operations expertise, and technology to unlock value, increase operational efficiency, and...
multi modalexpertsindustrialterminaloperations
https://www.qualitymag.com/articles/99559-scaling-battery-production-the-growing-importance-of-quality-assurance-and-multi-modal-inspection
Scaling Battery Production: The Growing Importance of Quality Assurance and Multi-Modal Inspection...
May 1, 2026 - The global battery market is entering a phase of rapid industrial scaling, but growth alone does not guarantee efficiency.
battery productionquality assurancemulti modalscalinggrowing
https://www.routekingmn.com/index.html
Best Cargo Transportation Company in USA - Multi-modal Logistics Service Provider
Route King specialises in transporting all types of cargo within USA using land and railways-based modes of transport. Connect for quotes
cargo transportationmulti modallogistics servicebestcompany
https://dokeyai.com/item/seedance-2-ai
Seedance 2 AI: Best Multi-Modal AI Video Generator - DokeyAI
ai bestmulti modalvideo generatorseedancedokeyai
https://laion.ai/blog/laion-5b/
LAION-5B: A NEW ERA OF OPEN LARGE-SCALE MULTI-MODAL DATASETS | LAION
new eralarge scalemulti modallaionopen
https://www.flychicago.com/ohare/tofrom/MultimodalFacility/pages/default.aspx
Multi-Modal Facility | Chicago O’Hare International Airport (ORD)
The O'Hare Multi Modal Facility (MMF) consolidates the airport's rental car operations and public parking into one facility, while serving as a new gateway to...
multi modalinternational airportfacilitychicagoord
https://ams-osram.com/innovation/technology/optical-force-sensing
Optical force sensing - Multi-modal control | ams OSRAM
Innovative switching technologies are transforming controls on electric devices to be sensitive, multimodal, reliable, hygienic, stylish, and cost-effective.
multi modalopticalforcesensingcontrol
https://2020.stateofthemap.org/sessions/373NDC/
How to publish a multi-modal journey app based on OSM with Trufi App
The global OpenStreetMap conference. July 4 - 5 July, 2020 online (was planed to take place in Cape Town, South Africa).
multi modalapp basedpublishjourneyosm
https://www.graphcore.ai/posts/graphcore-and-aleph-alpha-partner-on-large-multi-modal-ai-models
Graphcore and Aleph Alpha partner on large, multi-modal AI models
May 13, 2023 - Graphcore and Aleph Alpha will work together on research and deployment of Aleph Alpha’s advanced multi-modal models on current IPU systems and the...
multi modal aialeph alphagraphcorepartnerlarge
https://grably.us/
High-quality multi-modal human interaction and conversational datasets | Grably
Grably is a multi-modal human interaction data research company trusted by leading AI labs and big-tech companies.
high qualitymulti modalhuman interactionconversationaldatasets
https://www.webai.com/solutions/multi-modal-kg-rag
Multi-modal KG RAG | webAI
Multi-modal KG RAG retrieves accurate answers from manuals, CAD files, charts, and PDFs with source-linked citations, private deployment, and full data...
multi modalkgragwebai
https://vercel.com/templates/next.js/multi-modal-chatbot
Multi-Modal Chatbot - Vercel
A multi-modal chatbot application using the Vercel AI SDK.
multi modalchatbotvercel
https://arxiv.org/abs/2311.17049
[2311.17049] MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training
Abstract page for arXiv paper 2311.17049: MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training
fast imagemulti modaltextmodelsreinforced
https://ieeexplore.ieee.org/document/9908054/
Ultra-Broadband Optical Wavelength-Conversion using Nonlinear Multi-Modal Optical Waveguides | IEEE...
Ultra-Broadband Wavelength Conversion is one of the key issues of future high-capacity, flexible optical networks. Using optimized Multi-Modal Optical Waveguide
multi modalultrabroadbandopticalwavelength
https://www.msop-765k.org/
mSOP-765k: A Benchmark For Multi-Modal Structured Output Predictions | mSOP-765k
multi modalbenchmarkstructuredoutputpredictions
https://tagntrac.ai/solutions/multi-modal-visibility/
Multi-Modal Tracking | Tag-N-Trac Visibility Solutions
Dec 15, 2025 - Track shipments across ocean, land, and air with Tag-N-Trac's multi-modal visibility solution. Real-time location data and predictive analytics for global...
multi modaltrackingtagvisibilitysolutions
https://www.dreamega.ai/models/pixverse-v5
Advanced PixVerse v5 AI Video Generator - Multi-Modal Text & Image to Video Model with Superior...
Experience PixVerse v5's revolutionary AI video generation. Advanced multi-modal model creates stunning videos from text and images with 1080p output, multiple...
ai video generatormulti modaltext imageadvancedpixverse
https://ai-sdk.dev/cookbook/guides/multi-modal-chatbot
Guides: Multi-Modal Agent
Learn how to build a multi-modal agent that can process images and PDFs with the AI SDK.
multi modalguidesagent
https://arxiv.org/abs/2311.12793
[2311.12793] ShareGPT4V: Improving Large Multi-Modal Models with Better Captions
Abstract page for arXiv paper 2311.12793: ShareGPT4V: Improving Large Multi-Modal Models with Better Captions
multi modalimprovinglargemodelsbetter
https://getwpteam.com/multi-modal-member-details-with-nav/
Multi-Modal Member Details with Nav - Smart Team
Oct 14, 2022 - Multi-Modal Member Details with Navigation Watch More Videos Documentations →
multi modalmember detailsnavsmartteam
https://apoco.com/projects/multi-modal-interaction
Multi-Modal Interaction | Apoco
Software platform for building spatial applications with natural human interaction through voice, gestures, touch, and computer vision, transforming physical...
multi modalinteraction
https://www.etowertech.com/service-news/etowerone-new-functions-in-2024-multi-modal-transport.html
eTowerOne's New Functions in 2024: Multi-modal Transport - WallTech (China) Co.,Ltd.
Multi-modal transport is an important mode of transportation in the modern logistics and transportation industry. Its definition, advantages, and application...
walltech china conew functionsmulti modaltransportltd
https://gen.new/
Gen.new – Multi-Modal AI Studio
Craft words, songs, images, and videos in a single collaborative AI workspace that blends text, music, and visual generation tools.
multi modal aigennewstudio
https://ai4di.eu/aboutx-2/144-development-of-the-ai-based-fleet-management-for-supporting-multi-modal-transport-maas-ai-based-fleet-optimisation-tool.html
AI4DI - Development of the AI-based fleet management for supporting multi-modal transport– MaaS -...
ai basedfleet managementmulti modaldevelopmentsupporting
https://cdg.ifdemo.com/
ComfortDelGro – Leading multi-modal transport operator
leading multimodaltransportoperator
https://www.lukew.com/ff/entry.asp?2035
LukeW | Multi-Modal Personal Assistants: Early Explorations
LukeW Ideation + Design provides resources for mobile and Web product design and strategy including presentations, workshops, articles, books and more on...
multi modalpersonal assistantslukewearlyexplorations
https://www.silverpeakib.com/silverpeak-advises-multi-modal-urban-transport-platform-trafi-on-series-b-financing/
Silverpeak advises multi-modal urban transport platform Trafi on Series B financing - Silverpeak
Mar 21, 2026 - Silverpeak advises Trafi on Series B financing led by Sumitomo Corporation. Exclusive mobility tech growth financing advisory for a leading European transport...
silverpeak advisesmulti modalurban transportplatformtrafi
https://payware.eu/en/innovation/payment-methods/qr-code-payments
QR Code Payments | Multi-Modal Payment Initiation | payware
May 22, 2026 - Instant QR code payments with flat % fees. Scan-to-pay in-store and online. No hardware needed. Part of 7 payment initiation methods. Instant settlement.
qr codemulti modalpaymentsinitiationpayware
https://immunarch.com/
Multi-Modal Immune Repertoire Analytics for Immunotherapy and Vaccine Design in R • immunarch
A comprehensive analytics framework for building reproducible pipelines on T-cell and B-cell immune receptor repertoire data. Delivers multi-modal immune...
multi modalimmunerepertoireanalyticsimmunotherapy
https://lean-lang.org/use-cases/veil/
Veil: Multi-Modal Verification of Distributed Protocols — Lean Lang
Lean is an open-source programming language and proof assistant that enables correct, maintainable, and formally verified code.
multi modalveilverificationdistributedprotocols
https://newyougo.com/models/grok-imagine-video
Grok Imagine Video - Multi-modal AI Video Studio for Text, Image, and Source-Video Workflows
Grok Imagine Video is a short-form AI video studio for text-to-video, image-to-video, and video-to-video generation with synchronized audio, 480p or 720p...
grok imagine videomulti modal aitext imagestudiosource
https://ieeexplore.ieee.org/document/9200754
EgoCom: A Multi-Person Multi-Modal Egocentric Communications Dataset | IEEE Journals & Magazine |...
Multi-modal datasets in artificial intelligence (AI) often capture a third-person perspective, but our embodied human intelligence evolved with sensory input fr
multi personieee journalsmodalcommunicationsdataset
https://huggingface.co/papers/2503.11576
Paper page - SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal...
Join the discussion on this paper page
vision language modelultra compactpaperendmulti
https://sigport.org/documents/give-multi-agent-framework-generating-immersive-multi-modal-virtual-environments-3d-games
GIVE: A Multi-Agent Framework for Generating Immersive Multi-Modal Virtual Environments for 3D...
multi agent frameworkvirtual environmentsgivegeneratingimmersive