https://openreview.net/forum?id=9hd5WA6QCn
MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding |...
Multimodal large language models (MLLMs) recently showed strong capacity in integrating data among multiple modalities, empowered by generalizable attention...
cognition and emotion
multimodal perception
modular
duplex
attention
https://praeclarumjj3.github.io/publication/visper-lm/
Elevating Perception in Multimodal LLMs with Visual Embedding Distillation | Jitesh Jain
perception
multimodal
llms
visual
embedding