https://arxiv.org/abs/2408.03326
[2408.03326] LLaVA-OneVision: Easy Visual Task Transfer
Abstract page for arXiv paper 2408.03326: LLaVA-OneVision: Easy Visual Task Transfer
llavaeasyvisualtasktransfer
https://fireworks.ai/models/fireworks/llava-yi-34b
LLaVA V1.6 Yi 34B
LLaVA is an open-source chatbot trained by fine-tuning LLMs on multimodal instruction-following data. It is an auto-regressive language model, based on the...
llavayi
https://wandb.ai/byyoung3/ml-news/reports/TinyLLaVA-LLaVA-Just-Got-Faster---Vmlldzo2OTY2MjMz
TinyLLaVA: LLaVA Just Got Faster
llavagotfaster
https://arxiv.org/abs/2411.14505v1
[2411.14505v1] LLaVA-MR: Large Language-and-Vision Assistant for Video Moment Retrieval
Abstract page for arXiv paper 2411.14505v1: LLaVA-MR: Large Language-and-Vision Assistant for Video Moment Retrieval
https://learn.theaiedge.io/courses/introduction-to-langchain/lectures/50753269
Describing Images with LlaVA | The AiEdge
Learn to build Software Applications with Large Language Models
describingimagesllava
https://docs.vllm.ai/en/latest/api/vllm/model_executor/models/llava_next_video/
llava_next_video - vLLM
next videollavavllm
https://www.ilovefreesoftware.com/30/programming/free-open-source-chatgpt-alternative-with-vision-capabilities-llava.html
Free Open Source ChatGPT Alternative with Vision Capabilities: LLaVA
LLaVA is a large language model and vision assistant. Try this new open source GPT-4 alternative on your own machine.
open sourcechatgpt alternativevision capabilitiesfreellava
https://paperium.net/article/en/3032/pg-video-llava-pixel-grounding-large-video-language-models
PG-Video-LLaVA: Pixel Grounding Large Video-Language Models: Analysis, Review & Summary | Paperium
Quick breakdown of the 'PG-Video-LLaVA: Pixel Grounding Large Video-Language Models' paper. Methods, results, strengths/weaknesses explained in plain
large language models
https://aiiiii.com.cn/sites/8424.html
MG-LLaVA: 增强视觉处理能力的机器学习语言模型(MLLM) | AI设计导航
MG-LLaVA是一个增强视觉处理能力的机器学习语言模型(MLLM),通过整合多粒度视觉流程,包括低分辨率、高分辨率和以对象为中心的特征。提出了一个额外的高分辨率视觉编码器来捕捉细节,并通过Conv-Gate融合网络与基础视觉特征融合。
mgllavamllm
https://writingmate.ai/models/liuhaotian/llava-yi-34b
LLaVA v1.6 34B - AI Model Details | Writingmate
LLaVA Yi 34B is an open-source model trained by fine-tuning LLM on multimodal instruction-following data.
ai modelllavadetails
https://optiprime.com.au/gpt-4v-compared-to-llava-a-detailed-contrast/
GPT-4V Compared to LLaVa: A Detailed Contrast - OptiPrime
Nov 14, 2023 - On the 6th of November, 2023, OpenAI introduced its cutting-edge GPT-4V (GPT-4 with Vision), showcasing it as an advanced multimodal model during the premiere
compared togptllavadetailedcontrast