The Agent Index / Agent Frameworks / #276
mbzuai-oryx/groundingLMM
by mbzuai-oryx ยท Agent Frameworks ยท updated 10mo ago
[CVPR 2024 ๐ฅ] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
31
momentum
959
stars
56
forks
#276
rank
foundation-modelsllm-agentlmmvision-and-languagevision-language-model
View on GitHub โ