mbzuai-oryx/groundingLMM

by mbzuai-oryx · Agent Frameworks · updated 10mo ago

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

31 momentum · 959 stars · 56 forks · rank #276
foundation-models · llm-agent · lmm · vision-and-language · vision-language-model