etri-vilab/SafeLLaVA-7B
Image-Text-to-Text
β’
7B
β’
Updated
β’
36
β’
3
Visual Intelligence, Pretrained Vision-and-Language Model, Embodied AI, Collaborative Agents, Vision Task(Object Detection, Segmentation)