Hugging Face - Tags

All Tags

CogVLM is a powerful open-source Vision Language Model (VLM).

245DR: 91AS: 90

An open-source vision-language (VL) model designed for real-world visual and language understanding applications.

215DR: 90AS: 85

The LLaVA-NeXT model aims to enhance reasoning capabilities, OCR, and world knowledge.

215DR: 92AS: 87

Qwen-VL is a large-scale Vision Language Model (Large Vision Language Model, LVLM) developed by Alibaba Cloud.

245DR: 91AS: 85