Alibaba Cloud launches open-source LVLM with image comprehension capability
Alibaba Cloud, the digital technology and intelligence backbone of Alibaba Group, launched two open-source large vision language models (LVLM): Qwen-VL, and its conversationally fine-tuned Qwen-VL-Chat. The models can comprehend images, texts, and bounding boxes in prompts and facilitate multi-round question answering in both English and Chinese. Qwen-VL is the multimodal version of Qwen-7B, Alibaba Cloud’s 7-billion-parameter […]