Qwen has released the Qwen3-VL-Embedding and Qwen3-VL-Reranker models, designed for advanced multimodal information retrieval and cross-modal understanding. These models support various inputs, including text and images, and enhance retrieval accuracy through a two-stage process of initial recall and precise re-ranking.
Qwen3 Embedding series introduces a new set of models designed for text embedding, retrieval, and reranking tasks, leveraging the advanced multilingual capabilities of the Qwen3 foundation model. These open-sourced models demonstrate state-of-the-art performance in multiple benchmarks and provide flexibility in size and functionality for various applications. The series aims to enhance text understanding and retrieval efficiency, with ongoing optimizations planned for future development.