Qwen has released the Qwen3-VL-Embedding and Qwen3-VL-Reranker models, designed for advanced multimodal information retrieval and cross-modal understanding. These models support various inputs, including text and images, and enhance retrieval accuracy through a two-stage process of initial recall and precise re-ranking.
MS MARCO Web Search is a comprehensive dataset designed for information retrieval research, featuring millions of real clicked query-document labels and a vast corpus from ClueWeb22. It supports various tasks in machine learning and retrieval systems, offering a benchmark for evaluating retrieval methods and performance across large datasets. Researchers can utilize this dataset to investigate the effectiveness of their techniques on both small and large data scales.