3 links
tagged with all of: large-language-models + open-source
Click any tag below to further narrow down your results
Links
Reinforcement learning (RL) is becoming essential in developing large language models (LLMs), particularly for aligning them with human preferences and enhancing their capabilities through multi-turn interactions. This article reviews various open-source RL libraries, analyzing their designs and trade-offs to assist researchers in selecting the appropriate tools for specific applications. Key libraries discussed include TRL, Verl, OpenRLHF, and several others, each catering to different RL needs and architectures.
ByteDance has unveiled the Seed-OSS-36B, an open-source large language model with a remarkable 512K token context, surpassing many competitors. The release includes three variants aimed at balancing performance and research flexibility, enabling extensive applications without licensing fees.
LLM4Ranking is a unified framework designed to facilitate the utilization of large language models (LLMs) for document reranking in various applications, such as search engines. It offers a simple and extensible interface, along with evaluation and fine-tuning scripts, allowing users to experiment with different ranking methods and models on popular datasets. The framework aims to enhance the performance and efficiency of LLMs in document reranking tasks and is available as open-source code.