Quit Emailing Yourself

3 links tagged with all of: large-language-models + open-source

Click any tag below to further narrow down your results

Links

Open Source RL Libraries for LLMs | Anyscale

Reinforcement learning (RL) is becoming essential in developing large language models (LLMs), particularly for aligning them with human preferences and enhancing their capabilities through multi-turn interactions. This article reviews various open-source RL libraries, analyzing their designs and trade-offs to assist researchers in selecting the appropriate tools for specific applications. Key libraries discussed include TRL, Verl, OpenRLHF, and several others, each catering to different RL needs and architectures.

Saved by tldr-importer · Last saved October 29, 2025 · 6 min read

+ reinforcement-learning open-source ✓ + libraries large-language-models ✓ + agentic-rl

TikTok parent company ByteDance releases new open source Seed-OSS-36B model with 512K token context | VentureBeat

ByteDance has unveiled the Seed-OSS-36B, an open-source large language model with a remarkable 512K token context, surpassing many competitors. The release includes three variants aimed at balancing performance and research flexibility, enabling extensive applications without licensing fees.

Saved by tldr-importer · Last saved October 29, 2025 · 4 min read

+ bytedance + seed-oss-36b open-source ✓ + ai large-language-models ✓

LLM4Ranking: An Easy-to-use Framework of Utilizing Large Language Models for Document Reranking

LLM4Ranking is a unified framework designed to facilitate the utilization of large language models (LLMs) for document reranking in various applications, such as search engines. It offers a simple and extensible interface, along with evaluation and fine-tuning scripts, allowing users to experiment with different ranking methods and models on popular datasets. The framework aims to enhance the performance and efficiency of LLMs in document reranking tasks and is available as open-source code.

Saved by tldr-importer · Last saved October 29, 2025 · 1 min read

+ document-reranking large-language-models ✓ + information-retrieval + framework open-source ✓