C3PO introduces a test-time approach for optimizing expert pathways in Mixture-of-Experts (MoE) Large Language Models, improving accuracy by 7-15% through collaborative re-weighting of the core experts in a model's critical layers. The re-weighting is driven by surrogate objectives built from successful neighboring samples, which lets MoE models with fewer parameters outperform larger counterparts; C3PO also surpasses existing test-time learning techniques across various benchmarks. A minimal illustrative sketch of the re-weighting idea follows the keywords below.
Keywords: mixture-of-experts, pathway-optimization, test-time-learning, large-language-models, accuracy-improvement
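To make the mechanism concrete, here is a minimal, self-contained PyTorch sketch of test-time pathway re-weighting. It is an illustrative toy, not C3PO's implementation: `ToyMoELayer`, `surrogate_loss`, and `reweight_pathway` are hypothetical names, and the surrogate objective (matching the mean hidden state of successful neighbors) is a simplified stand-in for the neighborhood-based objectives the summary mentions. Only a routing-logit offset is optimized at test time; all model weights stay frozen.

```python
# Toy sketch: test-time re-weighting of MoE routing via a surrogate objective.
# All class/function names here are hypothetical illustrations.
import torch
import torch.nn.functional as F


class ToyMoELayer(torch.nn.Module):
    """One MoE layer: a linear router plus `num_experts` tiny expert MLPs."""

    def __init__(self, dim: int, num_experts: int):
        super().__init__()
        self.router = torch.nn.Linear(dim, num_experts)
        self.experts = torch.nn.ModuleList(
            torch.nn.Linear(dim, dim) for _ in range(num_experts)
        )

    def forward(self, x: torch.Tensor, logit_offset: torch.Tensor) -> torch.Tensor:
        # `logit_offset` is the only test-time variable: it re-weights the
        # routing distribution without touching any pretrained weights.
        gates = F.softmax(self.router(x) + logit_offset, dim=-1)   # (B, E)
        expert_out = torch.stack([e(x) for e in self.experts], dim=-2)  # (B, E, D)
        return (gates.unsqueeze(-1) * expert_out).sum(dim=-2)     # (B, D)


def surrogate_loss(h: torch.Tensor, neighbor_h: torch.Tensor) -> torch.Tensor:
    # Surrogate objective: pull the test sample's representation toward the
    # mean representation of successful neighboring samples.
    target = neighbor_h.mean(dim=0, keepdim=True).expand_as(h)
    return F.mse_loss(h, target)


def reweight_pathway(layer: ToyMoELayer, x: torch.Tensor,
                     neighbor_h: torch.Tensor,
                     steps: int = 10, lr: float = 0.1) -> torch.Tensor:
    """Optimize a routing-logit offset for one test sample; weights frozen."""
    for p in layer.parameters():
        p.requires_grad_(False)  # model weights are never updated
    offset = torch.zeros(layer.router.out_features, requires_grad=True)
    opt = torch.optim.SGD([offset], lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        loss = surrogate_loss(layer(x, offset), neighbor_h)
        loss.backward()
        opt.step()
    return offset.detach()


if __name__ == "__main__":
    torch.manual_seed(0)
    layer = ToyMoELayer(dim=16, num_experts=4)
    x = torch.randn(1, 16)            # hidden state of the test sample
    neighbor_h = torch.randn(5, 16)   # hidden states of successful neighbors
    offset = reweight_pathway(layer, x, neighbor_h)
    print("learned routing offset:", offset)
```

In a full-scale version of this idea, the offset would be restricted to the critical layers and core experts the summary describes, rather than applied to a single toy layer as here.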