5 skills found
liangyuwang / Zo2ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory [COLM2025]
fannie1208 / W4S[COLM2025] "Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors"
zzbright1998 / SentenceKVOfficial implementation of "SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching" (COLM 2025). A novel KV cache compression method that organizes cache at sentence level using semantic similarity.
chenxshuo / True MiclCode of True Multimodal In-Context Learning Needs Attention to the Visual Context (COLM2025)
mhjiang0408 / MAC Bench[COLM2025] MAC: A Live Benchmark for Multimodal Large Language Models in Scientific Understanding