deepseek-ai / DeepSeek VL2 - DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
deepseek-ai / DeepSeek V2 - DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
vllm-project / Semantic Router - System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge
togethercomputer / MoA - Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
PKU-YuanGroup / MoE LLaVA - 【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models
deepseek-ai / DeepSeek MoE - DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
XueFuzhao / OpenMoE - A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
MoonshotAI / Kimi VL - Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
allenai / OLMoE - OLMoE: Open Mixture-of-Experts Language Models
Time-MoE / Time MoE - [ICLR 2025 Spotlight] Official implementation of "Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts"
lucidrains / Mixture Of Experts - A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models
hardmaru / Sketch Rnn - Multilayer LSTM and Mixture Density Network for modelling path-level SVG Vector Graphics data in TensorFlow
AviSoori1x / MakeMoE - From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
drawbridge / Keras Mmoe - A TensorFlow Keras implementation of "Modeling Task Relationships in Multi-task Learning with Multi-gate Mixture-of-Experts" (KDD 2018)
ldeecke / Gmm Torch - Gaussian mixture models in PyTorch.
withinmiaov / A Survey On Mixture Of Experts In LLMs - [TKDE'25] The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".
danieltan07 / Dagmm - My attempt at reproducing the paper Deep Autoencoding Gaussian Mixture Model for Unsupervised Anomaly Detection
Ablustrund / LoRAMoE - LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
sangmichaelxie / Doremi - Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
bing-jian / Gmmreg - Implementations of the robust point set registration algorithm described in "Robust Point Set Registration Using Gaussian Mixture Models", Bing Jian and Baba C. Vemuri, IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011, 33(8), pp. 1633-1645. For a Python implementation, please refer to http://github.com/bing-jian/gmmreg-python.