50 skills found · Page 1 of 2
InternScience / GraphGenGraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation
OFA-Sys / InsTagInsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning
synlp / ChiMed GPTChiMed-GPT is a Chinese medical large language model (LLM) built by continually training Ziya-v2 on Chinese medical data, where pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF) are comprehensively performed on it.
AI-Maker-Space / LLM Engineering Foundations To SLMs Open SourceLarge Language Model Engineering (LLM Engineering) refers to the emerging best-practices and tools for pretraining, post-training, and optimizing LLMs prior to production deployment. Pre- and post-training techniques include unsupervised pretraining, supervised fine-tuning, alignment, model merging, distillation, quantization. and others.
luo-junyu / SemiEvolSemiEvol: Semi-supervised Fine-tuning for LLM Adaptation
SharkSpicy-NLP / SR KISR-KI: Scalable and Real-Time Knowledge Integration into LLMs via Supervised Attention
WooooDyy / MathCritiqueImplementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".
UCSC-REAL / TokenCleaning[ICML 2025] Official implementation of paper "Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning"
gfyddha / UDSOfficial implementation of our paper: "Utility-Diversity Aware Online Batch Selection for LLM Supervised Fine-tuning."
climatechange-ai-tutorials / Nlp Policy AnalysisExplore how Natural Language Processing (NLP) can be used to assist in identifying and mapping climate-relevant literature using a supervised learning approach and leverage a state of the art Large Language Model (LLM) to classify climate policy documents.
slSeanWU / Beats Conformer Bart Audio CaptionerPyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"
knoxchat / KnoxchatKnox is a vigilant supervisor and management tool that ensures LLM teams rigorously develop reliable AI Agent programming extensions for VSCode and compatible editors.
vibrantlabsai / FuntunerSupervised instruction finetuning for LLM with HF trainer and Deepspeed
yongchao98 / R1 Code InterpreterR1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning
ianhohoho / Auto Hyde🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, coverage and applicability of HyDE
mnemon-dev / MnemonLLM-supervised persistent memory for AI agents — graph-based recall, cross-session knowledge, single binary. Works with Claude Code, OpenClaw, and any CLI agent.
CentreSecuriteIA / BELLSBenchmarks for the Evaluation of LLM Supervision
monaccode / AstromeshMulti-model AI agent runtime. Define agents in YAML, connect 6 LLM providers, orchestrate with ReAct/Plan&Execute/Fan-Out/Pipeline/Supervisor/Swarm patterns, and deploy as REST/WebSocket API with RAG, memory, MCP tools, guardrails, and OpenTelemetry observability.
LoveCatc / Supervised Llm Uncertainty EstimationThis repo contains code for paper: "Uncertainty Estimation and Quantification for LLMs: A Simple Supervised Approach".
yzhan238 / TELEClassThe source code used for paper "TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision", published in WWW 2025.