Results for "llm-supervised"

50 skills found · Page 1 of 2

InternScience / GraphGen

1.0k

GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

universal

ai4science, data-generation, data-synthesis, +13

Updated 1d ago

OFA-Sys / InsTag

285

InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning

universal

alignment, large-language-models, llama, +4

Updated 8d ago

synlp / ChiMed GPT

105

ChiMed-GPT is a Chinese medical large language model (LLM) built by continually training Ziya-v2 on Chinese medical data, covering pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF).

universal

Updated 14h ago

AI-Maker-Space / LLM Engineering Foundations To SLMs Open Source

Large Language Model Engineering (LLM Engineering) refers to the emerging best practices and tools for pretraining, post-training, and optimizing LLMs prior to production deployment. Pre- and post-training techniques include unsupervised pretraining, supervised fine-tuning, alignment, model merging, distillation, quantization, and others.

universal

Updated 10d ago

luo-junyu / SemiEvol

SemiEvol: Semi-supervised Fine-tuning for LLM Adaptation

universal

Updated 29d ago

SharkSpicy-NLP / SR KI

SR-KI: Scalable and Real-Time Knowledge Integration into LLMs via Supervised Attention

universal

Updated 1mo ago

WooooDyy / MathCritique

Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".

universal

llm, reasoning, scalable-oversight

Updated 3mo ago

UCSC-REAL / TokenCleaning

[ICML 2025] Official implementation of paper "Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning"

universal

instruction-tuning, llm, token-cleaning

Updated 1d ago

gfyddha / UDS

Official implementation of our paper: "Utility-Diversity Aware Online Batch Selection for LLM Supervised Fine-tuning."

universal

Updated 8d ago

climatechange-ai-tutorials / Nlp Policy Analysis

Explore how Natural Language Processing (NLP) can be used to assist in identifying and mapping climate-relevant literature using a supervised learning approach, and leverage a state-of-the-art Large Language Model (LLM) to classify climate policy documents.

universal

Updated 1mo ago

slSeanWU / Beats Conformer Bart Audio Captioner

PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"

universal

audio-captioning, clotho-dataset, dcase-challenge, +2

Updated 1mo ago

knoxchat / Knoxchat

Knox is a vigilant supervisor and management tool that ensures LLM teams rigorously develop reliable AI Agent programming extensions for VSCode and compatible editors.

vscode copilot

Updated 5d ago

vibrantlabsai / Funtuner

Supervised instruction finetuning for LLM with HF trainer and Deepspeed

universal

Updated 12d ago

yongchao98 / R1 Code Interpreter

R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning

universal

code-interpreter, large-language-models, planning, +2

Updated 8d ago

ianhohoho / Auto Hyde

🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, coverage and applicability of HyDE

universal

ai, hyde, langchain, +2

Updated 2mo ago

mnemon-dev / Mnemon

LLM-supervised persistent memory for AI agents — graph-based recall, cross-session knowledge, single binary. Works with Claude Code, OpenClaw, and any CLI agent.

claude code, claude desktop, +1

agent-framework, agent-memory, ai-agent, +17

Updated 1d ago

CentreSecuriteIA / BELLS

Benchmarks for the Evaluation of LLM Supervision

universal

Updated 2mo ago

monaccode / Astromesh

Multi-model AI agent runtime. Define agents in YAML, connect 6 LLM providers, orchestrate with ReAct/Plan&Execute/Fan-Out/Pipeline/Supervisor/Swarm patterns, and deploy as REST/WebSocket API with RAG, memory, MCP tools, guardrails, and OpenTelemetry observability.

claude code, cursor

agent-framework, agent-orchestration, agent-runtime, +17

Updated 18h ago

LoveCatc / Supervised Llm Uncertainty Estimation

This repo contains code for paper: "Uncertainty Estimation and Quantification for LLMs: A Simple Supervised Approach".

universal

Updated 2d ago

yzhan238 / TELEClass

The source code used for paper "TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision", published in WWW 2025.

universal

data-generation, hierarchical-text-classification, large-language-model, +5

Updated 7d ago