Results for "sft-data"

Claude Code Claude Desktop GitHub Copilot Cursor Windsurf Cline Zed JetBrains

📄SKILL.md 🤖CLAUDE.md ⚡Claude Commands 📐.cursorrules 📐Cursor Rules 🕹️AGENTS.md 🧬codex.md 🏄.windsurfrules 🔧.clinerules 🧑‍✈️Copilot Instructions

All Development Operations Data Product Marketing Customer Design Sales

29 skills found

JudgmentLabs / Judgeval

1.0k

The open source post-building layer for agents. Our environment data and evals power agent post-training (RL, SFT) and monitoring.

universal

agentagentic-aiagents+12

Updated 1d ago

InternScience / GraphGen

1.0k

GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

universal

ai4sciencedata-generationdata-synthesis+13

Updated 1d ago

chaoswork / Sft Datasets

575

开源SFT数据集整理,随时补充

universal

chinese-datasetdatasetslarge-language-models+2

Updated 1d ago

Gen-Verse / ReasonFlux

527

[NeurIPS 2025 Spotlight] LLM post-training suite — featuring ReasonFlux, ReasonFlux-PRM, and ReasonFlux-Coder.

gemini cli

chain-of-thoughtclawdbot-skillcode-generation+8

Updated 1d ago

argilla-io / Notus

168

Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach

universal

alignment-handbookdpofine-tuning+4

Updated 4d ago

synlp / ChiMed GPT

104

ChiMed-GPT is a Chinese medical large language model (LLM) built by continually training Ziya-v2 on Chinese medical data, where pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF) are comprehensively performed on it.

universal

Updated 2mo ago

ServiceNow / SyGra

SyGra - Graph-oriented Synthetic data generation Pipeline

universal

aidpoimage-datasets+10

Updated 6d ago

yiyepiaoling0715 / Codellm Data Preprocess Pipeline

代码大模型预训练&微调&DPO 数据处理业界处理pipeline sota

universal

codellm-completionfimfunction-dependency+3

Updated 9d ago

ZJU-REAL / TimeHC RL

This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).

universal

Updated 3mo ago

ly-geming / AnyCam2Ros

Turn any camera (Insta360, RealSense, USB webcam, etc.) into ROS2 image topics. Unified config for VLA deployment and SFT data collection.

universal

Updated 11d ago

vvincenttttt / Awesome 3D AutoLabeling Tools

Papers, code and datasets about deep learning/LLM SFT data auto-labeling.

universal

computer-visiondeep-learningdeep-neural-networks+6

Updated 3d ago

xiatingyu / SFT DataSelection At Scale

No description available

universal

Updated 11h ago

quanshr / AugCon

[AAAI 2025]Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity

universal

large-language-modelsupervised-finetuningsynthetic-data

Updated 25d ago

requestsession / IndexGPT

IndexGPT: 本地 AI 科研助手（RAG + SFT + LoRA）| Local AI research assistant for PDF parsing, retrieval QA, SFT data generation, and LoRA training.

universal

Updated 3d ago

Pe46dro / Bash MySQL Database SFTP FTP Backup

A small script to upload backup of MySQL database to an external FTP server

universal

backupftpmysql+1

Updated 5mo ago

ChristopheZhao / SFT Data Generation

Instruction Tuning data generation uses LLM in a specific scenario.

universal

Updated 6mo ago

seanzhang-zhichen / Qwen WisdomVast

Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and 2,000 single-turn self-cognition data, using the training methods of DORA and LORA+ based on Qwen1.5-7B as the base. Compared to Qwen1.5-7B-Chat, it has improved mathematical abilities by 5.16%, 12.8% on the Human

universal

Updated 1y ago

NVIDIA / Nvflow

Workflow orchestration framework for end-to-end synthetic data generation (SDG), training (SFT), and evaluation pipelines built on NVIDIA's NeMo ecosystem

universal

Updated 8d ago

Zizhao-HUANG / Financial Llm Dataset Pipeline

Modular pipeline for collecting, processing, and exporting financial data into LLM-ready formats (CPT/SFT/TXT). Built on a raw→silver→gold architecture with built-in auditing.

universal

data-pipelineetlfine-tuning+4

Updated 11d ago

EthioNLP / Afri Sft Data

This repository generates instruction tuning dataset from different datasets.

universal

Updated 1y ago