tloen / Alpaca Lora - Instruct-tune LLaMA on consumer hardware
baaivision / Emu - Emu Series: Generative Multimodal Models from BAAI
SinclairCoder / Instruction Tuning Papers - A reading list on instruction tuning, a trend that started with Natural Instructions (ACL 2022), FLAN (ICLR 2022), and T0 (ICLR 2022).
AdaBit-AI / Parameter Efficient Instruction Tuning - No description available
declare-lab / Instruct Eval - This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
zhilizju / Awesome Instruction Tuning - A curated list of awesome instruction tuning datasets, models, papers and repositories.
Shenzhi-Wang / Llama3 Chinese Chat - The first Chinese chat model fine-tuned specifically for Chinese through ORPO, based on the Meta-Llama-3-8B-Instruct model.
ystemsrx / Qwen2 Boundless - A model fine-tuned from Qwen2-1.5B-Instruct, capable of handling sensitive topics such as violence and explicit content.
xiaoya-li / Instruction Tuning Survey - Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`
ystemsrx / Qwen2.5 Sex - A model fine-tuned from Qwen2.5-1.5B-Instruct, capable of handling sensitive topics, primarily sexually explicit content.
yizhongw / Tk Instruct - Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.
BAAI-DCAI / Visual Instruction Tuning - SVIT: Scaling up Visual Instruction Tuning
shizhediao / R Tuning - [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't Know'"
thinksoso / ChatGLM Instruct Tuning - Fine-tuning ChatGLM
bupticybee / FastLoRAChat - Instruct-tune LLaMA on consumer hardware with ShareGPT data
ldzhangyx / Instruct MusicGen - The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning".
palchenli / VL Instruction Tuning - No description available
poteminr / Instruct Ner - Instruction-tuned LLMs for flat and nested NER: fine-tuning Llama and Mistral models for instruction-based named entity recognition (Instruction NER).
vihangd / Alpaca Qlora - Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA
sail-sg / Symbolic Instruction Tuning - The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".