3 skills found
voidful / TextRL: Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in Hugging Face's transformers library (bloomz-176B/bloom/gpt/bart/T5/MetaICL)
naver / Gdc: Code accompanying our papers on the "Generative Distributional Control" framework
ewsheng / Controllable Nlg Biases: Framework for controlling demographic biases in NLG (using adversarial prompts)