ParaphraseGen
No description available
Install / Use
/learn @arvind385801/ParaphraseGenREADME
A Deep Generative Framework for Paraphrase Generation
Model:
This is the implementation of A Deep Generative Framework for Paraphrase Generation by Ankush et al. (AAA2018) with Kim's Character-Aware Neural Language Models embedding for tokens. The code used the Samuel Bowman's Generating Sentences from a Continuous Space implementation as a base code available here.
Usage
Before model training it is necessary to train word embeddings for both questions and its paraphrases:
$ python train_word_embeddings.py --num-iterations 1200000
$ python train_word_embeddings_2.py --num-iterations 1200000
This script train word embeddings defined in Mikolov et al. Distributed Representations of Words and Phrases
Parameters:
--use-cuda
--num-iterations
--batch-size
--num-sample –– number of sampled from noise tokens
To train model use:
$ python train.py --num-iterations 140000
Parameters:
--use-cuda
--num-iterations
--batch-size
--learning-rate
--dropout –– probability of units to be zeroed in decoder input
--use-trained –– use trained before model
To sample data after training use:
$ python test.py
Parameters:
--use-cuda
--num-sample
Related Skills
node-connect
345.4kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
104.6kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
345.4kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
345.4kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
