StackGAN
No description available
Install / Use
/learn @AarohiSingla/StackGANREADME
StackGAN
Synthesizing high-quality images from text descriptions
To understand the code, Check video: https://youtu.be/ye6pYwBQQL4
Create 3 folders (test, weights,results_stage2) in your current working directory.
- <b>weights </b> (your model weights will be saved here)
- <b>test </b> (generated images from our stage I StackGAN)
- <b>results_stage2 </b> will have the generated images from stage2 fo StackGAN
About Dataset
Dataset Name: Caltech-UCSD Birds-200-2011
Download from : http://www.vision.caltech.edu/visipedia/CUB-200-2011.html
Text Embedding Model
Download char-CNN-RNN text embeddings for birds from :
https://drive.google.com/file/d/0B3y_msrWZaXLT1BZdVdycDY5TEE/view?resourcekey=0-sZrhftoEfdvHq6MweAeCjA or https://github.com/hanzhanggit/StackGAN
- char-CNN-RNN-embeddings.pickle — Dataframe for the pre-trained embeddings of the text.
- filenames.pickle — Dataframe containing the filenames of the images.
- class_info.pickle — Dataframe containing the info of classes for each image.
Related Skills
node-connect
341.8kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
84.6kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
341.8kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
commit-push-pr
84.6kCommit, push, and open a PR
