Pyramidbox.pytorch
Pyramidbox implement with pytorch
Install / Use
/learn @yxlijun/Pyramidbox.pytorchREADME
PyramidBox: A Context-assisted Single Shot Face Detector.##
A PyTorch Implementation of PyramidBox
Description
I train pyramidbox with pytorch and the result approaches the original paper result,the pretrained model can be downloaded in vgg,the final model can be downloaded in Pyramidbox.the AP in WIDER FACE as following:
| | Easy MAP | Medium MAP | hard MAP | | -------- | ---------|------------| --------- | | origin paper| 0.960 | 0.948 | 0.888 | | this repo | 0.948 | 0.938 | 0.880 |
the AP in AFW,PASCAL,FDDB as following:
| AFW | PASCAL | FDDB | | --------- |-----------| ---------| | 99.65 | 99.02 | 0.983 |
the gap is small with origin paper,I train 120k batch_size 4 which is different from paper,which maybe cause the gap,if you have more gpu ,the final result maybe better.
Requirement
- pytorch 0.3
- opencv
- numpy
- easydict
Prepare data
- download WIDER face dataset
- modify data/config.py
python prepare_wider_data.py
Train
python train.py --batch_size 4
--lr 5e-4
Evalution
according to yourself dataset path,modify data/config.py
- Evaluate on AFW.
python tools/afw_test.py
- Evaluate on FDDB
python tools/fddb_test.py
- Evaluate on PASCAL face
python tools/pascal_test.py
- test on WIDER FACE
python tools/wider_test.py
Demo
you can test yourself image
python demo.py
Result
<div align="center"> <img src="https://github.com/yxlijun/Pyramidbox.pytorch/blob/master/tmp/gsmarena_001.jpg" height="250px" alt="demo" > <img src="https://github.com/yxlijun/Pyramidbox.pytorch/blob/master/tmp/0_Parade_marchingband_1_488.jpg" height="200px" alt="demo" > </div>References
Related Skills
node-connect
349.0kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
109.4kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
349.0kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
349.0kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
