Pyramidbox.pytorch

PyramidBox implemented in PyTorch

PyramidBox: A Context-assisted Single Shot Face Detector

A PyTorch Implementation of PyramidBox

Description

I trained PyramidBox with PyTorch, and the results approach those of the original paper. The pretrained model can be downloaded from vgg, and the final model from Pyramidbox. The AP on WIDER FACE is as follows:

| | Easy MAP | Medium MAP | Hard MAP |
| -------- | --------- | ------------ | --------- |
| original paper | 0.960 | 0.948 | 0.888 |
| this repo | 0.948 | 0.938 | 0.880 |

The AP on AFW, PASCAL, and FDDB is as follows:

| AFW | PASCAL | FDDB |
| --------- | ----------- | --------- |
| 99.65 | 99.02 | 0.983 |

The gap from the original paper is small. I trained for 120k iterations with a batch size of 4, which differs from the paper's setup and may be the cause of the gap. If you have more GPUs, the final result may be better.
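To put a number on the gap, the short sketch below subtracts this repo's WIDER FACE AP from the paper's AP for each subset. The values are copied from the table above; the dictionary names are only for illustration.

```python
# AP values copied from the WIDER FACE table above.
paper = {"easy": 0.960, "medium": 0.948, "hard": 0.888}
repo = {"easy": 0.948, "medium": 0.938, "hard": 0.880}

# Absolute gap per subset, rounded to three decimals to hide floating-point noise.
gap = {subset: round(paper[subset] - repo[subset], 3) for subset in paper}

for subset, diff in gap.items():
    print(f"{subset}: {diff:.3f}")
```

The difference is about one AP point or less on every subset.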

Requirement

pytorch 0.3
opencv
numpy
easydict

Prepare data

download the WIDER FACE dataset
modify data/config.py
python prepare_wider_data.py

Train

python train.py --batch_size 4 --lr 5e-4
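For readers unfamiliar with how such flags are usually consumed, here is a minimal sketch of parsing `--batch_size` and `--lr` with `argparse`. It is an assumption about the general pattern, not a copy of this repo's train.py; `build_parser` is a hypothetical helper, and the defaults simply mirror the command above.

```python
import argparse


def build_parser():
    # Hypothetical parser: models only the two flags shown in the command above.
    parser = argparse.ArgumentParser(description="PyramidBox training (sketch)")
    parser.add_argument("--batch_size", type=int, default=4,
                        help="number of images per iteration")
    parser.add_argument("--lr", type=float, default=5e-4,
                        help="initial learning rate")
    return parser


# Parse an explicit argument list so the sketch runs without a real command line.
args = build_parser().parse_args(["--batch_size", "4", "--lr", "5e-4"])
print(args.batch_size, args.lr)
```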

Evaluation

Modify data/config.py to match your own dataset paths.

Evaluate on AFW.

python tools/afw_test.py

Evaluate on FDDB

python tools/fddb_test.py

Evaluate on PASCAL face

python tools/pascal_test.py

Evaluate on WIDER FACE

python tools/wider_test.py
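If you want to run all four evaluations in one go, a small driver along these lines may help. It is a convenience sketch, not part of the repo: `eval_commands` and `run_all` are hypothetical names, and it assumes you launch it from the repository root after data/config.py has been updated.

```python
import subprocess
import sys

# Script paths taken from the commands above.
EVAL_SCRIPTS = [
    "tools/afw_test.py",
    "tools/fddb_test.py",
    "tools/pascal_test.py",
    "tools/wider_test.py",
]


def eval_commands(python=sys.executable):
    """Return one command (as an argument list) per evaluation script."""
    return [[python, script] for script in EVAL_SCRIPTS]


def run_all(dry_run=True):
    """Run each script in turn; with dry_run=True only print the commands."""
    for cmd in eval_commands():
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops at the first script that exits with an error.
            subprocess.run(cmd, check=True)


run_all(dry_run=True)
```

Call `run_all(dry_run=False)` to actually execute the scripts.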

Demo

You can test on your own images:

python demo.py

Result

References
