SkillAgentSearch skills...

Audiocaps

๐Ÿ”Š Repository for our NAACL-HLT 2019 paper: AudioCaps

Install / Use

/learn @cdjkim/Audiocaps
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

AudioCaps: Generating Captions for Audios in The Wild

๐ŸšจNEWS Feb.24.2025๐Ÿšจ We have released AudioCaps2.0 dataset with twice the size of the original AudioCaps dataset!

This repository contains the code and the dataset for our NAACL-HLT 2019 paper.

  • Chris Dongjoo Kim, Byeongchang Kim, Hyunmin Lee, and Gunhee Kim. AudioCaps: Generating Captions for Audios in The Wild. In NAACL-HLT, 2019. (Oral)

The Audio Captioning Task

For a live demo visit our website, https://audiocaps.github.io/

Citation

The code and the dataset are free to use for academic purposes only. If you use any of the material in this repository as part of your work, we ask you to cite:

@inproceedings{kim-NAACL-HLT-2019,
    author    = {Chris Dongjoo Kim and Byeongchang Kim and Hyunmin Lee and Gunhee Kim},
    title     = "{AudioCaps: Generating Captions for Audios in The Wild}"
    booktitle = {NAACL-HLT},
    year      = 2019
}

Last edit: Feb 24, 2025

Related Skills

View on GitHub
GitHub Stars207
CategoryDevelopment
Updated8d ago
Forks24

Languages

Python

Security Score

95/100

Audited on Mar 25, 2026

No findings