MetaDialog

Platform for few-shot natural language processing: Text Classification, Sequene Labeling.

Generate Convert Improve

Install / Use

/learn @AtmaHou/MetaDialog

About this skill

Quality Score

0/100

README

Meta Dialog Platform (MDP)

Meta Dialog Platform: a toolkit platform for NLP Few-Shot Learning tasks of:

Text Classification
Sequence Labeling

It also provides the baselines for:

Updates

Updates 2021.3.8: Fix wrong default setting for few-shot data generator scripts.
Updates 2020.9.17: FewJoint benchmark (Dataset for SMP) is available: paper, data, reformatted data (for MetaDialog)

Features

State-of-the-art solutions for Few-shot NLP:

Support Few-shot Learning for sequence-labeling task with state-of-the-art methods: CDT (Hou et al., 2020).
Support to use semantic within label name or label description.
Support various deep pre-trained embedding compatible with huggingface/transformers, such as BERT and Electra.
Support pair-wise embedding mechanism (Hou et al., 2020, Gao et al., 2019).

Easy-to-start & flexible framework:

Provide tools for easy training & testing.
Support various few-shot models with unified and extendable interfaces, such as ProtoNet and TapNet.
Support easy-to-switch similarity-metrics and logits-scaling methods.
Provide tools of generating episode-style data for meta-learning.

Citation

Please cite code and data:

@article{hou2020fewjoint,
	title={FewJoint: A Few-shot Learning Benchmark for Joint Language Understanding},
	author={Yutai Hou, Jiafeng Mao, Yongkui Lai, Cheng Chen, Wanxiang Che, Zhigang Chen, Ting Liu},
	journal={arXiv preprint},
	year={2020}
}

Get Started

Environment Requirement

python>=3.6
torch>=1.2.0
transformers>=2.9.0
numpy>=1.17.0
tqdm>=4.31.1
allennlp>=0.8.4
pytorch-nlp

Example for Sequence Labeling

Here, we take the few-shot slot tagging and NER task from (Hou et al., 2020) as quick start examples.

Step1: Prepare pre-trained embedding

Download the pytorch bert model, or convert tensorflow param by yourself with scripts.
Set BERT path in the ./scripts/run_1_shot_slot_tagging.sh to your setting:

bert_base_uncased=/your_dir/uncased_L-12_H-768_A-12/
bert_base_uncased_vocab=/your_dir/uncased_L-12_H-768_A-12/vocab.txt

Step2: Prepare data

Download the compatible few-shot data at here: download
Set test, train, dev data file path in ./scripts/run_1_shot_slot_tagging.sh to your setting.

For simplicity, your only need to set the root path for data as follow:

base_data_dir=/your_dir/ACL2020data/

Step3: Train and test the main model

Build a folder to collect running log

mkdir result

Execute cross-evaluation script with two params: -[gpu id] -[dataset name]

Example for 1-shot slot tagging:

source ./scripts/run_1_shot_slot_tagging.sh 0 snips

Example for 1-shot NER:

source ./scripts/run_1_shot_slot_tagging.sh 0 ner

To run 5-shots experiments, use ./scripts/run_5_shot_slot_tagging.sh

Other detailed functions and options:

You can experiment freely by passing parameters to main.py to choose different model architectures, hyperparameters, etc.

To view detailed options and corresponding descriptions, run commandline:

python main.py --h

We provide scripts for general few-shot classification and sequence labeling task respectively:

classification
- run_electra_sc.sh
- run_bert_sc.sh
sequence labeling
- run_electra_sl.sh
- run_bert_sl.sh

The usage of these scripts are similar to process in Get Started.

Run with FewJoint/SMP data

Get reformatted FewJoint data at here or construct episode-style data by yourself with our tool.
Use script ./scripts/run_smp_bert_sc.sh and ./scripts/run_smp_bert_sl.sh to perform few-shot intent detection or few-shot slot filling respectively.
Notice that:
1. Change train/dev/test path in the scripts before running.
2. Find predicted results at trained_model_path within running scripts.

Few-shot Data Construction Tool

We also provide a generation tool for converting normal data into few-shot/meta-episode style. The tool is included at path: scripts/other_tool/meta_dataset_generator.py.

Run following commandline to view detailed interface:

python generate_meta_dataset.py --h

For simplicity, we provide an example script to help generate few-shot data: ./scripts/gen_meta_data.sh.

The following are some key params for you to control the generation process:

input_dir: raw data path
output_dir: output data path
episode_num: the number of episode which you want to generate
support_shots_lst: to specified the support shot size in each episode, we can specified multiple number to generate at the same time.
query_shot: to specified the query shot size in each episode
seed_lst: random seed list to control random generation
use_fix_support: set the fix support in dev dataset
dataset_lst: specified the dataset type which our tool can handle, there are some choices: stanford & SLU & TourSG & SMP.

If you want to handle other type of dataset, you can add your code for load raw dataset in meta_dataset_generator/raw_data_loader.py.

few-shot/meta-episode style data example

{
  "domain_name": [
    {  // episode
      "support": {  // support set
        "seq_ins": [["we", "are", "friends", "."], ["how", "are", "you", "?"]],  // input sequence
        "seq_outs": [["O", "O", "O", "O"], ["O", "O", "O", "O"]],  // output sequence in sequence labeling task
        "labels": [["statement"], ["query"]]  // output labels in classification task
      },
      "query": {  // query set
        "seq_ins": [["we", "are", "friends", "."], ["how", "are", "you", "?"]],
        "seq_outs": [["O", "O", "O", "O"], ["O", "O", "O", "O"]],
        "labels": [["statement"], ["query"]]
      }
    },
    ...
  ],
  ...
}

Acknowledgment

The platform is developed by HIT-SCIR. If you have any question and advice for it, please contact us(Yutai Hou - ythou@ir.hit.edu.cn or Yongkui Lai - yklai@ir.hit.edu.cn).

License for code and data

Apache License 2.0

Related Skills

node-connect

346.4k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

107.2k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

346.4k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

346.4k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。