Knowledge Constrained Decoding

Official code for the EMNLP 2023 paper "KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection" (https://arxiv.org/abs/2310.09044).

Environment

pip install -r requirements.txt
pip install -e .

Prepare Data

First, download the Wizard of Wikipedia (WoW) dataset through ParlAI.
Then set its path and run the preprocessing script:

export WOW_PATH=<PATH to WOW DATASET>
sh scripts/shell/data_process/preprocess_wow.sh 20 $WOW_PATH

Generate Partial Negative data

bash scripts/shell/data_process/partial_neg_gen.sh 0 wow 16  # for wow
bash scripts/shell/data_process/partial_neg_gen.sh 0 cnn_dailymail 16  # for cnn/dm data

Sample Random Negative data (for WoW only)

bash scripts/shell/data_process/random_neg.sh wow

Mix the datasets to your liking.

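# Sketch only (typos expected); set the three paths before running.
from datasets import concatenate_datasets, load_from_disk

partial_data_path = "<CHANGE HERE>"
random_data_path = "<CHANGE HERE>"
save_path = "<CHANGE HERE>"

partial_data = load_from_disk(partial_data_path)
random_data = load_from_disk(random_data_path)

merged_dataset = concatenate_datasets([partial_data, random_data])
# train_test_split returns a new DatasetDict, so keep the result.
merged_dataset = merged_dataset.train_test_split(test_size=0.1)

merged_dataset.save_to_disk(save_path)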

Train RIPA discriminator

# The numbers are positional arguments to the training script; their meanings are documented at the top of the script file.
sh scripts/shell/train/train_t5_token_classifier.sh 0 EOS 0 0 0 0  # train f
sh scripts/shell/train/train_t5_token_classifier.sh 0 RIPA 0 0 0 1  # finetune RIPA from f
sh scripts/shell/train/train_t5_token_classifier_cnn.sh 0 RIPA 0 0 0 0  # cnn

Run Weighted Decoding

sh scripts/shell/guided_run.sh 0 fudge RAND wow 8 0 0 0 ''
sh scripts/shell/guided_run.sh 0 nado ALL wow 8 1 0 0 ''
# KWD
sh scripts/shell/guided_run.sh 0 fudge RIPA wow 8 0 0 0 ''
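
The idea behind FUDGE-style weighted decoding can be sketched in a few lines: each candidate next token is scored by the language model's log-probability plus the discriminator's log-probability that the continuation stays faithful, and the scores are then renormalized. This is a minimal illustration, not the repository's implementation; the function name `reweight_top_k`, the `alpha` weight, and the toy numbers are assumptions made for the example.

```python
import math


def reweight_top_k(lm_logprobs, class_logprobs, alpha=1.0):
    """Combine LM and discriminator scores for candidate next tokens.

    lm_logprobs:    dict token -> log P_LM(token | prefix)
    class_logprobs: dict token -> log P(faithful | prefix + token)
    alpha:          hypothetical weight on the discriminator term
    Returns a dict token -> renormalized probability.
    """
    scores = {
        tok: lp + alpha * class_logprobs[tok]
        for tok, lp in lm_logprobs.items()
    }
    # Log-sum-exp normalization for numerical stability.
    max_score = max(scores.values())
    log_z = max_score + math.log(
        sum(math.exp(s - max_score) for s in scores.values())
    )
    return {tok: math.exp(s - log_z) for tok, s in scores.items()}


# Toy numbers (made up): the LM prefers "Paris", the discriminator prefers "Lyon".
lm = {"Paris": math.log(0.6), "Lyon": math.log(0.4)}
disc = {"Paris": math.log(0.2), "Lyon": math.log(0.9)}
probs = reweight_top_k(lm, disc)
```

With these toy inputs the discriminator term flips the ranking, so "Lyon" ends up with the higher probability after renormalization.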

Run MCTS (KCTS)

sh scripts/shell/ppl_mcts_run.sh 0 RIPA '' wow 8 0 0 0 0 0
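
For readers unfamiliar with MCTS-based decoding, the selection step can be sketched as follows. This assumes an AlphaZero-style PUCT rule, which is common in MCTS decoders; the rule and constants actually used by the script may differ. The names `puct_score`, `select_child`, and `c_puct`, and the toy statistics, are hypothetical.

```python
import math


def puct_score(q_value, prior, parent_visits, child_visits, c_puct=1.0):
    """PUCT: exploitation term (Q) plus a prior-weighted exploration bonus."""
    exploration = c_puct * prior * math.sqrt(parent_visits) / (1 + child_visits)
    return q_value + exploration


def select_child(children, parent_visits, c_puct=1.0):
    """children: dict token -> (q_value, prior, visits). Returns the best token."""
    return max(
        children,
        key=lambda tok: puct_score(
            children[tok][0], children[tok][1], parent_visits,
            children[tok][2], c_puct,
        ),
    )


# Toy statistics (made up): "a" is well explored, "b" has never been visited.
children = {
    "a": (0.5, 0.7, 10),
    "b": (0.4, 0.3, 0),
}
best = select_child(children, parent_visits=10)
```

In this toy case the unvisited token "b" is selected, because its exploration bonus outweighs the slightly higher value estimate of "a".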

Guide GPT 3.5

This requires training RIPA on GPT-2 first. Check out scripts/shell/train/train_token_classifier_gpt.sh.

export EXP_ROOT=<ROOT DIRECTORY FOR EXPERIMENT>
sh scripts/shell/openai_guided_run.sh 0 RIPA 4 $EXP_ROOT 0 0 3 0 0 0

Evaluation

We evaluate with UniEval (Zhong et al., 2022), MFMA (Lee et al., 2022; summarization only), and token-based metrics.

sh scripts/eval/unieval.sh
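
As an illustration of what a token-based metric looks like, the sketch below computes a unigram-overlap F1 between a generated response and a reference text (for example, the grounding knowledge). It assumes lowercased whitespace tokenization, which is a simplification; it is not the repository's metric code, and `unigram_f1` is a hypothetical name.

```python
from collections import Counter


def unigram_f1(prediction, reference):
    """Unigram-overlap F1 using lowercased whitespace tokens."""
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    if not pred_tokens or not ref_tokens:
        return 0.0
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


score = unigram_f1(
    "the eiffel tower is in paris",
    "the eiffel tower is located in paris france",
)
```

Here all six predicted tokens appear in the eight-token reference, so precision is 1.0 and recall is 0.75.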

One can also evaluate the confidence of $f$ using the scripts/eval/class_prob.sh script.
See also scripts/eval/test_t5_token_classifier.sh to evaluate classifier performance.
