# AVSD

Code for the paper:

**Audio Visual Scene-Aware Dialog**
Huda Alamri, Vincent Cartillier, Abhishek Das, Jue Wang, Stefan Lee, Peter Anderson, Irfan Essa, Devi Parikh, Dhruv Batra, Anoop Cherian, Tim K. Marks, Chiori Hori

Duplicate repo, from: https://github.com/batra-mlp-lab/avsd

Website: video-dialog.com

This code has been developed on top of [batra-mlp-lab/visdial-challenge-starter-pytorch](https://github.com/batra-mlp-lab/visdial-challenge-starter-pytorch).
## Setup

```sh
# create and activate the environment
conda env create -n avsd -f env.yml
conda activate avsd
```
## Data

Download the `'split'.json` data files at: video-dialog.com
## Preprocessing

- Build the dialogs JSON files with options using `makejson_with_options.py` (output: `'split'_options.json`).
- Adapt the JSON format using `convert_json_to_visdial_style.py` (output: `'split'_options_2.json`, which can then be renamed to `'split'_options.json`).
- Build tokenized captions, dialogs, and image paths with `prepro.py` (output: `dialogs.h5` and `params.json`); a quick way to inspect these outputs is sketched right after this list.
- Build the image features (if working with images) using `prepro_img_vgg16.lua` or `prepro_img_resnet.lua` from [batra-mlp-lab/visdial-challenge-starter-pytorch](https://github.com/batra-mlp-lab/visdial-challenge-starter-pytorch) (output: `data_img.h5`).
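To sanity-check the text preprocessing step, a minimal sketch like the one below (ours, not part of the repo) lists what `prepro.py` wrote to `dialogs.h5` and `params.json` without assuming any particular key names; run it from the directory holding the outputs:

```python
import json

import h5py

# List the contents of the prepro.py outputs without assuming key names.
with open('params.json') as f:
    params = json.load(f)
print('params.json keys:', sorted(params.keys()))

with h5py.File('dialogs.h5', 'r') as hf:
    def show(name, obj):
        # Print shape and dtype of every dataset in the file.
        if isinstance(obj, h5py.Dataset):
            print(name, obj.shape, obj.dtype)
    hf.visititems(show)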
Build the I3D video features (output: `data_video.h5`) using https://github.com/piergiaj/pytorch-i3d.git.
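A rough sketch of how the I3D features could be extracted and written to `data_video.h5` follows. `InceptionI3d`, its `extract_features` method, and the `rgb_imagenet.pt` checkpoint come from the pytorch-i3d repo; the `load_frames` helper, the checkpoint path, and the one-dataset-per-video HDF5 layout are assumptions to adapt to your setup:

```python
import h5py
import torch

from pytorch_i3d import InceptionI3d  # from piergiaj/pytorch-i3d


def load_frames(video_id):
    """Hypothetical helper: return RGB frames as a float32 array of shape
    (num_frames, 224, 224, 3), scaled to [-1, 1] as I3D expects."""
    raise NotImplementedError


# RGB I3D backbone; the rgb_imagenet.pt checkpoint ships with pytorch-i3d.
i3d = InceptionI3d(400, in_channels=3)
i3d.load_state_dict(torch.load('models/rgb_imagenet.pt'))
i3d.eval()

video_ids = ['example_video']  # replace with your list of clip ids

with h5py.File('data_video.h5', 'w') as hf:
    for vid in video_ids:
        frames = load_frames(vid)  # (T, 224, 224, 3)
        # (T, H, W, C) -> (1, C, T, H, W), the layout I3D expects
        clip = torch.from_numpy(frames).permute(3, 0, 1, 2).unsqueeze(0)
        with torch.no_grad():
            feat = i3d.extract_features(clip)  # (1, 1024, T', 1, 1)
        # Store one (1024, T') feature matrix per video id.
        hf.create_dataset(vid, data=feat.squeeze(0).squeeze(-1).squeeze(-1).numpy())
```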
Build the AENet audio features (output: `data_audio.h5`) using https://github.com/znaoya/aenet.git.
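The AENet extraction itself happens in the linked repo; assuming you have already dumped one feature array per video as `.npy` (this layout and the one-dataset-per-video naming are illustrative, not a convention of either repo), packing them into `data_audio.h5` could look like:

```python
import glob
import os

import h5py
import numpy as np

# Pack precomputed AENet features (one .npy array per video) into a
# single HDF5 file keyed by video id. Paths and layout are assumptions.
with h5py.File('data_audio.h5', 'w') as hf:
    for path in sorted(glob.glob('aenet_features/*.npy')):
        video_id = os.path.splitext(os.path.basename(path))[0]
        hf.create_dataset(video_id, data=np.load(path))
```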
## Training

Run `train.py`.
## Evaluation

Run `evaluate.py --use_gt`.
## Visualization

Run `viz.py`.
## Citation

```
@inproceedings{alamri2019audio,
  title={Audio visual scene-aware dialog},
  author={Alamri, Huda and Cartillier, Vincent and Das, Abhishek and Wang, Jue and Cherian, Anoop and Essa, Irfan and Batra, Dhruv and Marks, Tim K and Hori, Chiori and Anderson, Peter and others},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  pages={7558--7567},
  year={2019}
}
```
## License

BSD