Using Explanations to Improve Ensembling of Visual Question Answering Systems

Using Explanations to Improve Ensembling of Visual Question Answering Systems (2017)

Nazneen Fatema Rajani and Raymond J. Mooney

We present results on using explanations as auxiliary features to improve stacked ensembles for Visual Question Answering (VQA). VQA is a challenging task that requires systems to jointly reason about natural language and vision. We present results applying a recent ensembling approach to VQA, Stacking with Auxiliary Features (SWAF), which learns to combine the results of multiple systems. We propose using features based on explanations to improve SWAF. Using explanations we are able to improve ensembling of three recent VQA systems.

View:

PDF

Citation:

In Proceedings of the IJCAI 2017 Workshop on Explainable Artificial Intelligence (XAI), pp. 43-47, Melbourne, Australia, August 2017.

Bibtex:

Presentation:

Poster

People

Raymond J. Mooney	Faculty	mooney [at] cs utexas edu
Nazneen Rajani	Ph.D. Alumni	nrajani [at] cs utexas edu

Areas of Interest

Ensemble Learning Explainable AI Language and Vision

Labs

Machine Learning