UTCS Connecting Language Acquisition with Machine Perception Group Meeting

[ Meeting Schedule | Previous Discussions | Subscribe to Email List ]

Previous Discussions

Fall 2022

Date Time Place Paper

11/29/2022 12:00PM Zoom Jaemin Cho, Abhay Zala, Mohit Bansal
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Models, arXiv 2022
Ben Hutchinson, Jason Baldridge, Vinodkumar Prabhakaran
Underspecification in Scene Description-to-Depiction Tasks, arXiv 2022

11/15/2022 12:00PM Zoom Jesse Thomason, Mohit Shridhar, Yonatan Bisk, Chris Paxton, Luke Zettlemoyer
Language Grounding with 3D Objects, CoRL 2021

11/1/2022 12:00PM Zoom Ben Poole, Ajay Jain, Jonathan T. Barron, Ben Mildenhall
DreamFusion: Text-to-3D using 2D Diffusion, arXiv 2022

10/25/2022 12:00PM Zoom Jonathan Ho, William Chan, Chitwan Saharia, Jay Whang, Ruiqi Gao, Alexey Gritsenko, Diederik P. Kingma, Ben Poole, Mohammad Norouzi, David J. Fleet, Tim Salimans
Imagen Video: High Definition Video Generation with Diffusion Models, arXiv 2022

10/11/2022 12:00PM Zoom Uriel Singer, Adam Polyak, Thomas Hayes, Xi Yin, Jie An, Songyang Zhang, Qiyuan Hu, Harry Yang, Oron Ashual, Oran Gafni, Devi Parikh, Sonal Gupta, Yaniv Taigman
Make-A-Video: Text-to-Video Generation without Text-Video Data, arXiv 2022

9/27/2022 12:00PM Zoom Mohit Shridhar, Lucas Manuelli, Dieter Fox
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation, arXiv 2022

9/13/2022 12:00PM Zoom Michael Ahn, Anthony Brohan, Noah Brown, Yevgen Chebotar, Omar Cortes, Byron David, Chelsea Finn, Chuyuan Fu, Keerthana Gopalakrishnan, Karol Hausman, Alex Herzog, Daniel Ho, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Eric Jang, Rosario Jauregui Ruano, Kyle Jeffrey, Sally Jesmonth, Nikhil J Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Kuang-Huei Lee, Sergey Levine, Yao Lu, Linda Luu, Carolina Parada, Peter Pastor, Jornell Quiambao, Kanishka Rao, Jarek Rettinghouse, Diego Reyes, Pierre Sermanet, Nicolas Sievers, Clayton Tan, Alexander Toshev, Vincent Vanhoucke, Fei Xia, Ted Xiao, Peng Xu, Sichun Xu, Mengyuan Yan, Andy Zeng
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances, arXiv 2022

Date	Time	Place	Paper
11/29/2022	12:00PM	Zoom	Jaemin Cho, Abhay Zala, Mohit Bansal DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Models, arXiv 2022 Ben Hutchinson, Jason Baldridge, Vinodkumar Prabhakaran Underspecification in Scene Description-to-Depiction Tasks, arXiv 2022
11/15/2022	12:00PM	Zoom	Jesse Thomason, Mohit Shridhar, Yonatan Bisk, Chris Paxton, Luke Zettlemoyer Language Grounding with 3D Objects, CoRL 2021
11/1/2022	12:00PM	Zoom	Ben Poole, Ajay Jain, Jonathan T. Barron, Ben Mildenhall DreamFusion: Text-to-3D using 2D Diffusion, arXiv 2022
10/25/2022	12:00PM	Zoom	Jonathan Ho, William Chan, Chitwan Saharia, Jay Whang, Ruiqi Gao, Alexey Gritsenko, Diederik P. Kingma, Ben Poole, Mohammad Norouzi, David J. Fleet, Tim Salimans Imagen Video: High Definition Video Generation with Diffusion Models, arXiv 2022
10/11/2022	12:00PM	Zoom	Uriel Singer, Adam Polyak, Thomas Hayes, Xi Yin, Jie An, Songyang Zhang, Qiyuan Hu, Harry Yang, Oron Ashual, Oran Gafni, Devi Parikh, Sonal Gupta, Yaniv Taigman Make-A-Video: Text-to-Video Generation without Text-Video Data, arXiv 2022
9/27/2022	12:00PM	Zoom	Mohit Shridhar, Lucas Manuelli, Dieter Fox Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation, arXiv 2022
9/13/2022	12:00PM	Zoom	Michael Ahn, Anthony Brohan, Noah Brown, Yevgen Chebotar, Omar Cortes, Byron David, Chelsea Finn, Chuyuan Fu, Keerthana Gopalakrishnan, Karol Hausman, Alex Herzog, Daniel Ho, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Eric Jang, Rosario Jauregui Ruano, Kyle Jeffrey, Sally Jesmonth, Nikhil J Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Kuang-Huei Lee, Sergey Levine, Yao Lu, Linda Luu, Carolina Parada, Peter Pastor, Jornell Quiambao, Kanishka Rao, Jarek Rettinghouse, Diego Reyes, Pierre Sermanet, Nicolas Sievers, Clayton Tan, Alexander Toshev, Vincent Vanhoucke, Fei Xia, Ted Xiao, Peng Xu, Sichun Xu, Mengyuan Yan, Andy Zeng Do As I Can, Not As I Say: Grounding Language in Robotic Affordances, arXiv 2022

Spring 2022

Date Time Place Paper

4/25/2022 3:00PM Zoom Tristan Thrush, Ryan Jiang, Max Bartolo, Amanpreet Singh, Adina Williams, Douwe Kiela, Candace Ross
Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality, CVPR 2022

4/11/2022 3:00PM Zoom Oran Gafni, Adam Polyak, Oron Ashual, Shelly Sheynin, Devi Parikh, Yaniv Taigman
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors, arXiv 2022

3/25/2022 3:00PM Zoom Puyuan Peng, David Harwath
Fast-Slow Transformer for Visually Grounding Speech, ICASSP 2022

2/28/2022 3:00PM Zoom Dylan Ebert, Ellie Pavlick
A Visuospatial Dataset for Naturalistic Verb Learning, NAACL 2022

2/14/2022 3:00PM Zoom DeepMind Interactive Agents Team: Josh Abramson, Arun Ahuja, Arthur Brussee, Federico Carnevale, Mary Cassin, Felix Fischer, Petko Georgiev, Alex Goldin, Mansi Gupta, Tim Harley, Felix Hill, Peter C Humphreys, Alden Hung, Jessica Landon, Timothy Lillicrap, Hamza Merzic, Alistair Muldal, Adam Santoro, Guy Scully, Tamara von Glehn, Greg Wayne, Nathaniel Wong, Chen Yan, Rui Zhu
Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning, arXiv 2022

1/31/2022 3:00PM Zoom Wenlong Huang, Pieter Abbeel, Deepak Pathak, Igor Mordatch
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents, ICML 2022

Date	Time	Place	Paper
4/25/2022	3:00PM	Zoom	Tristan Thrush, Ryan Jiang, Max Bartolo, Amanpreet Singh, Adina Williams, Douwe Kiela, Candace Ross Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality, CVPR 2022
4/11/2022	3:00PM	Zoom	Oran Gafni, Adam Polyak, Oron Ashual, Shelly Sheynin, Devi Parikh, Yaniv Taigman Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors, arXiv 2022
3/25/2022	3:00PM	Zoom	Puyuan Peng, David Harwath Fast-Slow Transformer for Visually Grounding Speech, ICASSP 2022
2/28/2022	3:00PM	Zoom	Dylan Ebert, Ellie Pavlick A Visuospatial Dataset for Naturalistic Verb Learning, NAACL 2022
2/14/2022	3:00PM	Zoom	DeepMind Interactive Agents Team: Josh Abramson, Arun Ahuja, Arthur Brussee, Federico Carnevale, Mary Cassin, Felix Fischer, Petko Georgiev, Alex Goldin, Mansi Gupta, Tim Harley, Felix Hill, Peter C Humphreys, Alden Hung, Jessica Landon, Timothy Lillicrap, Hamza Merzic, Alistair Muldal, Adam Santoro, Guy Scully, Tamara von Glehn, Greg Wayne, Nathaniel Wong, Chen Yan, Rui Zhu Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning, arXiv 2022
1/31/2022	3:00PM	Zoom	Wenlong Huang, Pieter Abbeel, Deepak Pathak, Igor Mordatch Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents, ICML 2022

Fall 2021

Date Time Place Paper

12/01/2021 1:00PM Zoom Haonan Chen, Hao Tan, Alan Kuntz, Mohit Bansal, Ron Alterovitz
Enabling Robots to Understand Incomplete Natural Language Instructions Using Commonsense Reasoning, ICRA 2020

11/10/2021 1:00PM Zoom Peixin Chang, Shuijing Liu, Katherine Driggs-Campbell
Robot Sound Interpretation: Learning Visual-Audio Representations for Voice-Controlled Robots, arXiv 2021

10/27/2021 1:00PM Zoom Vanya Cohen, Geraud Nangue Tasse, Nakul Gopalan, Steven James, Matthew Gombolay, Benjamin Rosman
Learning to Follow Language Instructions with Compositional Policies, AAAI Fall Symposium Series 2021

10/13/2021 1:00PM Zoom Mohit Shridhar, Lucas Manuelli, Dieter Fox
CLIPort: What and Where Pathways for Robotic Manipulation, CoRL 2021

09/29/2021 1:00PM Zoom Tianshi Cao, Jingkang Wang, Yining Zhang, Sivabalan Manivasagam, Jialu Wang, Mengye Ren
Zero-Shot Compositional Policy Learning via Language Grounding, NeurIPS Datasets and Benchmarks Track 2021 (under review)

04/26/2021 1:00PM Zoom Karan Desai, Justin Johnson
VirTex: Learning Visual Representations from Textual Annotations, CVPR 2021

09/01/2021 1:00PM Zoom Xiao Wang, Xiujun Shu, Zhipeng Zhang, Bo Jiang, Yaowei Wang, Yonghong Tian, Feng Wu
Towards More Flexible and Accurate Object Tracking with Natural Language: Algorithms and Benchmark, CVPR 2021

Date	Time	Place	Paper
12/01/2021	1:00PM	Zoom	Haonan Chen, Hao Tan, Alan Kuntz, Mohit Bansal, Ron Alterovitz Enabling Robots to Understand Incomplete Natural Language Instructions Using Commonsense Reasoning, ICRA 2020
11/10/2021	1:00PM	Zoom	Peixin Chang, Shuijing Liu, Katherine Driggs-Campbell Robot Sound Interpretation: Learning Visual-Audio Representations for Voice-Controlled Robots, arXiv 2021
10/27/2021	1:00PM	Zoom	Vanya Cohen, Geraud Nangue Tasse, Nakul Gopalan, Steven James, Matthew Gombolay, Benjamin Rosman Learning to Follow Language Instructions with Compositional Policies, AAAI Fall Symposium Series 2021
10/13/2021	1:00PM	Zoom	Mohit Shridhar, Lucas Manuelli, Dieter Fox CLIPort: What and Where Pathways for Robotic Manipulation, CoRL 2021
09/29/2021	1:00PM	Zoom	Tianshi Cao, Jingkang Wang, Yining Zhang, Sivabalan Manivasagam, Jialu Wang, Mengye Ren Zero-Shot Compositional Policy Learning via Language Grounding, NeurIPS Datasets and Benchmarks Track 2021 (under review)
04/26/2021	1:00PM	Zoom	Karan Desai, Justin Johnson VirTex: Learning Visual Representations from Textual Annotations, CVPR 2021
09/01/2021	1:00PM	Zoom	Xiao Wang, Xiujun Shu, Zhipeng Zhang, Bo Jiang, Yaowei Wang, Yonghong Tian, Feng Wu Towards More Flexible and Accurate Object Tracking with Natural Language: Algorithms and Benchmark, CVPR 2021

Spring 2021

Date Time Place Paper

05/10/2021 12:00PM Zoom Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, Ilya Sutskever
Learning Transferable Visual Models From Natural Language Supervision, arXiv 2021

04/26/2021 12:00PM Zoom Aditya Ramesh, Mikhail Pavlov, Gabriel Goh, Scott Gray, Chelsea Voss, Alec Radford, Mark Chen, Ilya Sutskever
Zero-Shot Text-to-Image Generation, arXiv 2021

04/12/2021 12:00PM Zoom Liunian Harold Li, Mark Yatskar, Da Yin, Cho-Jui Hsieh, Kai-Wei Chang
What Does BERT with Vision Look At?, ACL 2020

03/29/2021 12:00PM Zoom William N. Havard, Jean-Pierre Chevrot, Laurent Besacier
Catplayinginthesnow: Impact of Prior Segmentation on a Model of Visually Grounded Speech, CoNLL 2020

03/01/2021 12:00PM Zoom Yaoyiran Li, Edoardo Maria Ponti, Ivan Vulic, Anna Korhonen
Emergent Communication Pretraining for Few-Shot Machine Translation, COLING 2020

02/01/2021 12:00PM Zoom Thomas Carta, Subhajit Chaudhury, Kartik Talamadupula, Michiaki Tatsubori
VisualHints: A Visual-Lingual Environment for Multimodal Reinforcement Learning, arXiv 2020

01/14/2021 11:00AM Zoom Simon Stepputtis, Joseph Campbell, Mariano Phielipp, Stefan Lee, Chitta Baral, Heni Ben Amor
Language-Conditioned Imitation Learning for Robot Manipulation Tasks, NeurIPS 2020

Date	Time	Place	Paper
05/10/2021	12:00PM	Zoom	Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, Ilya Sutskever Learning Transferable Visual Models From Natural Language Supervision, arXiv 2021
04/26/2021	12:00PM	Zoom	Aditya Ramesh, Mikhail Pavlov, Gabriel Goh, Scott Gray, Chelsea Voss, Alec Radford, Mark Chen, Ilya Sutskever Zero-Shot Text-to-Image Generation, arXiv 2021
04/12/2021	12:00PM	Zoom	Liunian Harold Li, Mark Yatskar, Da Yin, Cho-Jui Hsieh, Kai-Wei Chang What Does BERT with Vision Look At?, ACL 2020
03/29/2021	12:00PM	Zoom	William N. Havard, Jean-Pierre Chevrot, Laurent Besacier Catplayinginthesnow: Impact of Prior Segmentation on a Model of Visually Grounded Speech, CoNLL 2020
03/01/2021	12:00PM	Zoom	Yaoyiran Li, Edoardo Maria Ponti, Ivan Vulic, Anna Korhonen Emergent Communication Pretraining for Few-Shot Machine Translation, COLING 2020
02/01/2021	12:00PM	Zoom	Thomas Carta, Subhajit Chaudhury, Kartik Talamadupula, Michiaki Tatsubori VisualHints: A Visual-Lingual Environment for Multimodal Reinforcement Learning, arXiv 2020
01/14/2021	11:00AM	Zoom	Simon Stepputtis, Joseph Campbell, Mariano Phielipp, Stefan Lee, Chitta Baral, Heni Ben Amor Language-Conditioned Imitation Learning for Robot Manipulation Tasks, NeurIPS 2020

Fall 2020

Date Time Place Paper

12/17/2020 11:00AM Zoom Didac Suris, Adria Recasens, David Bau, David Harwath, James Glass, Antonio Torralba
Learning Words by Drawing Images, CVPR 2019

12/03/2020 11:00AM Zoom Weixin Liang, James Zou, Zhou Yu
ALICE: Active Learning with Contrastive Natural Language Explanations, EMNLP 2020

11/12/2020 11:00AM Zoom Gabriel Ilharco, Rowan Zellers, Ali Farhadi, Hannaneh Hajishirzi
Probing Contextual Language Models for Common Ground with Visual Representations, arXiv 2020

10/29/2020 11:00AM Zoom Felix Hill, Olivier Tieleman, Tamara von Glehn, Nathaniel Wong, Hamza Merzic, Stephen Clark
Grounded Language Learning Fast and Slow, arXiv 2020

10/15/2020 11:00AM Zoom Theodore R. Sumers, Mark K. Ho, Robert D. Hawkins, Karthik Narasimhan, Thomas L. Griffiths
Learning Rewards from Linguistic Feedback, arXiv 2020

10/01/2020 11:00AM Zoom Linjie Li, Yen-Chun Chen, Yu Cheng, Zhe Gan, Licheng Yu, Jingjing Liu
HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training, EMNLP 2020

09/17/2020 11:00AM Zoom Homero Roman Roman, Yonatan Bisk, Jesse Thomason, Asli Celikyilmaz, Jianfeng Gao
RMM: A Recursive Mental Model for Dialog Navigation, arXiv 2020

09/03/2020 11:00AM Zoom Trang Tran, Shubham Toshniwal, Mohit Bansal, Kevin Gimpel, Karen Livescu, Mari Ostendorf
Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information, NAACL 2018

Date	Time	Place	Paper
12/17/2020	11:00AM	Zoom	Didac Suris, Adria Recasens, David Bau, David Harwath, James Glass, Antonio Torralba Learning Words by Drawing Images, CVPR 2019
12/03/2020	11:00AM	Zoom	Weixin Liang, James Zou, Zhou Yu ALICE: Active Learning with Contrastive Natural Language Explanations, EMNLP 2020
11/12/2020	11:00AM	Zoom	Gabriel Ilharco, Rowan Zellers, Ali Farhadi, Hannaneh Hajishirzi Probing Contextual Language Models for Common Ground with Visual Representations, arXiv 2020
10/29/2020	11:00AM	Zoom	Felix Hill, Olivier Tieleman, Tamara von Glehn, Nathaniel Wong, Hamza Merzic, Stephen Clark Grounded Language Learning Fast and Slow, arXiv 2020
10/15/2020	11:00AM	Zoom	Theodore R. Sumers, Mark K. Ho, Robert D. Hawkins, Karthik Narasimhan, Thomas L. Griffiths Learning Rewards from Linguistic Feedback, arXiv 2020
10/01/2020	11:00AM	Zoom	Linjie Li, Yen-Chun Chen, Yu Cheng, Zhe Gan, Licheng Yu, Jingjing Liu HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training, EMNLP 2020
09/17/2020	11:00AM	Zoom	Homero Roman Roman, Yonatan Bisk, Jesse Thomason, Asli Celikyilmaz, Jianfeng Gao RMM: A Recursive Mental Model for Dialog Navigation, arXiv 2020
09/03/2020	11:00AM	Zoom	Trang Tran, Shubham Toshniwal, Mohit Bansal, Kevin Gimpel, Karen Livescu, Mari Ostendorf Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information, NAACL 2018

Summer 2020

Date Time Place Paper

08/20/2020 11:00AM Zoom Stefanie Tellex, Nakul Gopalan, Hadas Kress-Gazit and Cynthia Matuszek
Robots That Use Language , Annual Review of Control, Robotics, and Autonomous Systems 2020

08/06/2020 11:00AM Zoom Lin Shao, Toki Migimatsu, Qiang Zhang, Karen Yang, and Jeannette Bohg
Concept2Robot: Learning Manipulation Concepts from Instructions and Human Demonstrations , RSS 2020

07/23/2020 11:30AM Zoom Zhe Gan, Yen-Chun Chen, Linjie Li, Chen Zhu, Yu Cheng, Jingjing Liu
Large-Scale Adversarial Training for Vision-and-Language Representation Learning , arXiv, 2020

06/25/2020 11:00AM Zoom Daniel Fried, Jean-Baptiste Alayrac, Phil Blunsom, Chris Dyer, Stephen Clark, Aida Nematzadeh
Learning to Segment Actions from Observation and Narration , ACL 2020

06/11/2020 11:00AM Zoom Corey Lynch, Pierre Sermanet
Grounding Language in Play , arXiv, 2020

Date	Time	Place	Paper
08/20/2020	11:00AM	Zoom	Stefanie Tellex, Nakul Gopalan, Hadas Kress-Gazit and Cynthia Matuszek Robots That Use Language , Annual Review of Control, Robotics, and Autonomous Systems 2020
08/06/2020	11:00AM	Zoom	Lin Shao, Toki Migimatsu, Qiang Zhang, Karen Yang, and Jeannette Bohg Concept2Robot: Learning Manipulation Concepts from Instructions and Human Demonstrations , RSS 2020
07/23/2020	11:30AM	Zoom	Zhe Gan, Yen-Chun Chen, Linjie Li, Chen Zhu, Yu Cheng, Jingjing Liu Large-Scale Adversarial Training for Vision-and-Language Representation Learning , arXiv, 2020
06/25/2020	11:00AM	Zoom	Daniel Fried, Jean-Baptiste Alayrac, Phil Blunsom, Chris Dyer, Stephen Clark, Aida Nematzadeh Learning to Segment Actions from Observation and Narration , ACL 2020
06/11/2020	11:00AM	Zoom	Corey Lynch, Pierre Sermanet Grounding Language in Play , arXiv, 2020

Spring 2020

Date Time Place Paper

05/19/2020 4:00PM Zoom Tsung-Yen Yang, Karthik Narasimhan
Robust and Interpretable Grounding of Spatial References with Relation Networks , arXiv, 2020

05/05/2020 4:00PM Zoom Washington Ramos, Michel Silva, Edson Araujo, Leandro Soriano Marcolino, Erickson Nascimento
Straight to the Point: Fast-forwarding Videos via Reinforcement Learning Using Textual Data, CVPR, 2020

04/21/2020 4:00PM Zoom Ramprasaath R. Selvaraju, Purva Tendulkar, Devi Parikh, Eric Horvitz, Marco Ribeiro, Besmira Nushi, Ece Kamar
SQuINTing at VQA Models: Interrogating VQA Models with Sub-Questions, CVPR, 2020

04/07/2020 4:00PM Zoom David Harwath, Adri� Recasens, D�dac Sur�s, Galen Chuang, Antonio Torralba, and James Glass
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input, ECCV, 2018

03/24/2020 4:00PM Zoom Haoyue Shi, Jiayuan Mao, Kevin Gimpel, Karen Livescu
Visually Grounded Neural Syntax Acquisition, ACL, 2019

03/10/2020 4:00PM GDC 3.816 Ta-Chung Chi, Mihail Eric, Seokhwan Kim, Minmin Shen, Dilek Hakkani-tur
Just Ask:An Interactive Learning Framework for Vision and Language Navigation, AAAI, 2020

02/25/2020 4:00PM GDC 3.816 Remi Cadene, Corentin Dancette, Hedi Ben younes, Matthieu Cord, Devi Parikh
RUBi: Reducing Unimodal Biases for Visual Question Answering, NeurIPS, 2019

02/11/2020 4:00PM GDC 3.816 Mohit Shridhar, Jesse Thomason, Daniel Gordon, Yonatan Bisk, Winson Han, Roozbeh Mottaghi, Luke Zettlemoyer, Dieter Fox
ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks, arXiv, 2019

01/28/2020 4:00PM GDC 3.816 Yonatan Bisk, Rowan Zellers, Ronan Le Bras, Jianfeng Gao, Yejin Choi
PIQA: Reasoning about Physical Commonsense in Natural Language, AAAI, 2020

Date	Time	Place	Paper
05/19/2020	4:00PM	Zoom	Tsung-Yen Yang, Karthik Narasimhan Robust and Interpretable Grounding of Spatial References with Relation Networks , arXiv, 2020
05/05/2020	4:00PM	Zoom	Washington Ramos, Michel Silva, Edson Araujo, Leandro Soriano Marcolino, Erickson Nascimento Straight to the Point: Fast-forwarding Videos via Reinforcement Learning Using Textual Data, CVPR, 2020
04/21/2020	4:00PM	Zoom	Ramprasaath R. Selvaraju, Purva Tendulkar, Devi Parikh, Eric Horvitz, Marco Ribeiro, Besmira Nushi, Ece Kamar SQuINTing at VQA Models: Interrogating VQA Models with Sub-Questions, CVPR, 2020
04/07/2020	4:00PM	Zoom	David Harwath, Adri� Recasens, D�dac Sur�s, Galen Chuang, Antonio Torralba, and James Glass Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input, ECCV, 2018
03/24/2020	4:00PM	Zoom	Haoyue Shi, Jiayuan Mao, Kevin Gimpel, Karen Livescu Visually Grounded Neural Syntax Acquisition, ACL, 2019
03/10/2020	4:00PM	GDC 3.816	Ta-Chung Chi, Mihail Eric, Seokhwan Kim, Minmin Shen, Dilek Hakkani-tur Just Ask:An Interactive Learning Framework for Vision and Language Navigation, AAAI, 2020
02/25/2020	4:00PM	GDC 3.816	Remi Cadene, Corentin Dancette, Hedi Ben younes, Matthieu Cord, Devi Parikh RUBi: Reducing Unimodal Biases for Visual Question Answering, NeurIPS, 2019
02/11/2020	4:00PM	GDC 3.816	Mohit Shridhar, Jesse Thomason, Daniel Gordon, Yonatan Bisk, Winson Han, Roozbeh Mottaghi, Luke Zettlemoyer, Dieter Fox ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks, arXiv, 2019
01/28/2020	4:00PM	GDC 3.816	Yonatan Bisk, Rowan Zellers, Ronan Le Bras, Jianfeng Gao, Yejin Choi PIQA: Reasoning about Physical Commonsense in Natural Language, AAAI, 2020

Fall 2019

Date Time Place Paper

11/20/2019 3:00PM GDC 3.816 Ronghang Hu, Jacob Andreas, Trevor Darrell, Kate Saenko
Explainable Neural Computation via Stack Neural Module Networks, ECCV, 2018

11/06/2019 3:00PM GDC 3.816 Xintong Yu, Hongming Zhang, Yangqiu Song, Yan Song, and Changshui Zhang
What You See is What You Get:Visual Pronoun Coreference Resolution in Dialogues, EMNLP, 2019

10/23/2019 3:00PM GDC 3.816 Howard Chen, Alane Suhr, Dipendra Misra, Noah Snavely, Yoav Artzi
TOUCHDOWN: Natural Language Navigation and Spatial Reasoning in Visual Street Environments, CVPR, 2019

10/16/2019 3:00PM GDC 3.828 Chris Paxton, Yonatan Bisk, Jesse Thomason, Arunkumar Byravan, Dieter Fox
Prospection: Interpretable Plans From Language By Predicting the Future, ICRA, 2019

09/25/2019 3:00PM GDC 3.816 Jiasen Lu, Dhruv Batra, Devi Parikh, Stefan Lee
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks, Arxiv, 2019

09/11/2019 3:00PM GDC 3.816 Andrea Burns, Reuben Tan, Kate Saenko, Stan Sclaroff, Bryan A. Plummer
Language Features Matter:Effective Language Representations for Vision-Language Tasks, ICCV, 2019

Date	Time	Place	Paper
11/20/2019	3:00PM	GDC 3.816	Ronghang Hu, Jacob Andreas, Trevor Darrell, Kate Saenko Explainable Neural Computation via Stack Neural Module Networks, ECCV, 2018
11/06/2019	3:00PM	GDC 3.816	Xintong Yu, Hongming Zhang, Yangqiu Song, Yan Song, and Changshui Zhang What You See is What You Get:Visual Pronoun Coreference Resolution in Dialogues, EMNLP, 2019
10/23/2019	3:00PM	GDC 3.816	Howard Chen, Alane Suhr, Dipendra Misra, Noah Snavely, Yoav Artzi TOUCHDOWN: Natural Language Navigation and Spatial Reasoning in Visual Street Environments, CVPR, 2019
10/16/2019	3:00PM	GDC 3.828	Chris Paxton, Yonatan Bisk, Jesse Thomason, Arunkumar Byravan, Dieter Fox Prospection: Interpretable Plans From Language By Predicting the Future, ICRA, 2019
09/25/2019	3:00PM	GDC 3.816	Jiasen Lu, Dhruv Batra, Devi Parikh, Stefan Lee ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks, Arxiv, 2019
09/11/2019	3:00PM	GDC 3.816	Andrea Burns, Reuben Tan, Kate Saenko, Stan Sclaroff, Bryan A. Plummer Language Features Matter:Effective Language Representations for Vision-Language Tasks, ICCV, 2019

Spring 2019

Date Time Place Paper

04/23/2019 3:30PM GDC 3.516 Kaihua Tang, Hanwang Zhang, Baoyuan Wu, Wenhan Luo, Wei Liu
Learning to Compose Dynamic Tree Structures for Visual Contexts, CVPR, 2019

04/08/2019 12PM GDC 3.816 Xin Wang, Qiuyuan Huang, Asli Celikyilmaz, Jianfeng Gao, Dinghan Shen, Yuan-Fang Wang, William Yang Wang, Lei Zhang
Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation, CVPR, 2019

03/28/2019 12PM GDC 3.816 Chaorui Deng, Qi Wu, Guanghui Xu, Zhuliang Yu, Yanwu Xu, Kui Jia, Mingkui Tan
You Only Look & Listen Once: Towards Fast and Accurate Visual Grounding, Arxiv, 2018

03/04/2019 12PM GDC 3.816 Rowan Zellers, Yonatan Bisk, Ali Farhadi, Yejin Choi
From Recognition to Cognition: Visual Commonsense Reasoning, Arxiv, 2018

02/11/2019 12PM GDC 3.816 Zhilin Yang, Saizheng Zhang, Jack Urbanek, Will Feng, Alexander H. Miller, Arthur Szlam, Douwe Kiela, Jason Weston
Mastering the Dungeon: Grounded Language Learning by Mechanical Turker Descent, ICLR, 2019

05/07/2019 3:30PM GDC 3.816 Fuwen Tan, Song Feng, Vicente Ordonez
Text2Scene: Generating Compositional Scenes from Textual Descriptions, CVPR, 2019

Date	Time	Place	Paper
04/23/2019	3:30PM	GDC 3.516	Kaihua Tang, Hanwang Zhang, Baoyuan Wu, Wenhan Luo, Wei Liu Learning to Compose Dynamic Tree Structures for Visual Contexts, CVPR, 2019
04/08/2019	12PM	GDC 3.816	Xin Wang, Qiuyuan Huang, Asli Celikyilmaz, Jianfeng Gao, Dinghan Shen, Yuan-Fang Wang, William Yang Wang, Lei Zhang Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation, CVPR, 2019
03/28/2019	12PM	GDC 3.816	Chaorui Deng, Qi Wu, Guanghui Xu, Zhuliang Yu, Yanwu Xu, Kui Jia, Mingkui Tan You Only Look & Listen Once: Towards Fast and Accurate Visual Grounding, Arxiv, 2018
03/04/2019	12PM	GDC 3.816	Rowan Zellers, Yonatan Bisk, Ali Farhadi, Yejin Choi From Recognition to Cognition: Visual Commonsense Reasoning, Arxiv, 2018
02/11/2019	12PM	GDC 3.816	Zhilin Yang, Saizheng Zhang, Jack Urbanek, Will Feng, Alexander H. Miller, Arthur Szlam, Douwe Kiela, Jason Weston Mastering the Dungeon: Grounded Language Learning by Mechanical Turker Descent, ICLR, 2019
05/07/2019	3:30PM	GDC 3.816	Fuwen Tan, Song Feng, Vicente Ordonez Text2Scene: Generating Compositional Scenes from Textual Descriptions, CVPR, 2019

Fall 2018

Date Time Place Paper

10/23/2018 11:00AM GDC 3.816 Dipendra Misra, Andrew Bennett, Valts Blukis, Eyvind Niklasson, Max Shatkhin and Yoav Artzi
Mapping Instructions to Actions in 3D Environments with Visual Goal Prediction, EMNLP, 2018

11/06/2018 11:00AM GDC 3.816 Dongchen Yu, Xing Gao and Hongkai Xiong
Structured Semantic Representation for Visual Question Answering, ICIP, 2018

10/23/2018 11:00AM GDC 3.816 Diane Bouchacourt and Marco Baroni
How Agents See Things: On Visual Representations in an Emergent Language Game, EMNLP, 2018

10/09/2018 11:00AM GDC 3.816 Andrea F. Daniele, Mohit Bansal, Matthew R. Walter
Navigational Instruction Generation as Inverse Reinforcement Learning with Neural Machine Translation, HRI, 2017

09/25/2018 11:00AM GDC 3.816 Peter Anderson, Xiaodong He, Chris Buehler, Damien Teney, Mark Johnson, Stephen Gould, Lei Zhang
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering, CVPR, 2018

09/11/2018 11:00AM GDC 3.816 Peter Anderson, Qi Wu, Damien Teney, Jake Bruce, Mark Johnson, Niko S�nderhauf, Ian Reid, Stephen Gould, Anton van den Hengel
Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments, CVPR, 2018

Date	Time	Place	Paper
10/23/2018	11:00AM	GDC 3.816	Dipendra Misra, Andrew Bennett, Valts Blukis, Eyvind Niklasson, Max Shatkhin and Yoav Artzi Mapping Instructions to Actions in 3D Environments with Visual Goal Prediction, EMNLP, 2018
11/06/2018	11:00AM	GDC 3.816	Dongchen Yu, Xing Gao and Hongkai Xiong Structured Semantic Representation for Visual Question Answering, ICIP, 2018
10/23/2018	11:00AM	GDC 3.816	Diane Bouchacourt and Marco Baroni How Agents See Things: On Visual Representations in an Emergent Language Game, EMNLP, 2018
10/09/2018	11:00AM	GDC 3.816	Andrea F. Daniele, Mohit Bansal, Matthew R. Walter Navigational Instruction Generation as Inverse Reinforcement Learning with Neural Machine Translation, HRI, 2017
09/25/2018	11:00AM	GDC 3.816	Peter Anderson, Xiaodong He, Chris Buehler, Damien Teney, Mark Johnson, Stephen Gould, Lei Zhang Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering, CVPR, 2018
09/11/2018	11:00AM	GDC 3.816	Peter Anderson, Qi Wu, Damien Teney, Jake Bruce, Mark Johnson, Niko S�nderhauf, Ian Reid, Stephen Gould, Anton van den Hengel Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments, CVPR, 2018

Spring 2018

Date Time Place Paper

4/30/2018 1:00PM GDC 3.816 Francesco Barbieri, Miguel Ballesteros, Francesco Ronzano, and Horacio Saggion
Multimodal Emoji Prediction, NAACL, 2018

4/16/2018 1:00PM GDC 3.816 Jacob Andreas, Dan Klein, and Sergey Levine
Learning with Latent Language, NAACL, 2018

4/2/2018 1:00PM GDC 3.816 Ronghang Hu, Marcus Rohrbach, Jacob Andreas, Trevor Darrell, and Kate Saenko
Modeling Relationships in Referential Expressions with Compositional Modular Networks, CVPR, 2017

3/19/2018 1:00PM GDC 3.816 Fanyi Xiao, Leonid Sigal, and Yong Jae Lee
Weakly-supervised Visual Grounding of Phrases with Linguistic Structures, CVPR, 2017

2/26/2018 1:00PM GDC 3.816 Jae Sung Park, Biao Jia, Mohit Bansal, and Dinesh Manocha
Generating Realtime Motion Plans from Attribute-Based Natural Language Instructions Using Dynamic Constraint Mapping, arXiv preprint, 2017

2/12/2018 1:00PM GDC 3.816 Lisa Anne Hendricks, Ronghang Hu, Trevor Darrell, and Zeynep Akata
Grounding Visual Explanations (Extended Abstract), arXiv preprint, 2017 +
Rakshith Shetty, Marcus Rohrbach, Lisa Anne Hendricks, Mario Fritz, and Bernt Schiele
Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training, ICCV, 2017

1/29/2018 1:00PM GDC 3.816 Abhishek Das, Samyak Datta, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra
Embodied Question Answering, arXiv preprint, 2017

Date	Time	Place	Paper
4/30/2018	1:00PM	GDC 3.816	Francesco Barbieri, Miguel Ballesteros, Francesco Ronzano, and Horacio Saggion Multimodal Emoji Prediction, NAACL, 2018
4/16/2018	1:00PM	GDC 3.816	Jacob Andreas, Dan Klein, and Sergey Levine Learning with Latent Language, NAACL, 2018
4/2/2018	1:00PM	GDC 3.816	Ronghang Hu, Marcus Rohrbach, Jacob Andreas, Trevor Darrell, and Kate Saenko Modeling Relationships in Referential Expressions with Compositional Modular Networks, CVPR, 2017
3/19/2018	1:00PM	GDC 3.816	Fanyi Xiao, Leonid Sigal, and Yong Jae Lee Weakly-supervised Visual Grounding of Phrases with Linguistic Structures, CVPR, 2017
2/26/2018	1:00PM	GDC 3.816	Jae Sung Park, Biao Jia, Mohit Bansal, and Dinesh Manocha Generating Realtime Motion Plans from Attribute-Based Natural Language Instructions Using Dynamic Constraint Mapping, arXiv preprint, 2017
2/12/2018	1:00PM	GDC 3.816	Lisa Anne Hendricks, Ronghang Hu, Trevor Darrell, and Zeynep Akata Grounding Visual Explanations (Extended Abstract), arXiv preprint, 2017 + Rakshith Shetty, Marcus Rohrbach, Lisa Anne Hendricks, Mario Fritz, and Bernt Schiele Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training, ICCV, 2017
1/29/2018	1:00PM	GDC 3.816	Abhishek Das, Samyak Datta, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra Embodied Question Answering, arXiv preprint, 2017

Fall 2017

Date Time Place Paper

12/05/2017 11:00AM GDC 3.816 Bolei Zhou, David Bau, Aude Oliva, and Antonio Torralba
Interpreting Deep Visual Representations via Network Dissection, CVPR, 2017

11/21/2017 11:00AM GDC 3.816 S. Palazzo, C. Spampinato, I.Kavasidis, D. Giordano
Generative Adversarial Networks Conditioned by Brain Signals, ICCV, 2017

11/07/2017 11:00AM GDC 3.816 Ishan Misra, Abhinav Gupta, and Martial Hebert
From Red Wine to Red Tomato: Composition with Context, CVPR, 2017

10/24/2017 11:00AM GDC 3.816 Scott L. Fairhall and Alfonso Caramazza
Brain Regions That Represent Amodal Conceptual Knowledge, The Journal of Neuroscience, 2013

09/26/2017 11:00AM GDC 3.816 Dong Huk Park, Lisa Anne Hendricks, Zeynep Akata, Bernt Schiele, Trevor Darrell, and Marcus Rohrbach
Attentive Explanations: Justifying Decisions and Pointing to the Evidence, arXiv

09/12/2017 11:00AM GDC 3.816 Maxwell Forbes and Yejin Choi
VERB PHYSICS: Relative Physical Knowledge of Actions and Objects, ACL 2017

Date	Time	Place	Paper
12/05/2017	11:00AM	GDC 3.816	Bolei Zhou, David Bau, Aude Oliva, and Antonio Torralba Interpreting Deep Visual Representations via Network Dissection, CVPR, 2017
11/21/2017	11:00AM	GDC 3.816	S. Palazzo, C. Spampinato, I.Kavasidis, D. Giordano Generative Adversarial Networks Conditioned by Brain Signals, ICCV, 2017
11/07/2017	11:00AM	GDC 3.816	Ishan Misra, Abhinav Gupta, and Martial Hebert From Red Wine to Red Tomato: Composition with Context, CVPR, 2017
10/24/2017	11:00AM	GDC 3.816	Scott L. Fairhall and Alfonso Caramazza Brain Regions That Represent Amodal Conceptual Knowledge, The Journal of Neuroscience, 2013
09/26/2017	11:00AM	GDC 3.816	Dong Huk Park, Lisa Anne Hendricks, Zeynep Akata, Bernt Schiele, Trevor Darrell, and Marcus Rohrbach Attentive Explanations: Justifying Decisions and Pointing to the Evidence, arXiv
09/12/2017	11:00AM	GDC 3.816	Maxwell Forbes and Yejin Choi VERB PHYSICS: Relative Physical Knowledge of Actions and Objects, ACL 2017

Spring 2017

Date Time Place Paper

05/01/2017 3:30PM GDC 3.816 Yonatan Bisk, Deniz Yuret, Daniel Marcu
Natural Language Communication with Robots, NAACL 2016

04/17/2017 3:30PM GDC 3.816 Abhishek Das, Satwik Kottur, José M.F. Moura, Stefan Lee, Dhruv Batra
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning, arXiv 2017

04/3/2017 3:30PM GDC 3.816 Muhannad Alomari, Paul Duckworth, Davic C. Hogg, Anthony G. Cohn
Natural Language Acquisition and Grounding for Embodied Robotic Systems, AAAI 2017

03/13/2017 3:30PM GDC 3.816 Carina Silberer, Vittorio Ferrari, Mirella Lapata
Visually Grounded Meaning Representations, IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016

02/27/2017 3:30PM GDC 3.816 Harm de Vries, Florian Strub, Sarath Chandar, Olivier Pietquin, Hugo Larochelle, Aaron Courville
GuessWhat?! Visual object discovery through multi-modal dialogue, arXiv, November 2016

02/06/2017 3:30PM GDC 3.816 Aishwarya Padmakumar and Jesse Thomason and Raymond J. Mooney
Integrated Learning of Dialog Strategies and Semantic Parsing, EACL 2017

01/23/2017 3:30PM GDC 3.816 Emanuele Bastianelli, Danilo Croce, Andrea Vanzo, Roberto Basili, and Daniele Nardi
A Discriminative Approach to Grounded Spoken Language Understanding in Interactive Robotics, IJCAI 2016

Date	Time	Place	Paper
05/01/2017	3:30PM	GDC 3.816	Yonatan Bisk, Deniz Yuret, Daniel Marcu Natural Language Communication with Robots, NAACL 2016
04/17/2017	3:30PM	GDC 3.816	Abhishek Das, Satwik Kottur, José M.F. Moura, Stefan Lee, Dhruv Batra Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning, arXiv 2017
04/3/2017	3:30PM	GDC 3.816	Muhannad Alomari, Paul Duckworth, Davic C. Hogg, Anthony G. Cohn Natural Language Acquisition and Grounding for Embodied Robotic Systems, AAAI 2017
03/13/2017	3:30PM	GDC 3.816	Carina Silberer, Vittorio Ferrari, Mirella Lapata Visually Grounded Meaning Representations, IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016
02/27/2017	3:30PM	GDC 3.816	Harm de Vries, Florian Strub, Sarath Chandar, Olivier Pietquin, Hugo Larochelle, Aaron Courville GuessWhat?! Visual object discovery through multi-modal dialogue, arXiv, November 2016
02/06/2017	3:30PM	GDC 3.816	Aishwarya Padmakumar and Jesse Thomason and Raymond J. Mooney Integrated Learning of Dialog Strategies and Semantic Parsing, EACL 2017
01/23/2017	3:30PM	GDC 3.816	Emanuele Bastianelli, Danilo Croce, Andrea Vanzo, Roberto Basili, and Daniele Nardi A Discriminative Approach to Grounded Spoken Language Understanding in Interactive Robotics, IJCAI 2016

Fall 2016

Date Time Place Paper

12/05/2016 3:30PM GDC 3.816 Jaeyong Sung, Seok Hyun Jin, and Ashutosh Saxena
Robobarista: Object Part based Transfer of Manipulation Trajectories from Crowd-sourcing in 3D Pointclouds, ISRR 2015

11/21/2016 3:30PM GDC 3.816 Sergio Guadarrama, Erik Rodner, Kate Saenko, Trevor Darrell
Understanding object descriptions in robotics by open-vocabulary object retrieval and detection, IJRR 2016

11/07/2016 3:30PM GDC 3.816 David Whitney, Miles Eldon, John Oberlin, Stefanie Tellex
Interpreting Multimodal Referring Expressions in Real Time, IEEE 2016

10/24/2016 3:30PM GDC 3.816 Scott Reed, Zeynep Akata, Xinchen Yan, Lajanugen Logeswaran, Bernt Schiele, Honglak Lee
Generative Adversarial Text to Image Synthesis, ICML 2016

10/10/2016 3:30PM GDC 3.816 Antoine Bosselut, Jianfu Chen, David Warren, Hannaneh Hajishirzi, and Yejin Choi
Learning Prototypical Event Structure from Photo Albums, ACL 2016

9/26/2016 3:30PM GDC 3.816 Lisa Anne Hendricks, Zeynep Akata, Marcus Rohrbach, Jeff Donahue, Bernt Schiele, Trevor Darrell
Generating Visual Explanations, arXiv 2016

9/12/2016 3:30PM GDC 3.816 Gordon Christie, Ankit Laddha, Aishwarya Agrawal, Stanislaw Antol, Yash Goyal, Kevin Kochersberger, Dhruv Batra
Resolving Vision and Language Ambiguities Together: Joint Segmentation & Prepositional Attachment Resolution in Captioned Scenes, EMNLP 2016

8/29/2016 3:30PM GDC 3.816 Angel Chang, Will Monroe, Manolis Savva, Christopher Potts, Christopher D. Manning
Text to 3D Scene Generation with Rich Lexical Grounding, ACL 2015

Date	Time	Place	Paper
12/05/2016	3:30PM	GDC 3.816	Jaeyong Sung, Seok Hyun Jin, and Ashutosh Saxena Robobarista: Object Part based Transfer of Manipulation Trajectories from Crowd-sourcing in 3D Pointclouds, ISRR 2015
11/21/2016	3:30PM	GDC 3.816	Sergio Guadarrama, Erik Rodner, Kate Saenko, Trevor Darrell Understanding object descriptions in robotics by open-vocabulary object retrieval and detection, IJRR 2016
11/07/2016	3:30PM	GDC 3.816	David Whitney, Miles Eldon, John Oberlin, Stefanie Tellex Interpreting Multimodal Referring Expressions in Real Time, IEEE 2016
10/24/2016	3:30PM	GDC 3.816	Scott Reed, Zeynep Akata, Xinchen Yan, Lajanugen Logeswaran, Bernt Schiele, Honglak Lee Generative Adversarial Text to Image Synthesis, ICML 2016
10/10/2016	3:30PM	GDC 3.816	Antoine Bosselut, Jianfu Chen, David Warren, Hannaneh Hajishirzi, and Yejin Choi Learning Prototypical Event Structure from Photo Albums, ACL 2016
9/26/2016	3:30PM	GDC 3.816	Lisa Anne Hendricks, Zeynep Akata, Marcus Rohrbach, Jeff Donahue, Bernt Schiele, Trevor Darrell Generating Visual Explanations, arXiv 2016
9/12/2016	3:30PM	GDC 3.816	Gordon Christie, Ankit Laddha, Aishwarya Agrawal, Stanislaw Antol, Yash Goyal, Kevin Kochersberger, Dhruv Batra Resolving Vision and Language Ambiguities Together: Joint Segmentation & Prepositional Attachment Resolution in Captioned Scenes, EMNLP 2016
8/29/2016	3:30PM	GDC 3.816	Angel Chang, Will Monroe, Manolis Savva, Christopher Potts, Christopher D. Manning Text to 3D Scene Generation with Rich Lexical Grounding, ACL 2015

Spring 2016

Date Time Place Paper

5/3/2016 10AM GDC 3.816 Jacob Andreas, Marcus Rohrbach, Trevor Darrell, Dan Klein
Learning to Compose Neural Networks for Question Answering, NAACL 2016

4/19/2016 10AM GDC 3.816 Giampiero Salvi, Luis Montesano, Alexandre Bernardino, Jose Santos-Victor
Language bootstrapping: Learning word meanings from perception-action association, IEEE 2011

4/5/2016 10AM GDC 3.816 Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh
VQA: Visual Question Answering, ICCV 2015

3/22/2016 10AM GDC 3.816 Ivan Vendrov, Ryan Kiros, Sanja Fidler, Raquel Urtasun
Order-embeddings of Images and Language, ICLR 2016

3/8/2016 10AM GDC 3.816 Hongyuan Mei, Mohit Bansal, and Matthew R. Walter
Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to Action Sequences, AAAI 2016

2/9/2016 10AM GDC 3.816 Tom Williams, Gordon Briggs, Bradley Oosterveld, and Matthias Scheutz
Going Beyond Literal Command-Based Instructions: Extending Robotic Natural Language Interaction Capabilities, AAAI 2015

1/26/2016 10AM GDC 3.816 Angeliki Lazaridou, Nghia The Pham, and Marco Baroni
Combining Language and Vision with a Multimodal Skip-gram Model, NAACL 2015

Date	Time	Place	Paper
5/3/2016	10AM	GDC 3.816	Jacob Andreas, Marcus Rohrbach, Trevor Darrell, Dan Klein Learning to Compose Neural Networks for Question Answering, NAACL 2016
4/19/2016	10AM	GDC 3.816	Giampiero Salvi, Luis Montesano, Alexandre Bernardino, Jose Santos-Victor Language bootstrapping: Learning word meanings from perception-action association, IEEE 2011
4/5/2016	10AM	GDC 3.816	Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh VQA: Visual Question Answering, ICCV 2015
3/22/2016	10AM	GDC 3.816	Ivan Vendrov, Ryan Kiros, Sanja Fidler, Raquel Urtasun Order-embeddings of Images and Language, ICLR 2016
3/8/2016	10AM	GDC 3.816	Hongyuan Mei, Mohit Bansal, and Matthew R. Walter Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to Action Sequences, AAAI 2016
2/9/2016	10AM	GDC 3.816	Tom Williams, Gordon Briggs, Bradley Oosterveld, and Matthias Scheutz Going Beyond Literal Command-Based Instructions: Extending Robotic Natural Language Interaction Capabilities, AAAI 2015
1/26/2016	10AM	GDC 3.816	Angeliki Lazaridou, Nghia The Pham, and Marco Baroni Combining Language and Vision with a Multimodal Skip-gram Model, NAACL 2015

Fall 2015

Date Time Place Paper

12/04/2015 10AM GDC 3.816 Licheng Yu, Eunbyung Park, Alexander C. Berg, and Tamara L. Berg
Visual Madlibs: Fill in the blank Image Generation and Question Answering, ICCV 2015

11/13/2015 11AM GDC 3.816 Michael Spranger and Luc Steels
Co-Acquisition of Syntax and Semantics - An Investigation in Spatial Language, IJCAI 2015

10/30/2015 10AM GDC 3.816 Angeliki Lazaridou, Dat Tien Nguyen, and Marco Baroni
Do Distributed Semantic Models Dream of Electric Sheep? Visualizing Word Representations through Image Synthesis, EMNLP 2015

10/16/2015 11AM GDC 3.816 Douwe Kiela and Stephen Clark
Multi- and Cross-Modal Semantics Beyond Vision: Grounding in Auditory Perception, EMNLP 2015

10/02/2015 11AM GDC 3.816 Yukun Zhu, Ryan Kiros, Richard Zemel, Ruslan Salakhutdinov, Raquel Urtasun, Antonio Torralba, Sanja Fidler
Aligning Books and Movies: Towards Story-like Visual Explanations by Watching Movies and Reading Books, ICCV 2015

supplementary: Ryan Kiros, Yukun Zhu, Ruslan Salakhutdinov, Richard S. Zemel, Antonio Torralba, Raquel Urtasun, Sanja Fidler
Skip-Thought Vectors, NIPS 2015

09/18/2015 11AM GDC 3.816 Dongcai Lu and Xiaoping Chen
Towards an Architecture Combining Grounding and Planning for Human-Robot Interaction, RoboCup Symposium, 2015

09/09/2015 4PM GDC 3.816 Daniel Paul Barrett, Scott Alan Bronikowski, Haonan Yu, Jeffrey Mark Siskind
Robot Language Learning, Generation, and Comprehension, arXiv, 2015

Date	Time	Place	Paper
12/04/2015	10AM	GDC 3.816	Licheng Yu, Eunbyung Park, Alexander C. Berg, and Tamara L. Berg Visual Madlibs: Fill in the blank Image Generation and Question Answering, ICCV 2015
11/13/2015	11AM	GDC 3.816	Michael Spranger and Luc Steels Co-Acquisition of Syntax and Semantics - An Investigation in Spatial Language, IJCAI 2015
10/30/2015	10AM	GDC 3.816	Angeliki Lazaridou, Dat Tien Nguyen, and Marco Baroni Do Distributed Semantic Models Dream of Electric Sheep? Visualizing Word Representations through Image Synthesis, EMNLP 2015
10/16/2015	11AM	GDC 3.816	Douwe Kiela and Stephen Clark Multi- and Cross-Modal Semantics Beyond Vision: Grounding in Auditory Perception, EMNLP 2015
10/02/2015	11AM	GDC 3.816	Yukun Zhu, Ryan Kiros, Richard Zemel, Ruslan Salakhutdinov, Raquel Urtasun, Antonio Torralba, Sanja Fidler Aligning Books and Movies: Towards Story-like Visual Explanations by Watching Movies and Reading Books, ICCV 2015 supplementary: Ryan Kiros, Yukun Zhu, Ruslan Salakhutdinov, Richard S. Zemel, Antonio Torralba, Raquel Urtasun, Sanja Fidler Skip-Thought Vectors, NIPS 2015
09/18/2015	11AM	GDC 3.816	Dongcai Lu and Xiaoping Chen Towards an Architecture Combining Grounding and Planning for Human-Robot Interaction, RoboCup Symposium, 2015
09/09/2015	4PM	GDC 3.816	Daniel Paul Barrett, Scott Alan Bronikowski, Haonan Yu, Jeffrey Mark Siskind Robot Language Learning, Generation, and Comprehension, arXiv, 2015

Summer 2015

Date Time Place Paper

08/11/2015 1:00pm GDC 3.816 Natalie Parde, Adam Hair, Michalis Papakostas, Konstantinos Tsiakas, Maria Dagioglou, Vangelis Karkaletsis, Rodney D. Nielsen
Grounding the Meaning of Words through Vision and Interactive Gameplay, IJCAI, 2015

07/21/2015 1:00pm GDC 3.816 Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, Alen L. Yuille
Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images, arXiv, 2015

07/14/2015 1:00pm GDC 3.816 Yürüten, Onur, Erol Şahin, and Sinan Kalkan
The learning of adjectives and nouns from affordance and appearance features, Adaptive Behavior 21.6 (2013): 437-451

06/30/2015 1:00pm GDC 3.816 Brian McMahan and Matthew Stone
A Bayesian Model of Grounded Color Semantics, NAACL, 2015

05/04/2015 12:00pm GDC 3.816 Mateusz Malinowski and Mario Fritz
A Multi-World Approach to Question Answering about Real-World Scenes based on Uncertain Input, NIPS, 2014

Date	Time	Place	Paper
08/11/2015	1:00pm	GDC 3.816	Natalie Parde, Adam Hair, Michalis Papakostas, Konstantinos Tsiakas, Maria Dagioglou, Vangelis Karkaletsis, Rodney D. Nielsen Grounding the Meaning of Words through Vision and Interactive Gameplay, IJCAI, 2015
07/21/2015	1:00pm	GDC 3.816	Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, Alen L. Yuille Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images, arXiv, 2015
07/14/2015	1:00pm	GDC 3.816	Yürüten, Onur, Erol Şahin, and Sinan Kalkan The learning of adjectives and nouns from affordance and appearance features, Adaptive Behavior 21.6 (2013): 437-451
06/30/2015	1:00pm	GDC 3.816	Brian McMahan and Matthew Stone A Bayesian Model of Grounded Color Semantics, NAACL, 2015
05/04/2015	12:00pm	GDC 3.816	Mateusz Malinowski and Mario Fritz A Multi-World Approach to Question Answering about Real-World Scenes based on Uncertain Input, NIPS, 2014

Spring 2015

Date Time Place Paper

04/27/2015 11:00am GDC 3.816 Mohan Sridharan, Jeremy Wyatt, and Richard Dearden
HiPPo: Hierarchical POMDPs for Planning Information Processing and Sensing Actions on a Robot, ICAPS, 2008<$

04/20/2015 12:00pm GDC 3.816 Li Yao, Atousa Torabi, Kyunghyun Cho, Nicolas Ballas, Christopher Pal, Hugo Larochelle, and Aaron Courville
Video Description Generation Incorporating Spatio-Temporal Features and a Soft-Attention Mechanism, arXiv, 2015

04/13/2015 12:00pm GDC 3.816 Changsong Liu and Joyce Y. Chai
Learning to Mediate Perceptual Differences in Situated Human-Robot Dialogue, AAAI, 2015

04/06/2015 12:00pm GDC 3.816 Jonathan Malmaud, Jonathan Huang, Vivek Rathod, Nick Johnston, Andrew Rabinovich, and Kevin Murphy
What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision, arXiv, 2015

03/30/2015 11:00am GDC 3.816 Cynthia Matuszek, Liefeng Bo, Luke Zettlemoyer, Dieter Fox
Learning from Unscripted Deictic Gesture and Language for Human-Robot Interactions, AAAI, 2014

03/09/2015 11:00am GDC 3.816 Hao Fang, Saurabh Gupta, Forrest Iandola, Rupesh Srivastava, Li Deng, Piotr Dollar, Jianfeng Gao, Xiaodong He, Margaret Mitchell, John C., Platt C., Lawrence Zitnick, Geoffrey Zweig
From Captions to Visual Concepts and Back, arXiv, 2014

03/02/2015 12:00pm GDC 3.816 Thomas Kollar, Jayant Krishnamurthy, Grant Strimel
Toward Interactive Grounded Language Acquisition, Robotics: Science and Systems (RSS), 2013

02/23/2015 11:00am GDC 3.816 Subhashini Venugopalan, Huijuan Xu, Jeff Donahue, Marcus Rohrbach, Raymond Mooney, Kate Saenko
Translating Videos to Natural Language Using Deep Recurrent Neural Networks, arXiv, 2015

02/16/2015 12:00pm GDC 3.816 Jivko Sinapov, Connor Schenck, and Alexander Stoytchev
Learning relational object categories using behavioral exploration and multimodal perception, Robotics and Automation (ICRA), 2014

02/09/2015 11:00am GDC 3.816 Changsong Liu, Lanbo She, Rui Fang, Joyce Y. Chai
Probabilistic Labeling for Efficient Referential Grounding based on Collaborative Discourse, ACL 2014

Date	Time	Place	Paper
04/27/2015	11:00am	GDC 3.816	Mohan Sridharan, Jeremy Wyatt, and Richard Dearden HiPPo: Hierarchical POMDPs for Planning Information Processing and Sensing Actions on a Robot, ICAPS, 2008<$
04/20/2015	12:00pm	GDC 3.816	Li Yao, Atousa Torabi, Kyunghyun Cho, Nicolas Ballas, Christopher Pal, Hugo Larochelle, and Aaron Courville Video Description Generation Incorporating Spatio-Temporal Features and a Soft-Attention Mechanism, arXiv, 2015
04/13/2015	12:00pm	GDC 3.816	Changsong Liu and Joyce Y. Chai Learning to Mediate Perceptual Differences in Situated Human-Robot Dialogue, AAAI, 2015
04/06/2015	12:00pm	GDC 3.816	Jonathan Malmaud, Jonathan Huang, Vivek Rathod, Nick Johnston, Andrew Rabinovich, and Kevin Murphy What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision, arXiv, 2015
03/30/2015	11:00am	GDC 3.816	Cynthia Matuszek, Liefeng Bo, Luke Zettlemoyer, Dieter Fox Learning from Unscripted Deictic Gesture and Language for Human-Robot Interactions, AAAI, 2014
03/09/2015	11:00am	GDC 3.816	Hao Fang, Saurabh Gupta, Forrest Iandola, Rupesh Srivastava, Li Deng, Piotr Dollar, Jianfeng Gao, Xiaodong He, Margaret Mitchell, John C., Platt C., Lawrence Zitnick, Geoffrey Zweig From Captions to Visual Concepts and Back, arXiv, 2014
03/02/2015	12:00pm	GDC 3.816	Thomas Kollar, Jayant Krishnamurthy, Grant Strimel Toward Interactive Grounded Language Acquisition, Robotics: Science and Systems (RSS), 2013
02/23/2015	11:00am	GDC 3.816	Subhashini Venugopalan, Huijuan Xu, Jeff Donahue, Marcus Rohrbach, Raymond Mooney, Kate Saenko Translating Videos to Natural Language Using Deep Recurrent Neural Networks, arXiv, 2015
02/16/2015	12:00pm	GDC 3.816	Jivko Sinapov, Connor Schenck, and Alexander Stoytchev Learning relational object categories using behavioral exploration and multimodal perception, Robotics and Automation (ICRA), 2014
02/09/2015	11:00am	GDC 3.816	Changsong Liu, Lanbo She, Rui Fang, Joyce Y. Chai Probabilistic Labeling for Efficient Referential Grounding based on Collaborative Discourse, ACL 2014

Spring 2014

Date Time Place Paper

04/29/2014 10:00am GDC 3.816 Minh-Thang Luong, Michael C. Frank, and Mark Johnson
Parsing entire discourses as very long strings: Capturing topic continuity in grounded language learning, EMNLP 2013

04/15/2014 9:30am GDC 3.816 Calvin MacKenzie
Integrating Visual and Linguistic Information to Describe Properties of Objects, Undergraduate Thesis Project

04/01/2014 10:00am GDC 3.816 Pradipto Das, Chenliang Xu, Richard F. Doell, and Jason J. Corso
A Thousand Frames in Just a Few Words: Lingual Description of Videos through Latent Topics and Sparse Object Stitching, CVPR 2013

03/18/2014 10:00am GDC 3.816 Jayant Krishnamurthy, Thomas Kollar
Jointly Learning to Parse and Perceive: Connecting Natural Language to the Physical World, TACL 2013

02/18/2014 10:00am GDC 3.816 C. Lawrence Zitnick, Devi Parikh
Bringing Semantics Into Focus Using Visual Abstraction, CVPR 2013

02/04/2014 10:00am GDC 3.816 C. Lawrence Zitnick, Devi Parikh, Lucy Vanderwende
Learning the Visual Interpretation of Sentences, ICCV 2013

01/21/2014 10:00am GDC 3.816 Dieu-Thu Le, Jasper Uijlings, and Raffaella Bernardi
Exploiting language models for visual recognition, EMNLP 2013

01/10/2014 12:00pm GDC 3.816 Vicente Ordonez, Jia Deng, Yejin Choi, Alexander C. Berg, Tamara L. Berg
From Large Scale Image Categorization to Entry-Level Categories, ICCV 2013

Date	Time	Place	Paper
04/29/2014	10:00am	GDC 3.816	Minh-Thang Luong, Michael C. Frank, and Mark Johnson Parsing entire discourses as very long strings: Capturing topic continuity in grounded language learning, EMNLP 2013
04/15/2014	9:30am	GDC 3.816	Calvin MacKenzie Integrating Visual and Linguistic Information to Describe Properties of Objects, Undergraduate Thesis Project
04/01/2014	10:00am	GDC 3.816	Pradipto Das, Chenliang Xu, Richard F. Doell, and Jason J. Corso A Thousand Frames in Just a Few Words: Lingual Description of Videos through Latent Topics and Sparse Object Stitching, CVPR 2013
03/18/2014	10:00am	GDC 3.816	Jayant Krishnamurthy, Thomas Kollar Jointly Learning to Parse and Perceive: Connecting Natural Language to the Physical World, TACL 2013
02/18/2014	10:00am	GDC 3.816	C. Lawrence Zitnick, Devi Parikh Bringing Semantics Into Focus Using Visual Abstraction, CVPR 2013
02/04/2014	10:00am	GDC 3.816	C. Lawrence Zitnick, Devi Parikh, Lucy Vanderwende Learning the Visual Interpretation of Sentences, ICCV 2013
01/21/2014	10:00am	GDC 3.816	Dieu-Thu Le, Jasper Uijlings, and Raffaella Bernardi Exploiting language models for visual recognition, EMNLP 2013
01/10/2014	12:00pm	GDC 3.816	Vicente Ordonez, Jia Deng, Yejin Choi, Alexander C. Berg, Tamara L. Berg From Large Scale Image Categorization to Entry-Level Categories, ICCV 2013

Fall 2013

Date Time Place Paper

12/06/2013 12:00pm GDC 3.816 Elia Bruni, Gemma Boleda, Marco Baroni, Nam-Khan Tran
Distributional Semantics in Technicolor, ACL 2012

11/22/2013 12:00pm GDC 3.516 Carina Silberer, Vittorio Ferrari, Mirella Lapata
Models of Semantic Representation with Visual Attributes, ACL 2013

11/8/2013 12:00pm GDC 3.516 Andrew J. Anderson, Elia Bruni, Ulisse Bordignon, Massimo Poesio, and Marco Baroni
Of words, eyes and brains: Correlating image-based distributional semantic models with neural representations of concepts, EMNLP 2013

10/25/2013 12:00pm GDC 3.516 Haonan Yu and Jeffrey Mark Siskind
Grounded Language Learning from Video Described with Sentences, ACL 2013 (best paper award)

Date	Time	Place	Paper
12/06/2013	12:00pm	GDC 3.816	Elia Bruni, Gemma Boleda, Marco Baroni, Nam-Khan Tran Distributional Semantics in Technicolor, ACL 2012
11/22/2013	12:00pm	GDC 3.516	Carina Silberer, Vittorio Ferrari, Mirella Lapata Models of Semantic Representation with Visual Attributes, ACL 2013
11/8/2013	12:00pm	GDC 3.516	Andrew J. Anderson, Elia Bruni, Ulisse Bordignon, Massimo Poesio, and Marco Baroni Of words, eyes and brains: Correlating image-based distributional semantic models with neural representations of concepts, EMNLP 2013
10/25/2013	12:00pm	GDC 3.516	Haonan Yu and Jeffrey Mark Siskind Grounded Language Learning from Video Described with Sentences, ACL 2013 (best paper award)

Spring 2013

Date Time Place Paper

4/23/2013 1PM GDC 3.416 Michaela Regneri, Marcus Rohrbach, Dominikus Wetzel, Stefan Thater, Bernt Schiele, Manfred Pinkal
Grounding Action Descriptions in Videos, TACL, 2013

4/9/2013 1PM GDC 3.416 Yoav Artzi and Luke Zettlemoyer
Weakly Supervised Learning of Semantic Parsers for Mapping Instructions to Actions, TACL, 2013

3/26/2013 1PM GDC 3.416 C. Liu, R. Fang, and J. Y. Chai
Towards Mediating Shared Perceptual Basis in Situated Dialogue, SIGDIAL Meeting on Discourse and Dialogue, 2012

3/5/2013 1PM GDC 3.416 Richard Socher, Milind Ganjoo, Hamsa Sridhar, Osbert Bastani, Christopher D. Manning, Andrew Y. Ng
Zero-Shot Learning Through Cross-Modal Transfer, arXiv, 2013.

2/19/2013 1PM GDC 3.516 Y. Jia, M. Salzmann, and T. Darrell
Learning Cross-modality Similarity for Multinomial Data , ICCV, 2011.

2/5/2013 1PM GDC 3.416 Barbu, A. and Bridge, A. and Burchill, Z. and Coroian, D. and Dickinson, S. and Fidler, S. and Michaux, A. and Mussman, S. and Narayanaswamy, S. and Salvi, D. and others.
Video In Sentences Out. , UAI, 2012.

1/22/2013 1PM ENS 32NEA Hannaneh Hajishirzi, Julia Hockenmaier, Erik T. Mueller, and Eyal Amir.
Reasoning about RoboCup Soccer Commentaries., UAI, 2011.

Date	Time	Place	Paper
4/23/2013	1PM	GDC 3.416	Michaela Regneri, Marcus Rohrbach, Dominikus Wetzel, Stefan Thater, Bernt Schiele, Manfred Pinkal Grounding Action Descriptions in Videos, TACL, 2013
4/9/2013	1PM	GDC 3.416	Yoav Artzi and Luke Zettlemoyer Weakly Supervised Learning of Semantic Parsers for Mapping Instructions to Actions, TACL, 2013
3/26/2013	1PM	GDC 3.416	C. Liu, R. Fang, and J. Y. Chai Towards Mediating Shared Perceptual Basis in Situated Dialogue, SIGDIAL Meeting on Discourse and Dialogue, 2012
3/5/2013	1PM	GDC 3.416	Richard Socher, Milind Ganjoo, Hamsa Sridhar, Osbert Bastani, Christopher D. Manning, Andrew Y. Ng Zero-Shot Learning Through Cross-Modal Transfer, arXiv, 2013.
2/19/2013	1PM	GDC 3.516	Y. Jia, M. Salzmann, and T. Darrell Learning Cross-modality Similarity for Multinomial Data , ICCV, 2011.
2/5/2013	1PM	GDC 3.416	Barbu, A. and Bridge, A. and Burchill, Z. and Coroian, D. and Dickinson, S. and Fidler, S. and Michaux, A. and Mussman, S. and Narayanaswamy, S. and Salvi, D. and others. Video In Sentences Out. , UAI, 2012.
1/22/2013	1PM	ENS 32NEA	Hannaneh Hajishirzi, Julia Hockenmaier, Erik T. Mueller, and Eyal Amir. Reasoning about RoboCup Soccer Commentaries., UAI, 2011.

Fall 2012

Date Time Place Paper

12/04/2012 1PM ENS 32NEA Cynthia Matuszek, Evan Herbst, Luke Zettlemoyer, Dieter Fox.
Learning to Parse Natural Language Commands to a Robot Control System., ISER, 2012.

11/20/2012 1PM ENS 32NEA Yezhou Yang, Ching Lik Teo, Hal Daume III, and Yiannis Aloimonos
Corpus-Guided Sentence Generation of Natural Images, EMNLP 2011

11/06/2012 1PM ENS 32NEA Frank, M. C., Goodman, N. D., and Tenenbaum, J. B.
Using Speakers' Referential Intentions to Model Early Cross-Situational Word Learning, Psychological Science 20, 578-585, 2009.

10/23/2012 1PM ENS 32NEA Mark Johnson, Katherine Demuth, Michael Frank and Bevan Jones
Synergies in Learning Words and Their Referents, In Proceedings of NIPS 2010.

10/09/2012 1PM ENS 32NEA Mark Johnson, Katherine Demuth and Michael Frank
Exploiting Social Information In Grounded Language Learning Via Grammatical Reduction, In Proceedings of ACL 2012.

09/25/2012 1PM ENS 32NEA Polina Kuznetsova, Vicente Ordonez, Alexander Berg, Tamara Berg and Yejin Choi
Collective Generation of Natural Image Descriptions, In Association for Computational Linguistics (ACL), 2012.

09/11/2012 1PM ENS 32NEA C. Matuszek and N. FitzGerald and L. Zettlemoyer and L. Bo and D. Fox
A Joint Model of Language and Perception for Grounded Attribute Learning, In International Conference on Machine Learning (ICML), July 2012.

Date	Time	Place	Paper
12/04/2012	1PM	ENS 32NEA	Cynthia Matuszek, Evan Herbst, Luke Zettlemoyer, Dieter Fox. Learning to Parse Natural Language Commands to a Robot Control System., ISER, 2012.
11/20/2012	1PM	ENS 32NEA	Yezhou Yang, Ching Lik Teo, Hal Daume III, and Yiannis Aloimonos Corpus-Guided Sentence Generation of Natural Images, EMNLP 2011
11/06/2012	1PM	ENS 32NEA	Frank, M. C., Goodman, N. D., and Tenenbaum, J. B. Using Speakers' Referential Intentions to Model Early Cross-Situational Word Learning, Psychological Science 20, 578-585, 2009.
10/23/2012	1PM	ENS 32NEA	Mark Johnson, Katherine Demuth, Michael Frank and Bevan Jones Synergies in Learning Words and Their Referents, In Proceedings of NIPS 2010.
10/09/2012	1PM	ENS 32NEA	Mark Johnson, Katherine Demuth and Michael Frank Exploiting Social Information In Grounded Language Learning Via Grammatical Reduction, In Proceedings of ACL 2012.
09/25/2012	1PM	ENS 32NEA	Polina Kuznetsova, Vicente Ordonez, Alexander Berg, Tamara Berg and Yejin Choi Collective Generation of Natural Image Descriptions, In Association for Computational Linguistics (ACL), 2012.
09/11/2012	1PM	ENS 32NEA	C. Matuszek and N. FitzGerald and L. Zettlemoyer and L. Bo and D. Fox A Joint Model of Language and Perception for Grounded Attribute Learning, In International Conference on Machine Learning (ICML), July 2012.

Spring 2012

Date Time Place Paper

04/03/2012 3PM ENS 32NEA Chao, C.; Cakmak, M.; Thomaz, A.L
Towards grounding concepts for transfer in goal learning from demonstration, Development and Learning (ICDL), 2011 IEEE International Conference on , vol.2, no., pp.1-6, 24-27 Aug. 2011

03/20/2012 3PM ENS 32NEA Tanvi Motwani and Raymond Mooney
Improving Video Activity Recognition using Object Recognition and Text Mining, Submitted to ECAI,2012

02/28/2012 3PM ENS 32NEA Chee Wee Leong and Rada Mihalcea
Going Beyond Text: A Hybrid Image-Text Approach for Measuring Word Relatedness, in Proceedings of the International Joint Conference on Natural Language Processing (IJCNLP 2011), Chiang Mai, Thailand, November 2011

02/14/2012 3PM ENS 32NEA K, Saenko and T. Darrell
Unsupervised Learning of Visual Sense Models for Polysemous Words, Proc. NIPS, December 2008, Vancouver, Canada

Date	Time	Place	Paper
04/03/2012	3PM	ENS 32NEA	Chao, C.; Cakmak, M.; Thomaz, A.L Towards grounding concepts for transfer in goal learning from demonstration, Development and Learning (ICDL), 2011 IEEE International Conference on , vol.2, no., pp.1-6, 24-27 Aug. 2011
03/20/2012	3PM	ENS 32NEA	Tanvi Motwani and Raymond Mooney Improving Video Activity Recognition using Object Recognition and Text Mining, Submitted to ECAI,2012
02/28/2012	3PM	ENS 32NEA	Chee Wee Leong and Rada Mihalcea Going Beyond Text: A Hybrid Image-Text Approach for Measuring Word Relatedness, in Proceedings of the International Joint Conference on Natural Language Processing (IJCNLP 2011), Chiang Mai, Thailand, November 2011
02/14/2012	3PM	ENS 32NEA	K, Saenko and T. Darrell Unsupervised Learning of Visual Sense Models for Polysemous Words, Proc. NIPS, December 2008, Vancouver, Canada

Fall 2011

Date Time Place Paper

12/1/2011 4PM ENS 32NEA Siming Li, Girish Kulkarni, Tamara Berg, Alex Berg and Yejin Choi
Composing Simple Image Descriptions using Web-scale N-grams, Computational Natural Language Learning (CoNLL), 2011

11/10/2011 4PM ENS 32NEA Girish Kulkarni, Visruth Premraj, Sagnik Dhar, Siming Li, Yejin Choi, Alexander C. Berg, and Tamara Berg
Baby Talk: Understanding and Generating Image Descriptions , Computer Vision and Pattern Recognition (CVPR), 2011

10/27/2011 4PM ENS 32NEA E. Bruni, G.B. Tran and M. Baroni
Distributional semantics from text and images , Proceedings of the EMNLP 2011 Geometrical Models for Natural Language Semantics (GEMS 2011) Workshop, East Stroudsburg PA: ACL, 22-32

10/13/2011 4PM ENS 32NEA Richard Socher, Cliff C. Lin, Andrew Y. Ng, and Christopher D. Manning
Parsing Natural Scenes and Natural Language with Recursive Neural Networks , ICML, 2011

09/29/2011 4PM ENS 32NEA Benjamin Borschinger, Bevan K. Jones and Mark Johnson
Reducing Grounded Learning Tasks To Grammatical Inference, EMNLP, 2011

09/01/2011 4PM ENS 32NEA Stefanie Tellex, Thomas Kollar, Steven Dickerson, Matthew R. Walter, Ashis Gopal Banerjee, Seth Teller, Nicholas Roy
Understanding Natural Language Commands for Robotic Navigation and Mobile Manipulation, AAAI, 2011

Date	Time	Place	Paper
12/1/2011	4PM	ENS 32NEA	Siming Li, Girish Kulkarni, Tamara Berg, Alex Berg and Yejin Choi Composing Simple Image Descriptions using Web-scale N-grams, Computational Natural Language Learning (CoNLL), 2011
11/10/2011	4PM	ENS 32NEA	Girish Kulkarni, Visruth Premraj, Sagnik Dhar, Siming Li, Yejin Choi, Alexander C. Berg, and Tamara Berg Baby Talk: Understanding and Generating Image Descriptions , Computer Vision and Pattern Recognition (CVPR), 2011
10/27/2011	4PM	ENS 32NEA	E. Bruni, G.B. Tran and M. Baroni Distributional semantics from text and images , Proceedings of the EMNLP 2011 Geometrical Models for Natural Language Semantics (GEMS 2011) Workshop, East Stroudsburg PA: ACL, 22-32
10/13/2011	4PM	ENS 32NEA	Richard Socher, Cliff C. Lin, Andrew Y. Ng, and Christopher D. Manning Parsing Natural Scenes and Natural Language with Recursive Neural Networks , ICML, 2011
09/29/2011	4PM	ENS 32NEA	Benjamin Borschinger, Bevan K. Jones and Mark Johnson Reducing Grounded Learning Tasks To Grammatical Inference, EMNLP, 2011
09/01/2011	4PM	ENS 32NEA	Stefanie Tellex, Thomas Kollar, Steven Dickerson, Matthew R. Walter, Ashis Gopal Banerjee, Seth Teller, Nicholas Roy Understanding Natural Language Commands for Robotic Navigation and Mobile Manipulation, AAAI, 2011

Spring 2011

Date Time Place Paper

5/12/2010 4PM ENS 32NEA S.R.K. Branavan, David Silver, and Regina Barzilay
Learning to Win by Reading Manuals in a Monte-Carlo Framework, ACL, 2011

4/28/2010 4PM ENS 32NEA Antoine Bordes, Nicolas Usunier, and Jason Weston
Label Ranking under Ambiguous Supervision for Learning Semantic Correspondences, ICML, 2010

4/14/2010 4PM ENS 32NEA Hilke Reckman, Jeff Orkin, and Deb Roy
Learning meanings of words and constructions, grounded in a virtual game, 10th Conference on Natural Language Processing (KONVENS), 2010

3/31/2010 4PM ENS 32NEA Michael C. Frank, Noah D. Goodman, and Joshua B. Tenenbaum
Using Speakers' Referential Intentions to Model Early Cross-Situational Word Learning, Psychological Science, 2009

3/3/2010 4PM ENS 32NEA K. Prabhakar, S. Oh, P. Wang, G. Abowd, and J. M. Rehg
Temporal Causality for the Analysis of Visual Events, CVPR 2010, Oral presentation

2/3/2010 4PM ENS 32NEA David L. Chen and Raymond J. Mooney
Learning to Interpret Natural Language Navigation Instructions from Observations, Under Submission

1/20/2010 4PM ENS 32NEA Dave Golland, Percy Liang, Dan Klein
A game-theoretic approach to generating spatial descriptions, EMNLP, 2010

Date	Time	Place	Paper
5/12/2010	4PM	ENS 32NEA	S.R.K. Branavan, David Silver, and Regina Barzilay Learning to Win by Reading Manuals in a Monte-Carlo Framework, ACL, 2011
4/28/2010	4PM	ENS 32NEA	Antoine Bordes, Nicolas Usunier, and Jason Weston Label Ranking under Ambiguous Supervision for Learning Semantic Correspondences, ICML, 2010
4/14/2010	4PM	ENS 32NEA	Hilke Reckman, Jeff Orkin, and Deb Roy Learning meanings of words and constructions, grounded in a virtual game, 10th Conference on Natural Language Processing (KONVENS), 2010
3/31/2010	4PM	ENS 32NEA	Michael C. Frank, Noah D. Goodman, and Joshua B. Tenenbaum Using Speakers' Referential Intentions to Model Early Cross-Situational Word Learning, Psychological Science, 2009
3/3/2010	4PM	ENS 32NEA	K. Prabhakar, S. Oh, P. Wang, G. Abowd, and J. M. Rehg Temporal Causality for the Analysis of Visual Events, CVPR 2010, Oral presentation
2/3/2010	4PM	ENS 32NEA	David L. Chen and Raymond J. Mooney Learning to Interpret Natural Language Navigation Instructions from Observations, Under Submission
1/20/2010	4PM	ENS 32NEA	Dave Golland, Percy Liang, Dan Klein A game-theoretic approach to generating spatial descriptions, EMNLP, 2010

Fall 2010

Date Time Place Paper

12/9/2010 3PM ACES 5.444 Yansong Feng and Mirella Lapata
How Many Words is a Picture Worth? Automatic Caption Generation for News Images, ACL, 2010

Tuesday, 11/23/2010 3PM ACES 5.444 T. Kollar, S. Tellex, D. Roy, and N. Roy
Toward Understanding Natural Language Directions, International Conference on Human-Robot Interaction, 2010

11/11/2010 3PM ACES 5.444 A Farhadi, M Hejrati, M A Sadeghi, P Young, C Rashtchian1, J Hockenmaier, D. A. Forsyth
Every Picture Tells a Story: Generating Sentences from Images, ECCV, 2010

10/28/2010 3PM ACES 5.444 Adam Vogel and Dan Jurafsky
Learning to Follow Navigational Directions, ACL, 2010

Date	Time	Place	Paper
12/9/2010	3PM	ACES 5.444	Yansong Feng and Mirella Lapata How Many Words is a Picture Worth? Automatic Caption Generation for News Images, ACL, 2010
Tuesday, 11/23/2010	3PM	ACES 5.444	T. Kollar, S. Tellex, D. Roy, and N. Roy Toward Understanding Natural Language Directions, International Conference on Human-Robot Interaction, 2010
11/11/2010	3PM	ACES 5.444	A Farhadi, M Hejrati, M A Sadeghi, P Young, C Rashtchian1, J Hockenmaier, D. A. Forsyth Every Picture Tells a Story: Generating Sentences from Images, ECCV, 2010
10/28/2010	3PM	ACES 5.444	Adam Vogel and Dan Jurafsky Learning to Follow Navigational Directions, ACL, 2010

Spring 2010

Date Time Place Paper

5/26/2010 3PM ACES 3.116 Joo Hyun Kim, and Raymond J. Mooney
Generative Alignment and Semantic Parsing for Learning from Ambiguous Supervision, Under submission to Coling 2010

5/13/2010 4PM ACES 3.116 S. Qu and J. Y. Chai
Context-based Word Acquisition for Situated Dialogue in a Virtual World, JAIR, 2010

4/29/2010 4PM ACES 3.116 Stefanie Tellex and Deb Roy
Grounding Spatial Prepositions for Video Search, ICMI, 2009

4/15/2010 4PM ACES 3.116 Kristy Elizabeth Boyer, Robert Phillips, Amy Ingram, Eun Young Ha, Michael Wallis, Mladen Vouk, and James Lester
Characterizing the Effectiveness of Tutorial Dialogue with Hidden Markov Models, International Conference on Intelligent Tutoring Systems, 2010

3/11/2010 4PM ACES 3.116 Nobuyuki Shimizu, and Andrew Haas
Learning to Follow Navigational Route Instructions, IJCAI, 2009

2/25/2010 4PM ACES 3.116 Jeff Orkin and Deb Roy
Automatic Learning and Generation of Social Behavior from Collective Human Gameplay, AAMAS, 2009

2/11/2010 4PM ACES 3.116 Anton Leuski, and David Traum
A Statistical Approach for Text Processing in Virtual Humans, Army Science Conference, 2008

1/28/2010 4PM ACES 3.116 Nicholas Morsillo, Christopher Pal, and Randal Nelson
Semi-Supervised Learning of Visual Classifiers from Web Images and Text, IJCAI, 2009

Date	Time	Place	Paper
5/26/2010	3PM	ACES 3.116	Joo Hyun Kim, and Raymond J. Mooney Generative Alignment and Semantic Parsing for Learning from Ambiguous Supervision, Under submission to Coling 2010
5/13/2010	4PM	ACES 3.116	S. Qu and J. Y. Chai Context-based Word Acquisition for Situated Dialogue in a Virtual World, JAIR, 2010
4/29/2010	4PM	ACES 3.116	Stefanie Tellex and Deb Roy Grounding Spatial Prepositions for Video Search, ICMI, 2009
4/15/2010	4PM	ACES 3.116	Kristy Elizabeth Boyer, Robert Phillips, Amy Ingram, Eun Young Ha, Michael Wallis, Mladen Vouk, and James Lester Characterizing the Effectiveness of Tutorial Dialogue with Hidden Markov Models, International Conference on Intelligent Tutoring Systems, 2010
3/11/2010	4PM	ACES 3.116	Nobuyuki Shimizu, and Andrew Haas Learning to Follow Navigational Route Instructions, IJCAI, 2009
2/25/2010	4PM	ACES 3.116	Jeff Orkin and Deb Roy Automatic Learning and Generation of Social Behavior from Collective Human Gameplay, AAMAS, 2009
2/11/2010	4PM	ACES 3.116	Anton Leuski, and David Traum A Statistical Approach for Text Processing in Virtual Humans, Army Science Conference, 2008
1/28/2010	4PM	ACES 3.116	Nicholas Morsillo, Christopher Pal, and Randal Nelson Semi-Supervised Learning of Visual Classifiers from Web Images and Text, IJCAI, 2009

Fall 2009

Date Time Place Paper

12/3/2009 11AM ACES 5.444 Donna Byron, Alexander Koller, Kristina Striegnitz, Justine Cassell, Robert Dale, Johanna Moore, and Jon Oberlander
Report on the First NLG Challenge on Generating Instructions in Virtual Environments (GIVE), ENLG, 2009
For more information about the challenge and the individual systems, visit the GIVE Challenge website

11/12/2009 11AM ACES 5.444 Benjamin Snyder, and Regina Barzilay
Database-Text Alignment via Structured Multilabel Classification, IJCAI, 2007

10/29/2009 11AM ACES 5.444 T. Regier, and L. Carlson
Grounding spatial language in perception: An empirical and computational investigation, Journal of Experimental Psychology, 2001

10/15/2009 11AM ACES 5.444 Patrick Buehler, Mark Everingham, and Andrew Zisserman
Learning sign language by watching TV (using weakly aligned subtitles), CVPR, 2009

10/1/2009 11AM ACES 5.444 Percy Liang, Michael I. Jordan, and Dan Klein
Learning semantic correspondences with less supervision, ACL-IJCNLP, 2009

9/17/2009 11AM ACES 5.444 Tessa Lau, Clemens Drews, and Jeffrey Nichols
Interpreting Written How-To Instructions, IJCAI, 2009

9/3/2009 11AM ACES 5.444 S.R.K. Branavan, Harr Chen, Luke S. Zettlemoyer, and Regina Barzilay
Reinforcement Learning for Mapping Instructions to Actions, ACL-IJCNLP, 2009

Date	Time	Place	Paper
12/3/2009	11AM	ACES 5.444	Donna Byron, Alexander Koller, Kristina Striegnitz, Justine Cassell, Robert Dale, Johanna Moore, and Jon Oberlander Report on the First NLG Challenge on Generating Instructions in Virtual Environments (GIVE), ENLG, 2009 For more information about the challenge and the individual systems, visit the GIVE Challenge website
11/12/2009	11AM	ACES 5.444	Benjamin Snyder, and Regina Barzilay Database-Text Alignment via Structured Multilabel Classification, IJCAI, 2007
10/29/2009	11AM	ACES 5.444	T. Regier, and L. Carlson Grounding spatial language in perception: An empirical and computational investigation, Journal of Experimental Psychology, 2001
10/15/2009	11AM	ACES 5.444	Patrick Buehler, Mark Everingham, and Andrew Zisserman Learning sign language by watching TV (using weakly aligned subtitles), CVPR, 2009
10/1/2009	11AM	ACES 5.444	Percy Liang, Michael I. Jordan, and Dan Klein Learning semantic correspondences with less supervision, ACL-IJCNLP, 2009
9/17/2009	11AM	ACES 5.444	Tessa Lau, Clemens Drews, and Jeffrey Nichols Interpreting Written How-To Instructions, IJCAI, 2009
9/3/2009	11AM	ACES 5.444	S.R.K. Branavan, Harr Chen, Luke S. Zettlemoyer, and Regina Barzilay Reinforcement Learning for Mapping Instructions to Actions, ACL-IJCNLP, 2009

Spring 2009

Date Time Place Paper

5/15/2009 1PM ACES 5.444 Matt MacMahon, Brian Stankiewicz, and Benjamin Kuipers
Walk the Talk: Connecting Language, Knowledge, and Action in Route Instructions
AAAI, 2006

5/1/2009 1PM ACES 5.444 Sonal Gupta and Raymond J. Mooney
Using Closed Captions to Train Activity Recognizers that Improve Video Retrieval
Under submission to ACM Multimedia 2009
Workshop version

4/24/2009 1PM ACES 5.444 Ivan Laptev, Marcin Marszalek, Cordelia Schmid and Benjamin Rosenfeld
Learning realistic human actions from movies, CVPR, 2008
Note: Cordelia Schmid is giving a talk on Monday (4/27)

4/10/2009 1PM ACES 5.444 Kevin Gold, and Brian Scassellati
A Robot That Uses Existing Vocabulary to Infer Non-Visual Word Meanings from Observation, AAAI, 2007

3/6/2009 1PM ACES 5.444 A. Gupta and L. Davis
Beyond Nouns: Exploiting Prepositions and Comparative Adjectives for Learning Visual Classifiers, ECCV, 2008

2/20/2009 1PM ACES 5.444 Wesley Kerr, Paul R. Cohen, and Yu-Han Chang
Learning and Playing in Wubble World, AIIDE, 2008

2/6/2009 1PM ACES 5.444 T. Cour, C. Jordan, E. Miltsakaki, and B. Taskar
Movie/Script: Alignment and Parsing of Video and Text Transcription, ECCV, 2008

Date	Time	Place	Paper
5/15/2009	1PM	ACES 5.444	Matt MacMahon, Brian Stankiewicz, and Benjamin Kuipers Walk the Talk: Connecting Language, Knowledge, and Action in Route Instructions AAAI, 2006
5/1/2009	1PM	ACES 5.444	Sonal Gupta and Raymond J. Mooney Using Closed Captions to Train Activity Recognizers that Improve Video Retrieval Under submission to ACM Multimedia 2009 Workshop version
4/24/2009	1PM	ACES 5.444	Ivan Laptev, Marcin Marszalek, Cordelia Schmid and Benjamin Rosenfeld Learning realistic human actions from movies, CVPR, 2008 Note: Cordelia Schmid is giving a talk on Monday (4/27)
4/10/2009	1PM	ACES 5.444	Kevin Gold, and Brian Scassellati A Robot That Uses Existing Vocabulary to Infer Non-Visual Word Meanings from Observation, AAAI, 2007
3/6/2009	1PM	ACES 5.444	A. Gupta and L. Davis Beyond Nouns: Exploiting Prepositions and Comparative Adjectives for Learning Visual Classifiers, ECCV, 2008
2/20/2009	1PM	ACES 5.444	Wesley Kerr, Paul R. Cohen, and Yu-Han Chang Learning and Playing in Wubble World, AIIDE, 2008
2/6/2009	1PM	ACES 5.444	T. Cour, C. Jordan, E. Miltsakaki, and B. Taskar Movie/Script: Alignment and Parsing of Video and Text Transcription, ECCV, 2008

Fall 2008

Date Time Place Paper

11/13/2008 4PM ACES 5.444 Noboru Babaguchi, Yoshihiko Kawai, Tadahiro Kitahashi
Event Based Indexing of Broadcasted Sports Video by Intermodal Collaboration , IEEE Transactions on Multimedia, 2002

10/16/2008 4PM ACES 5.444 Michael Fleischman and Deb Roy
Grounded Language Modeling for Automatic Speech Recognition of Sports Video, ACL-08:HLT

10/02/2007 4PM ACES 5.444 Rama Bindiganavale, William Schuler, Jan M. Allbeck, Norman I. Badler, Aravind K. Joshi, Martha Palmer
Dynamically Altering Agent Behaviors Using Natural Language Insctructions, Conference on Autonomous Agents, 2000

09/04/2007 4PM ACES 3.116 Practice talk for Sonal
Sonal Gupta, Joohyun Kim, Kristen Grauman and Raymond Mooney
Watch, Listen & Learn: Co-training on Captioned Images and Videos , ECML, 2008

Date	Time	Place	Paper
11/13/2008	4PM	ACES 5.444	Noboru Babaguchi, Yoshihiko Kawai, Tadahiro Kitahashi Event Based Indexing of Broadcasted Sports Video by Intermodal Collaboration , IEEE Transactions on Multimedia, 2002
10/16/2008	4PM	ACES 5.444	Michael Fleischman and Deb Roy Grounded Language Modeling for Automatic Speech Recognition of Sports Video, ACL-08:HLT
10/02/2007	4PM	ACES 5.444	Rama Bindiganavale, William Schuler, Jan M. Allbeck, Norman I. Badler, Aravind K. Joshi, Martha Palmer Dynamically Altering Agent Behaviors Using Natural Language Insctructions, Conference on Autonomous Agents, 2000
09/04/2007	4PM	ACES 3.116	Practice talk for Sonal Sonal Gupta, Joohyun Kim, Kristen Grauman and Raymond Mooney Watch, Listen & Learn: Co-training on Captioned Images and Videos , ECML, 2008

Summer 2008

Date Time Place Paper

05/30/2008 2PM ENS 32NEA J Wang, L Duan, L Xu, H Lu, JS Jin
TV Ad Video Categorization with Probabilistic Latent Concept Learning, Workshop on Multimedia Information Retrieval, 2007

Date	Time	Place	Paper
05/30/2008	2PM	ENS 32NEA	J Wang, L Duan, L Xu, H Lu, JS Jin TV Ad Video Categorization with Probabilistic Latent Concept Learning, Workshop on Multimedia Information Retrieval, 2007

Spring 2008

Date Time Place Paper

05/02/2008 10AM ENS 32NEA M.R. Everingham, J. Sivic, and A. Zisserman
"Hello! My name ... Buffy" - Automatic Naming of Characters in TV Video, BMVC, 2006

04/18/2008 10AM ENS 32NEA Fleischman, M. and Roy, D.
Situated Models of Meaning for Sports Video Retrieval, HLT/NAACL, 2007

04/04/2008 10AM ENS 32NEA Kobus Barnard, Pinar Duygulu, David Forsyth, Nando de Freitas, David M. Blei, Michael I. Jordan
Matching Words and Pictures, JMLR, 2003

03/07/2008 10AM ENS 32NEA M. Jamieson A. Fazly, S. Dickinson, S. Stevenson, S. Wachsmuth
Learning Structured Appearance Models from Captioned Images of Cluttered Scenes, IEEE International Conference on Computer Vision (ICCV), 2007

02/22/2008 *11AM ENS 32NEA Sonal Gupta, Joo Hyun Kim, and Raymond J. Mooney
Co-training with Images and Text Captions, Under submission to ICML 2008

02/08/2008 10AM ENS 32NEA David L. Chen and Raymond J. Mooney
Learning to Sportscast: A Test of Grounded Language Acquisition, Under submission to ICML 2008

01/25/2008 10AM ENS 32NEA K. Forbus, C. Riesbeck, L. Birnbaum, K. Livingston, A. Sharma, L. Ureel
Integrating Natural Language, Knowledge Representation and Reasonin\ g, and Analogical Processing to Learn by Reading, AAAI, 2007

Date	Time	Place	Paper
05/02/2008	10AM	ENS 32NEA	M.R. Everingham, J. Sivic, and A. Zisserman "Hello! My name ... Buffy" - Automatic Naming of Characters in TV Video, BMVC, 2006
04/18/2008	10AM	ENS 32NEA	Fleischman, M. and Roy, D. Situated Models of Meaning for Sports Video Retrieval, HLT/NAACL, 2007
04/04/2008	10AM	ENS 32NEA	Kobus Barnard, Pinar Duygulu, David Forsyth, Nando de Freitas, David M. Blei, Michael I. Jordan Matching Words and Pictures, JMLR, 2003
03/07/2008	10AM	ENS 32NEA	M. Jamieson A. Fazly, S. Dickinson, S. Stevenson, S. Wachsmuth Learning Structured Appearance Models from Captioned Images of Cluttered Scenes, IEEE International Conference on Computer Vision (ICCV), 2007
02/22/2008	*11AM	ENS 32NEA	Sonal Gupta, Joo Hyun Kim, and Raymond J. Mooney Co-training with Images and Text Captions, Under submission to ICML 2008
02/08/2008	10AM	ENS 32NEA	David L. Chen and Raymond J. Mooney Learning to Sportscast: A Test of Grounded Language Acquisition, Under submission to ICML 2008
01/25/2008	10AM	ENS 32NEA	K. Forbus, C. Riesbeck, L. Birnbaum, K. Livingston, A. Sharma, L. Ureel Integrating Natural Language, Knowledge Representation and Reasonin\ g, and Analogical Processing to Learn by Reading, AAAI, 2007

Fall 2007

Date Time Place Paper

11/28/2007 1PM ACES 5.444 L. Zhu, Y. Chen, and A.L. Yuille
Unsupervised Learning of a Probabilistic Grammar for Object Detection and Parsing, NIPS, 2006

11/14/2007 1PM ACES 5.444 J. K.Aggarwal and Sangho Park
Human Motion: Modeling and Recognition of Actions and Interactions, 3D Data Processing, Visualization and Transmission, 2004

10/31/2007 1PM ACES 5.444 Jeffrey M. Siskind
A computational study of cross-situational techniques for learning word-to-meaning mappings, Cognition, 1996

10/17/2007 1PM ACES 5.444 Peter Gorniak, Deb Roy
Speaking with your Sidekick: Understanding Situated Speech in Computer Role Playing Games, AIIDE, 2005

10/3/2007 1PM ACES 5.444 Geert-Jan M. Kruijff, Hendrik Zender, Patric Jensfelt, and Henrik I. Christensen
Situated dialogue and spatial organization: What, where... and why?, International Journal of Advanced Robotic Systems, 4(2), 2007

09/19/2007 1PM ACES 5.444 Deb K. Roy, Alex P. Pentland
Learning words from sights and sounds: a computational model, Cognitive Science, 2002

09/05/2007 1PM ACES 5.444 Xiaojin Zhu, Andrew B. Goldberg, Mohamed Eldawy, Charles R. Dyer, Bradley Strock
A Text-to-Picture Synthesis System for Augmenting Communication, AAAI, 2007

Date	Time	Place	Paper
11/28/2007	1PM	ACES 5.444	L. Zhu, Y. Chen, and A.L. Yuille Unsupervised Learning of a Probabilistic Grammar for Object Detection and Parsing, NIPS, 2006
11/14/2007	1PM	ACES 5.444	J. K.Aggarwal and Sangho Park Human Motion: Modeling and Recognition of Actions and Interactions, 3D Data Processing, Visualization and Transmission, 2004
10/31/2007	1PM	ACES 5.444	Jeffrey M. Siskind A computational study of cross-situational techniques for learning word-to-meaning mappings, Cognition, 1996
10/17/2007	1PM	ACES 5.444	Peter Gorniak, Deb Roy Speaking with your Sidekick: Understanding Situated Speech in Computer Role Playing Games, AIIDE, 2005
10/3/2007	1PM	ACES 5.444	Geert-Jan M. Kruijff, Hendrik Zender, Patric Jensfelt, and Henrik I. Christensen Situated dialogue and spatial organization: What, where... and why?, International Journal of Advanced Robotic Systems, 4(2), 2007
09/19/2007	1PM	ACES 5.444	Deb K. Roy, Alex P. Pentland Learning words from sights and sounds: a computational model, Cognitive Science, 2002
09/05/2007	1PM	ACES 5.444	Xiaojin Zhu, Andrew B. Goldberg, Mohamed Eldawy, Charles R. Dyer, Bradley Strock A Text-to-Picture Synthesis System for Augmenting Communication, AAAI, 2007

Summer 2007

Date Time Place Paper

07/30/2007 4PM ACES 5.444 Chen Yu, Dana Ballard, Richard Aslin
The Role of Embodied Intention in Early Lexical Acquisition, Cognitive Science, 2005

07/16/2007 4PM ACES 5.444 Deb Roy and Ehud Reiter
Connecting language to the world, AIJ, Volume 167, Issues 1-2, September 2005, Pages 1-12
Kobus Barnard and Matthew Johnson
Word sense disambiguation with pictures, AIJ, Volume 167, Issues 1-2, September 2005, Pages 13-30

07/02/2007 3PM ACES 5.444 D. Marocco and S. Nolfi
Emergence of communication in embodied agents evolved for the ability to solve a collective navigation problem
Connection Science, 19(1):53-74, March 2007.

06/11/2007 3PM ACES 5.444 Sudheendra Narasimhan, Pushkala Iyer, and Kristen Grauman
Keywords to Visual Categories: Multiple-Instance Learning for Unsupervised Object Recognition, Under submission to NIPS, 2007

05/29/2007
*TUESDAY 3PM ACES 5.444 Tamara L. Berg, Alexander C. Berg, Jaety Edwards, and D. A. Forsyth
Who's in the Picture, NIPS, 2004
Rohini K. Srihari and Debra T. Burhans
Visual Semantics: Extracting Visual Information from Text Accompanying Pictures, AAAI, 1994

Date	Time	Place	Paper
07/30/2007	4PM	ACES 5.444	Chen Yu, Dana Ballard, Richard Aslin The Role of Embodied Intention in Early Lexical Acquisition, Cognitive Science, 2005
07/16/2007	4PM	ACES 5.444	Deb Roy and Ehud Reiter Connecting language to the world, AIJ, Volume 167, Issues 1-2, September 2005, Pages 1-12 Kobus Barnard and Matthew Johnson Word sense disambiguation with pictures, AIJ, Volume 167, Issues 1-2, September 2005, Pages 13-30
07/02/2007	3PM	ACES 5.444	D. Marocco and S. Nolfi Emergence of communication in embodied agents evolved for the ability to solve a collective navigation problem Connection Science, 19(1):53-74, March 2007.
06/11/2007	3PM	ACES 5.444	Sudheendra Narasimhan, Pushkala Iyer, and Kristen Grauman Keywords to Visual Categories: Multiple-Instance Learning for Unsupervised Object Recognition, Under submission to NIPS, 2007
05/29/2007 *TUESDAY	3PM	ACES 5.444	Tamara L. Berg, Alexander C. Berg, Jaety Edwards, and D. A. Forsyth Who's in the Picture, NIPS, 2004 Rohini K. Srihari and Debra T. Burhans Visual Semantics: Extracting Visual Information from Text Accompanying Pictures, AAAI, 1994