[ Meeting Schedule | Previous Discussions | Subscribe to Email List ]

Previous Discussions

[ Spring 2019 | Fall 2018 | Spring 2018 | Fall 2017 | Spring 2017 | Fall 2016 | Spring 2016 | Fall 2015 | Summer 2015 | Spring 2015 | Spring 2014 | Fall 2013 | Spring 2013 | Fall 2012 | Spring 2012 | Fall 2011 | Spring 2011 | Fall 2010 | Spring 2010 | Fall 2009 | Spring 2009 | Fall 2008 | Summer 2008 | Spring 2008 | Fall 2007 | Summer 2007 ]

Spring 2019

Back to Top
04/23/2019 3:30PM GDC 3.516 Kaihua Tang, Hanwang Zhang, Baoyuan Wu, Wenhan Luo, Wei Liu
Learning to Compose Dynamic Tree Structures for Visual Contexts, CVPR, 2019
04/08/2019 12PM GDC 3.816 Xin Wang, Qiuyuan Huang, Asli Celikyilmaz, Jianfeng Gao, Dinghan Shen, Yuan-Fang Wang, William Yang Wang, Lei Zhang
Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation, CVPR, 2019
03/28/2019 12PM GDC 3.816 Chaorui Deng, Qi Wu, Guanghui Xu, Zhuliang Yu, Yanwu Xu, Kui Jia, Mingkui Tan
You Only Look & Listen Once: Towards Fast and Accurate Visual Grounding, Arxiv, 2018
03/04/2019 12PM GDC 3.816 Rowan Zellers, Yonatan Bisk, Ali Farhadi, Yejin Choi
From Recognition to Cognition: Visual Commonsense Reasoning, Arxiv, 2018
02/11/2019 12PM GDC 3.816 Zhilin Yang, Saizheng Zhang, Jack Urbanek, Will Feng, Alexander H. Miller, Arthur Szlam, Douwe Kiela, Jason Weston
Mastering the Dungeon: Grounded Language Learning by Mechanical Turker Descent, ICLR, 2019
05/07/2019 3:30PM GDC 3.816 Fuwen Tan, Song Feng, Vicente Ordonez
Text2Scene: Generating Compositional Scenes from Textual Descriptions, CVPR, 2019

Fall 2018

Back to Top
10/23/2018 11:00AM GDC 3.816 Dipendra Misra, Andrew Bennett, Valts Blukis, Eyvind Niklasson, Max Shatkhin and Yoav Artzi
Mapping Instructions to Actions in 3D Environments with Visual Goal Prediction, EMNLP, 2018
11/06/2018 11:00AM GDC 3.816 Dongchen Yu, Xing Gao and Hongkai Xiong
Structured Semantic Representation for Visual Question Answering, ICIP, 2018
10/23/2018 11:00AM GDC 3.816 Diane Bouchacourt and Marco Baroni
How Agents See Things: On Visual Representations in an Emergent Language Game, EMNLP, 2018
10/09/2018 11:00AM GDC 3.816 Andrea F. Daniele, Mohit Bansal, Matthew R. Walter
Navigational Instruction Generation as Inverse Reinforcement Learning with Neural Machine Translation, HRI, 2017
09/25/2018 11:00AM GDC 3.816 Peter Anderson, Xiaodong He, Chris Buehler, Damien Teney, Mark Johnson, Stephen Gould, Lei Zhang
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering, CVPR, 2018
09/11/2018 11:00AM GDC 3.816 Peter Anderson, Qi Wu, Damien Teney, Jake Bruce, Mark Johnson, Niko Sünderhauf, Ian Reid, Stephen Gould, Anton van den Hengel
Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments, CVPR, 2018

Spring 2018

Back to Top
4/30/2018 1:00PM GDC 3.816 Francesco Barbieri, Miguel Ballesteros, Francesco Ronzano, and Horacio Saggion
Multimodal Emoji Prediction, NAACL, 2018
4/16/2018 1:00PM GDC 3.816 Jacob Andreas, Dan Klein, and Sergey Levine
Learning with Latent Language, NAACL, 2018
4/2/2018 1:00PM GDC 3.816 Ronghang Hu, Marcus Rohrbach, Jacob Andreas, Trevor Darrell, and Kate Saenko
Modeling Relationships in Referential Expressions with Compositional Modular Networks, CVPR, 2017
3/19/2018 1:00PM GDC 3.816 Fanyi Xiao, Leonid Sigal, and Yong Jae Lee
Weakly-supervised Visual Grounding of Phrases with Linguistic Structures, CVPR, 2017
2/26/2018 1:00PM GDC 3.816 Jae Sung Park, Biao Jia, Mohit Bansal, and Dinesh Manocha
Generating Realtime Motion Plans from Attribute-Based Natural Language Instructions Using Dynamic Constraint Mapping, arXiv preprint, 2017
2/12/2018 1:00PM GDC 3.816 Lisa Anne Hendricks, Ronghang Hu, Trevor Darrell, and Zeynep Akata
Grounding Visual Explanations (Extended Abstract), arXiv preprint, 2017 +
Rakshith Shetty, Marcus Rohrbach, Lisa Anne Hendricks, Mario Fritz, and Bernt Schiele
Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training, ICCV, 2017
1/29/2018 1:00PM GDC 3.816 Abhishek Das, Samyak Datta, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra
Embodied Question Answering, arXiv preprint, 2017

Fall 2017

Back to Top
12/05/2017 11:00AM GDC 3.816 Bolei Zhou, David Bau, Aude Oliva, and Antonio Torralba
Interpreting Deep Visual Representations via Network Dissection, CVPR, 2017
11/21/2017 11:00AM GDC 3.816 S. Palazzo, C. Spampinato, I.Kavasidis, D. Giordano
Generative Adversarial Networks Conditioned by Brain Signals, ICCV, 2017
11/07/2017 11:00AM GDC 3.816 Ishan Misra, Abhinav Gupta, and Martial Hebert
From Red Wine to Red Tomato: Composition with Context, CVPR, 2017
10/24/2017 11:00AM GDC 3.816 Scott L. Fairhall and Alfonso Caramazza
Brain Regions That Represent Amodal Conceptual Knowledge, The Journal of Neuroscience, 2013
09/26/2017 11:00AM GDC 3.816 Dong Huk Park, Lisa Anne Hendricks, Zeynep Akata, Bernt Schiele, Trevor Darrell, and Marcus Rohrbach
Attentive Explanations: Justifying Decisions and Pointing to the Evidence, arXiv
09/12/2017 11:00AM GDC 3.816 Maxwell Forbes and Yejin Choi
VERB PHYSICS: Relative Physical Knowledge of Actions and Objects, ACL 2017

Spring 2017

Back to Top
05/01/2017 3:30PM GDC 3.816 Yonatan Bisk, Deniz Yuret, Daniel Marcu
Natural Language Communication with Robots, NAACL 2016
04/17/2017 3:30PM GDC 3.816 Abhishek Das, Satwik Kottur, José M.F. Moura, Stefan Lee, Dhruv Batra
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning, arXiv 2017
04/3/2017 3:30PM GDC 3.816 Muhannad Alomari, Paul Duckworth, Davic C. Hogg, Anthony G. Cohn
Natural Language Acquisition and Grounding for Embodied Robotic Systems, AAAI 2017
03/13/2017 3:30PM GDC 3.816 Carina Silberer, Vittorio Ferrari, Mirella Lapata
Visually Grounded Meaning Representations, IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016
02/27/2017 3:30PM GDC 3.816 Harm de Vries, Florian Strub, Sarath Chandar, Olivier Pietquin, Hugo Larochelle, Aaron Courville
GuessWhat?! Visual object discovery through multi-modal dialogue, arXiv, November 2016
02/06/2017 3:30PM GDC 3.816 Aishwarya Padmakumar and Jesse Thomason and Raymond J. Mooney
Integrated Learning of Dialog Strategies and Semantic Parsing, EACL 2017
01/23/2017 3:30PM GDC 3.816 Emanuele Bastianelli, Danilo Croce, Andrea Vanzo, Roberto Basili, and Daniele Nardi
A Discriminative Approach to Grounded Spoken Language Understanding in Interactive Robotics, IJCAI 2016

Fall 2016

Back to Top
12/05/2016 3:30PM GDC 3.816 Jaeyong Sung, Seok Hyun Jin, and Ashutosh Saxena
Robobarista: Object Part based Transfer of Manipulation Trajectories from Crowd-sourcing in 3D Pointclouds, ISRR 2015
11/21/2016 3:30PM GDC 3.816 Sergio Guadarrama, Erik Rodner, Kate Saenko, Trevor Darrell
Understanding object descriptions in robotics by open-vocabulary object retrieval and detection, IJRR 2016
11/07/2016 3:30PM GDC 3.816 David Whitney, Miles Eldon, John Oberlin, Stefanie Tellex
Interpreting Multimodal Referring Expressions in Real Time, IEEE 2016
10/24/2016 3:30PM GDC 3.816 Scott Reed, Zeynep Akata, Xinchen Yan, Lajanugen Logeswaran, Bernt Schiele, Honglak Lee
Generative Adversarial Text to Image Synthesis, ICML 2016
10/10/2016 3:30PM GDC 3.816 Antoine Bosselut, Jianfu Chen, David Warren, Hannaneh Hajishirzi, and Yejin Choi
Learning Prototypical Event Structure from Photo Albums, ACL 2016
9/26/2016 3:30PM GDC 3.816 Lisa Anne Hendricks, Zeynep Akata, Marcus Rohrbach, Jeff Donahue, Bernt Schiele, Trevor Darrell
Generating Visual Explanations, arXiv 2016
9/12/2016 3:30PM GDC 3.816 Gordon Christie, Ankit Laddha, Aishwarya Agrawal, Stanislaw Antol, Yash Goyal, Kevin Kochersberger, Dhruv Batra
Resolving Vision and Language Ambiguities Together: Joint Segmentation & Prepositional Attachment Resolution in Captioned Scenes, EMNLP 2016
8/29/2016 3:30PM GDC 3.816 Angel Chang, Will Monroe, Manolis Savva, Christopher Potts, Christopher D. Manning
Text to 3D Scene Generation with Rich Lexical Grounding, ACL 2015

Spring 2016

Back to Top
5/3/2016 10AM GDC 3.816 Jacob Andreas, Marcus Rohrbach, Trevor Darrell, Dan Klein
Learning to Compose Neural Networks for Question Answering, NAACL 2016
4/19/2016 10AM GDC 3.816 Giampiero Salvi, Luis Montesano, Alexandre Bernardino, Jose Santos-Victor
Language bootstrapping: Learning word meanings from perception-action association, IEEE 2011
4/5/2016 10AM GDC 3.816 Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh
VQA: Visual Question Answering, ICCV 2015
3/22/2016 10AM GDC 3.816 Ivan Vendrov, Ryan Kiros, Sanja Fidler, Raquel Urtasun
Order-embeddings of Images and Language, ICLR 2016
3/8/2016 10AM GDC 3.816 Hongyuan Mei, Mohit Bansal, and Matthew R. Walter
Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to Action Sequences, AAAI 2016
2/9/2016 10AM GDC 3.816 Tom Williams, Gordon Briggs, Bradley Oosterveld, and Matthias Scheutz
Going Beyond Literal Command-Based Instructions: Extending Robotic Natural Language Interaction Capabilities, AAAI 2015
1/26/2016 10AM GDC 3.816 Angeliki Lazaridou, Nghia The Pham, and Marco Baroni
Combining Language and Vision with a Multimodal Skip-gram Model, NAACL 2015

Fall 2015

Back to Top
12/04/2015 10AM GDC 3.816 Licheng Yu, Eunbyung Park, Alexander C. Berg, and Tamara L. Berg
Visual Madlibs: Fill in the blank Image Generation and Question Answering, ICCV 2015
11/13/2015 11AM GDC 3.816 Michael Spranger and Luc Steels
Co-Acquisition of Syntax and Semantics - An Investigation in Spatial Language, IJCAI 2015
10/30/2015 10AM GDC 3.816 Angeliki Lazaridou, Dat Tien Nguyen, and Marco Baroni
Do Distributed Semantic Models Dream of Electric Sheep? Visualizing Word Representations through Image Synthesis, EMNLP 2015
10/16/2015 11AM GDC 3.816 Douwe Kiela and Stephen Clark
Multi- and Cross-Modal Semantics Beyond Vision: Grounding in Auditory Perception, EMNLP 2015
10/02/2015 11AM GDC 3.816 Yukun Zhu, Ryan Kiros, Richard Zemel, Ruslan Salakhutdinov, Raquel Urtasun, Antonio Torralba, Sanja Fidler
Aligning Books and Movies: Towards Story-like Visual Explanations by Watching Movies and Reading Books, ICCV 2015

supplementary: Ryan Kiros, Yukun Zhu, Ruslan Salakhutdinov, Richard S. Zemel, Antonio Torralba, Raquel Urtasun, Sanja Fidler
Skip-Thought Vectors, NIPS 2015
09/18/2015 11AM GDC 3.816 Dongcai Lu and Xiaoping Chen
Towards an Architecture Combining Grounding and Planning for Human-Robot Interaction, RoboCup Symposium, 2015
09/09/2015 4PM GDC 3.816 Daniel Paul Barrett, Scott Alan Bronikowski, Haonan Yu, Jeffrey Mark Siskind
Robot Language Learning, Generation, and Comprehension, arXiv, 2015

Summer 2015

Back to Top
08/11/2015 1:00pm GDC 3.816 Natalie Parde, Adam Hair, Michalis Papakostas, Konstantinos Tsiakas, Maria Dagioglou, Vangelis Karkaletsis, Rodney D. Nielsen
Grounding the Meaning of Words through Vision and Interactive Gameplay, IJCAI, 2015
07/21/2015 1:00pm GDC 3.816 Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, Alen L. Yuille
Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images, arXiv, 2015
07/14/2015 1:00pm GDC 3.816 Yürüten, Onur, Erol Şahin, and Sinan Kalkan
The learning of adjectives and nouns from affordance and appearance features, Adaptive Behavior 21.6 (2013): 437-451
06/30/2015 1:00pm GDC 3.816 Brian McMahan and Matthew Stone
A Bayesian Model of Grounded Color Semantics, NAACL, 2015
05/04/2015 12:00pm GDC 3.816 Mateusz Malinowski and Mario Fritz
A Multi-World Approach to Question Answering about Real-World Scenes based on Uncertain Input, NIPS, 2014

Spring 2015

Back to Top
04/27/2015 11:00am GDC 3.816 Mohan Sridharan, Jeremy Wyatt, and Richard Dearden
HiPPo: Hierarchical POMDPs for Planning Information Processing and Sensing Actions on a Robot, ICAPS, 2008<$
04/20/2015 12:00pm GDC 3.816 Li Yao, Atousa Torabi, Kyunghyun Cho, Nicolas Ballas, Christopher Pal, Hugo Larochelle, and Aaron Courville
Video Description Generation Incorporating Spatio-Temporal Features and a Soft-Attention Mechanism, arXiv, 2015
04/13/2015 12:00pm GDC 3.816 Changsong Liu and Joyce Y. Chai
Learning to Mediate Perceptual Differences in Situated Human-Robot Dialogue, AAAI, 2015
04/06/2015 12:00pm GDC 3.816 Jonathan Malmaud, Jonathan Huang, Vivek Rathod, Nick Johnston, Andrew Rabinovich, and Kevin Murphy
What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision, arXiv, 2015
03/30/2015 11:00am GDC 3.816 Cynthia Matuszek, Liefeng Bo, Luke Zettlemoyer, Dieter Fox
Learning from Unscripted Deictic Gesture and Language for Human-Robot Interactions, AAAI, 2014
03/09/2015 11:00am GDC 3.816 Hao Fang, Saurabh Gupta, Forrest Iandola, Rupesh Srivastava, Li Deng, Piotr Dollar, Jianfeng Gao, Xiaodong He, Margaret Mitchell, John C., Platt C., Lawrence Zitnick, Geoffrey Zweig
From Captions to Visual Concepts and Back, arXiv, 2014
03/02/2015 12:00pm GDC 3.816 Thomas Kollar, Jayant Krishnamurthy, Grant Strimel
Toward Interactive Grounded Language Acquisition, Robotics: Science and Systems (RSS), 2013
02/23/2015 11:00am GDC 3.816 Subhashini Venugopalan, Huijuan Xu, Jeff Donahue, Marcus Rohrbach, Raymond Mooney, Kate Saenko
Translating Videos to Natural Language Using Deep Recurrent Neural Networks, arXiv, 2015
02/16/2015 12:00pm GDC 3.816 Jivko Sinapov, Connor Schenck, and Alexander Stoytchev
Learning relational object categories using behavioral exploration and multimodal perception, Robotics and Automation (ICRA), 2014
02/09/2015 11:00am GDC 3.816 Changsong Liu, Lanbo She, Rui Fang, Joyce Y. Chai
Probabilistic Labeling for Efficient Referential Grounding based on Collaborative Discourse, ACL 2014

Spring 2014

Back to Top
04/29/2014 10:00am GDC 3.816 Minh-Thang Luong, Michael C. Frank, and Mark Johnson
Parsing entire discourses as very long strings: Capturing topic continuity in grounded language learning, EMNLP 2013
04/15/2014 9:30am GDC 3.816 Calvin MacKenzie
Integrating Visual and Linguistic Information to Describe Properties of Objects, Undergraduate Thesis Project
04/01/2014 10:00am GDC 3.816 Pradipto Das, Chenliang Xu, Richard F. Doell, and Jason J. Corso
A Thousand Frames in Just a Few Words: Lingual Description of Videos through Latent Topics and Sparse Object Stitching, CVPR 2013
03/18/2014 10:00am GDC 3.816 Jayant Krishnamurthy, Thomas Kollar
Jointly Learning to Parse and Perceive: Connecting Natural Language to the Physical World, TACL 2013
02/18/2014 10:00am GDC 3.816 C. Lawrence Zitnick, Devi Parikh
Bringing Semantics Into Focus Using Visual Abstraction, CVPR 2013
02/04/2014 10:00am GDC 3.816 C. Lawrence Zitnick, Devi Parikh, Lucy Vanderwende
Learning the Visual Interpretation of Sentences, ICCV 2013
01/21/2014 10:00am GDC 3.816 Dieu-Thu Le, Jasper Uijlings, and Raffaella Bernardi
Exploiting language models for visual recognition, EMNLP 2013
01/10/2014 12:00pm GDC 3.816 Vicente Ordonez, Jia Deng, Yejin Choi, Alexander C. Berg, Tamara L. Berg
From Large Scale Image Categorization to Entry-Level Categories, ICCV 2013

Fall 2013

Back to Top
12/06/2013 12:00pm GDC 3.816 Elia Bruni, Gemma Boleda, Marco Baroni, Nam-Khan Tran
Distributional Semantics in Technicolor, ACL 2012
11/22/2013 12:00pm GDC 3.516 Carina Silberer, Vittorio Ferrari, Mirella Lapata
Models of Semantic Representation with Visual Attributes, ACL 2013
11/8/2013 12:00pm GDC 3.516 Andrew J. Anderson, Elia Bruni, Ulisse Bordignon, Massimo Poesio, and Marco Baroni
Of words, eyes and brains: Correlating image-based distributional semantic models with neural representations of concepts, EMNLP 2013
10/25/2013 12:00pm GDC 3.516 Haonan Yu and Jeffrey Mark Siskind
Grounded Language Learning from Video Described with Sentences, ACL 2013 (best paper award)

Spring 2013

Back to Top
4/23/2013 1PM GDC 3.416 Michaela Regneri, Marcus Rohrbach, Dominikus Wetzel, Stefan Thater, Bernt Schiele, Manfred Pinkal
Grounding Action Descriptions in Videos, TACL, 2013
4/9/2013 1PM GDC 3.416 Yoav Artzi and Luke Zettlemoyer
Weakly Supervised Learning of Semantic Parsers for Mapping Instructions to Actions, TACL, 2013
3/26/2013 1PM GDC 3.416 C. Liu, R. Fang, and J. Y. Chai
Towards Mediating Shared Perceptual Basis in Situated Dialogue, SIGDIAL Meeting on Discourse and Dialogue, 2012
3/5/2013 1PM GDC 3.416 Richard Socher, Milind Ganjoo, Hamsa Sridhar, Osbert Bastani, Christopher D. Manning, Andrew Y. Ng
Zero-Shot Learning Through Cross-Modal Transfer, arXiv, 2013.
2/19/2013 1PM GDC 3.516 Y. Jia, M. Salzmann, and T. Darrell
Learning Cross-modality Similarity for Multinomial Data , ICCV, 2011.
2/5/2013 1PM GDC 3.416 Barbu, A. and Bridge, A. and Burchill, Z. and Coroian, D. and Dickinson, S. and Fidler, S. and Michaux, A. and Mussman, S. and Narayanaswamy, S. and Salvi, D. and others.
Video In Sentences Out. , UAI, 2012.
1/22/2013 1PM ENS 32NEA Hannaneh Hajishirzi, Julia Hockenmaier, Erik T. Mueller, and Eyal Amir.
Reasoning about RoboCup Soccer Commentaries., UAI, 2011.

Fall 2012

Back to Top
12/04/2012 1PM ENS 32NEA Cynthia Matuszek, Evan Herbst, Luke Zettlemoyer, Dieter Fox.
Learning to Parse Natural Language Commands to a Robot Control System., ISER, 2012.
11/20/2012 1PM ENS 32NEA Yezhou Yang, Ching Lik Teo, Hal Daume III, and Yiannis Aloimonos
Corpus-Guided Sentence Generation of Natural Images, EMNLP 2011
11/06/2012 1PM ENS 32NEA Frank, M. C., Goodman, N. D., and Tenenbaum, J. B.
Using Speakers' Referential Intentions to Model Early Cross-Situational Word Learning, Psychological Science 20, 578-585, 2009.
10/23/2012 1PM ENS 32NEA Mark Johnson, Katherine Demuth, Michael Frank and Bevan Jones
Synergies in Learning Words and Their Referents, In Proceedings of NIPS 2010.
10/09/2012 1PM ENS 32NEA Mark Johnson, Katherine Demuth and Michael Frank
Exploiting Social Information In Grounded Language Learning Via Grammatical Reduction, In Proceedings of ACL 2012.
09/25/2012 1PM ENS 32NEA Polina Kuznetsova, Vicente Ordonez, Alexander Berg, Tamara Berg and Yejin Choi
Collective Generation of Natural Image Descriptions, In Association for Computational Linguistics (ACL), 2012.
09/11/2012 1PM ENS 32NEA C. Matuszek and N. FitzGerald and L. Zettlemoyer and L. Bo and D. Fox
A Joint Model of Language and Perception for Grounded Attribute Learning, In International Conference on Machine Learning (ICML), July 2012.

Spring 2012

Back to Top
04/03/2012 3PM ENS 32NEA Chao, C.; Cakmak, M.; Thomaz, A.L
Towards grounding concepts for transfer in goal learning from demonstration, Development and Learning (ICDL), 2011 IEEE International Conference on , vol.2, no., pp.1-6, 24-27 Aug. 2011
03/20/2012 3PM ENS 32NEA Tanvi Motwani and Raymond Mooney
Improving Video Activity Recognition using Object Recognition and Text Mining, Submitted to ECAI,2012
02/28/2012 3PM ENS 32NEA Chee Wee Leong and Rada Mihalcea
Going Beyond Text: A Hybrid Image-Text Approach for Measuring Word Relatedness, in Proceedings of the International Joint Conference on Natural Language Processing (IJCNLP 2011), Chiang Mai, Thailand, November 2011
02/14/2012 3PM ENS 32NEA K, Saenko and T. Darrell
Unsupervised Learning of Visual Sense Models for Polysemous Words, Proc. NIPS, December 2008, Vancouver, Canada

Fall 2011

Back to Top
12/1/2011 4PM ENS 32NEA Siming Li, Girish Kulkarni, Tamara Berg, Alex Berg and Yejin Choi
Composing Simple Image Descriptions using Web-scale N-grams, Computational Natural Language Learning (CoNLL), 2011
11/10/2011 4PM ENS 32NEA Girish Kulkarni, Visruth Premraj, Sagnik Dhar, Siming Li, Yejin Choi, Alexander C. Berg, and Tamara Berg
Baby Talk: Understanding and Generating Image Descriptions , Computer Vision and Pattern Recognition (CVPR), 2011
10/27/2011 4PM ENS 32NEA E. Bruni, G.B. Tran and M. Baroni
Distributional semantics from text and images , Proceedings of the EMNLP 2011 Geometrical Models for Natural Language Semantics (GEMS 2011) Workshop, East Stroudsburg PA: ACL, 22-32
10/13/2011 4PM ENS 32NEA Richard Socher, Cliff C. Lin, Andrew Y. Ng, and Christopher D. Manning
Parsing Natural Scenes and Natural Language with Recursive Neural Networks , ICML, 2011
09/29/2011 4PM ENS 32NEA Benjamin Borschinger, Bevan K. Jones and Mark Johnson
Reducing Grounded Learning Tasks To Grammatical Inference, EMNLP, 2011
09/01/2011 4PM ENS 32NEA Stefanie Tellex, Thomas Kollar, Steven Dickerson, Matthew R. Walter, Ashis Gopal Banerjee, Seth Teller, Nicholas Roy
Understanding Natural Language Commands for Robotic Navigation and Mobile Manipulation, AAAI, 2011

Spring 2011

Back to Top
5/12/2010 4PM ENS 32NEA S.R.K. Branavan, David Silver, and Regina Barzilay
Learning to Win by Reading Manuals in a Monte-Carlo Framework, ACL, 2011
4/28/2010 4PM ENS 32NEA Antoine Bordes, Nicolas Usunier, and Jason Weston
Label Ranking under Ambiguous Supervision for Learning Semantic Correspondences, ICML, 2010
4/14/2010 4PM ENS 32NEA Hilke Reckman, Jeff Orkin, and Deb Roy
Learning meanings of words and constructions, grounded in a virtual game, 10th Conference on Natural Language Processing (KONVENS), 2010
3/31/2010 4PM ENS 32NEA Michael C. Frank, Noah D. Goodman, and Joshua B. Tenenbaum
Using Speakers' Referential Intentions to Model Early Cross-Situational Word Learning, Psychological Science, 2009
3/3/2010 4PM ENS 32NEA K. Prabhakar, S. Oh, P. Wang, G. Abowd, and J. M. Rehg
Temporal Causality for the Analysis of Visual Events, CVPR 2010, Oral presentation
2/3/2010 4PM ENS 32NEA David L. Chen and Raymond J. Mooney
Learning to Interpret Natural Language Navigation Instructions from Observations, Under Submission
1/20/2010 4PM ENS 32NEA Dave Golland, Percy Liang, Dan Klein
A game-theoretic approach to generating spatial descriptions, EMNLP, 2010

Fall 2010

Back to Top
12/9/2010 3PM ACES 5.444 Yansong Feng and Mirella Lapata
How Many Words is a Picture Worth? Automatic Caption Generation for News Images, ACL, 2010
Tuesday, 11/23/2010 3PM ACES 5.444 T. Kollar, S. Tellex, D. Roy, and N. Roy
Toward Understanding Natural Language Directions, International Conference on Human-Robot Interaction, 2010
11/11/2010 3PM ACES 5.444 A Farhadi, M Hejrati, M A Sadeghi, P Young, C Rashtchian1, J Hockenmaier, D. A. Forsyth
Every Picture Tells a Story: Generating Sentences from Images, ECCV, 2010
10/28/2010 3PM ACES 5.444 Adam Vogel and Dan Jurafsky
Learning to Follow Navigational Directions, ACL, 2010

Spring 2010

Back to Top
5/26/2010 3PM ACES 3.116 Joo Hyun Kim, and Raymond J. Mooney
Generative Alignment and Semantic Parsing for Learning from Ambiguous Supervision, Under submission to Coling 2010
5/13/2010 4PM ACES 3.116 S. Qu and J. Y. Chai
Context-based Word Acquisition for Situated Dialogue in a Virtual World, JAIR, 2010
4/29/2010 4PM ACES 3.116 Stefanie Tellex and Deb Roy
Grounding Spatial Prepositions for Video Search, ICMI, 2009
4/15/2010 4PM ACES 3.116 Kristy Elizabeth Boyer, Robert Phillips, Amy Ingram, Eun Young Ha, Michael Wallis, Mladen Vouk, and James Lester
Characterizing the Effectiveness of Tutorial Dialogue with Hidden Markov Models, International Conference on Intelligent Tutoring Systems, 2010
3/11/2010 4PM ACES 3.116 Nobuyuki Shimizu, and Andrew Haas
Learning to Follow Navigational Route Instructions, IJCAI, 2009
2/25/2010 4PM ACES 3.116 Jeff Orkin and Deb Roy
Automatic Learning and Generation of Social Behavior from Collective Human Gameplay, AAMAS, 2009
2/11/2010 4PM ACES 3.116 Anton Leuski, and David Traum
A Statistical Approach for Text Processing in Virtual Humans, Army Science Conference, 2008
1/28/2010 4PM ACES 3.116 Nicholas Morsillo, Christopher Pal, and Randal Nelson
Semi-Supervised Learning of Visual Classifiers from Web Images and Text, IJCAI, 2009

Fall 2009

Back to Top
12/3/2009 11AM ACES 5.444 Donna Byron, Alexander Koller, Kristina Striegnitz, Justine Cassell, Robert Dale, Johanna Moore, and Jon Oberlander
Report on the First NLG Challenge on Generating Instructions in Virtual Environments (GIVE), ENLG, 2009
For more information about the challenge and the individual systems, visit the GIVE Challenge website
11/12/2009 11AM ACES 5.444 Benjamin Snyder, and Regina Barzilay
Database-Text Alignment via Structured Multilabel Classification, IJCAI, 2007
10/29/2009 11AM ACES 5.444 T. Regier, and L. Carlson
Grounding spatial language in perception: An empirical and computational investigation, Journal of Experimental Psychology, 2001
10/15/2009 11AM ACES 5.444 Patrick Buehler, Mark Everingham, and Andrew Zisserman
Learning sign language by watching TV (using weakly aligned subtitles), CVPR, 2009
10/1/2009 11AM ACES 5.444 Percy Liang, Michael I. Jordan, and Dan Klein
Learning semantic correspondences with less supervision, ACL-IJCNLP, 2009
9/17/2009 11AM ACES 5.444 Tessa Lau, Clemens Drews, and Jeffrey Nichols
Interpreting Written How-To Instructions, IJCAI, 2009
9/3/2009 11AM ACES 5.444 S.R.K. Branavan, Harr Chen, Luke S. Zettlemoyer, and Regina Barzilay
Reinforcement Learning for Mapping Instructions to Actions, ACL-IJCNLP, 2009

Spring 2009

Back to Top
5/15/2009 1PM ACES 5.444 Matt MacMahon, Brian Stankiewicz, and Benjamin Kuipers
Walk the Talk: Connecting Language, Knowledge, and Action in Route Instructions
AAAI, 2006
5/1/2009 1PM ACES 5.444 Sonal Gupta and Raymond J. Mooney
Using Closed Captions to Train Activity Recognizers that Improve Video Retrieval
Under submission to ACM Multimedia 2009
Workshop version
4/24/2009 1PM ACES 5.444 Ivan Laptev, Marcin Marszalek, Cordelia Schmid and Benjamin Rosenfeld
Learning realistic human actions from movies, CVPR, 2008
Note: Cordelia Schmid is giving a talk on Monday (4/27)
4/10/2009 1PM ACES 5.444 Kevin Gold, and Brian Scassellati
A Robot That Uses Existing Vocabulary to Infer Non-Visual Word Meanings from Observation, AAAI, 2007
3/6/2009 1PM ACES 5.444 A. Gupta and L. Davis
Beyond Nouns: Exploiting Prepositions and Comparative Adjectives for Learning Visual Classifiers, ECCV, 2008
2/20/2009 1PM ACES 5.444 Wesley Kerr, Paul R. Cohen, and Yu-Han Chang
Learning and Playing in Wubble World, AIIDE, 2008
2/6/2009 1PM ACES 5.444 T. Cour, C. Jordan, E. Miltsakaki, and B. Taskar
Movie/Script: Alignment and Parsing of Video and Text Transcription, ECCV, 2008

Fall 2008

Back to Top
11/13/2008 4PM ACES 5.444 Noboru Babaguchi, Yoshihiko Kawai, Tadahiro Kitahashi
Event Based Indexing of Broadcasted Sports Video by Intermodal Collaboration , IEEE Transactions on Multimedia, 2002
10/16/2008 4PM ACES 5.444 Michael Fleischman and Deb Roy
Grounded Language Modeling for Automatic Speech Recognition of Sports Video, ACL-08:HLT
10/02/2007 4PM ACES 5.444 Rama Bindiganavale, William Schuler, Jan M. Allbeck, Norman I. Badler, Aravind K. Joshi, Martha Palmer
Dynamically Altering Agent Behaviors Using Natural Language Insctructions, Conference on Autonomous Agents, 2000
09/04/2007 4PM ACES 3.116 Practice talk for Sonal
Sonal Gupta, Joohyun Kim, Kristen Grauman and Raymond Mooney
Watch, Listen & Learn: Co-training on Captioned Images and Videos , ECML, 2008

Summer 2008

Back to Top
05/30/2008 2PM ENS 32NEA J Wang, L Duan, L Xu, H Lu, JS Jin
TV Ad Video Categorization with Probabilistic Latent Concept Learning, Workshop on Multimedia Information Retrieval, 2007

Spring 2008

Back to Top
05/02/2008 10AM ENS 32NEA M.R. Everingham, J. Sivic, and A. Zisserman
"Hello! My name ... Buffy" - Automatic Naming of Characters in TV Video, BMVC, 2006
04/18/2008 10AM ENS 32NEA Fleischman, M. and Roy, D.
Situated Models of Meaning for Sports Video Retrieval, HLT/NAACL, 2007
04/04/2008 10AM ENS 32NEA Kobus Barnard, Pinar Duygulu, David Forsyth, Nando de Freitas, David M. Blei, Michael I. Jordan
Matching Words and Pictures, JMLR, 2003
03/07/2008 10AM ENS 32NEA M. Jamieson A. Fazly, S. Dickinson, S. Stevenson, S. Wachsmuth
Learning Structured Appearance Models from Captioned Images of Cluttered Scenes, IEEE International Conference on Computer Vision (ICCV), 2007
02/22/2008 *11AM ENS 32NEA Sonal Gupta, Joo Hyun Kim, and Raymond J. Mooney
Co-training with Images and Text Captions, Under submission to ICML 2008
02/08/2008 10AM ENS 32NEA David L. Chen and Raymond J. Mooney
Learning to Sportscast: A Test of Grounded Language Acquisition, Under submission to ICML 2008
01/25/2008 10AM ENS 32NEA K. Forbus, C. Riesbeck, L. Birnbaum, K. Livingston, A. Sharma, L. Ureel
Integrating Natural Language, Knowledge Representation and Reasonin\ g, and Analogical Processing to Learn by Reading, AAAI, 2007

Fall 2007

Back to Top
11/28/2007 1PM ACES 5.444 L. Zhu, Y. Chen, and A.L. Yuille
Unsupervised Learning of a Probabilistic Grammar for Object Detection and Parsing, NIPS, 2006
11/14/2007 1PM ACES 5.444 J. K.Aggarwal and Sangho Park
Human Motion: Modeling and Recognition of Actions and Interactions, 3D Data Processing, Visualization and Transmission, 2004
10/31/2007 1PM ACES 5.444 Jeffrey M. Siskind
A computational study of cross-situational techniques for learning word-to-meaning mappings, Cognition, 1996

10/17/2007 1PM ACES 5.444 Peter Gorniak, Deb Roy
Speaking with your Sidekick: Understanding Situated Speech in Computer Role Playing Games, AIIDE, 2005

10/3/2007 1PM ACES 5.444 Geert-Jan M. Kruijff, Hendrik Zender, Patric Jensfelt, and Henrik I. Christensen
Situated dialogue and spatial organization: What, where... and why?, International Journal of Advanced Robotic Systems, 4(2), 2007

09/19/2007 1PM ACES 5.444 Deb K. Roy, Alex P. Pentland
Learning words from sights and sounds: a computational model, Cognitive Science, 2002
09/05/2007 1PM ACES 5.444 Xiaojin Zhu, Andrew B. Goldberg, Mohamed Eldawy, Charles R. Dyer, Bradley Strock
A Text-to-Picture Synthesis System for Augmenting Communication, AAAI, 2007

Summer 2007

Back to Top
07/30/2007 4PM ACES 5.444 Chen Yu, Dana Ballard, Richard Aslin
The Role of Embodied Intention in Early Lexical Acquisition, Cognitive Science, 2005

07/16/2007 4PM ACES 5.444 Deb Roy and Ehud Reiter
Connecting language to the world, AIJ, Volume 167, Issues 1-2, September 2005, Pages 1-12
Kobus Barnard and Matthew Johnson
Word sense disambiguation with pictures, AIJ, Volume 167, Issues 1-2, September 2005, Pages 13-30
07/02/2007 3PM ACES 5.444 D. Marocco and S. Nolfi
Emergence of communication in embodied agents evolved for the ability to solve a collective navigation problem
Connection Science, 19(1):53-74, March 2007.
06/11/2007 3PM ACES 5.444 Sudheendra Narasimhan, Pushkala Iyer, and Kristen Grauman
Keywords to Visual Categories: Multiple-Instance Learning for Unsupervised Object Recognition, Under submission to NIPS, 2007
3PM ACES 5.444 Tamara L. Berg, Alexander C. Berg, Jaety Edwards, and D. A. Forsyth
Who's in the Picture, NIPS, 2004
Rohini K. Srihari and Debra T. Burhans
Visual Semantics: Extracting Visual Information from Text Accompanying Pictures, AAAI, 1994