23 Nov 2022 |
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language. Zeng et al. 2022 |
Dan |
12 Oct 2022 |
Bidirectional Language Models Are Also Few-shot Learners. Patel et al. 2022 |
Wenyan |
21 Sep 2022 |
DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon. Algayres et al. 2022 |
Ramon |
24 Nov 2021 |
Does language help generalization in vision models? Devillers et al. 2021 |
Dan |
27 Oct 2021 |
Pre-Training a Language Model Without Human Language. Chiang and Lee 2020 |
Ramon |
29 Sep 2021 |
Towards General Purpose Vision Systems. Gupta et al. 2021 |
Rita |
15 Sep 2021 |
Multimodal Few-Shot Learning with Frozen Language Models. Tsimpoukelli et al. 2021 |
Emanuele |
16 Jun 2021 |
Grounding ‘Grounding’ in NLP. Chandu et al. 2021. |
Ramon |
26 May 2021 |
Episodic Transformer for Vision-and-Language Navigation. Pashevich et al. 2021. |
Dylan |
21 Apr 2021 |
Unifying Vision-and-Language Tasks via Text Generation. Cho et al. 2021. |
Rita |
7 Apr 2021 |
How Many Data Points is a Prompt Worth?. Le Scao and Rush 2021. |
Erkut |
24 Mar 2021 |
Learning Transferable Visual Models From Natural Language Supervision. Radford et al. 2021. |
Aykut |
24 Feb 2021 |
Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models. Cao et al. 2020. |
Des |
10 Feb 2021 |
UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning. Li et al. 2020. |
Emanuele |
27 Jan 2021 |
VinVL: Making Visual Representations Matter in Vision-Language Models. Zhang et al. 2021. |
Semih |
13 Jan 2021 |
Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration. Puig et al. 2020. |
Ramon |
25 Nov 2020 |
Language Grounds Experience. Bisk et al. 2020. |
Semih |
28 Oct 2020 |
Vokenization: Improving Language Understanding with Contextualized, Visual-Grounded Supervision. Tan and Bansal 2020. |
Des |
14 Oct 2020 |
Grounded Language Learning Fast and Slow. Hill et al. 2020. |
Łukasz |
30 Sep 2020 |
A Developmental Approach to Machine Learning? Smith and Slone 2017. |
Łukasz |
9 Sep 2020 |
Learning Visual Representations with Caption Annotations. Sariyildiz et al. 2020. |
??? |
26 Aug 2020 |
Probing Text Models for Common Ground with Visual Representations. Ilharco et al. 2020. |
Erkut |
15 Jul 2020 |
Learning to Learn Words from Visual Scenes. Surís et al. 2019. |
Aykut |
24 Jun 2020 |
The symbol grounding problem. Harnad 1990. |
Des |