Towards Bilingual Lexicon Discovery From Visually Grounded Speech Audio

Published in Interspeech, 2019

Recommended citation: Azuh, Emmanuel, David Harwath, and James R. Glass. "Towards Bilingual Lexicon Discovery From Visually Grounded Speech Audio." INTERSPEECH. 2019. http://groups.csail.mit.edu/sls/publications/2019/EmmanuelAzuh_Interspeech-2019.PDF