Towards Bilingual Lexicon Discovery From Visually Grounded Speech Audio
Published in Interspeech, 2019
Recommended citation: Azuh, Emmanuel, David Harwath, and James R. Glass. "Towards Bilingual Lexicon Discovery From Visually Grounded Speech Audio." INTERSPEECH. 2019. http://groups.csail.mit.edu/sls/publications/2019/EmmanuelAzuh_Interspeech-2019.PDF