This is a Keras implementation of the "Look, Listen and Learn" model, based on the research by R. Arandjelovic and A. Zisserman at DeepMind. The model can extract cross-modal features between audio and images.
ddeeppnneett/LLLNet
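As a rough illustration of what "cross-modal" means here, the sketch below fuses an image embedding and an audio embedding and predicts whether the two correspond. It is a minimal NumPy sketch, not this repository's Keras code: the function names (`init_fusion_params`, `correspondence_probs`), the embedding size (512), the hidden size (128), and the random weights are all assumptions chosen for illustration, and the image and audio subnetworks that would produce the embeddings are replaced by random vectors.

```python
import numpy as np


def relu(x):
    """Element-wise rectified linear unit."""
    return np.maximum(x, 0.0)


def softmax(z):
    """Numerically stable softmax over the last axis."""
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)


def init_fusion_params(embed_dim=512, hidden_dim=128, seed=0):
    """Hypothetical helper: random weights for a two-layer fusion head."""
    rng = np.random.default_rng(seed)
    return {
        "w1": rng.normal(0.0, 0.01, size=(2 * embed_dim, hidden_dim)),
        "b1": np.zeros(hidden_dim),
        "w2": rng.normal(0.0, 0.01, size=(hidden_dim, 2)),
        "b2": np.zeros(2),
    }


def correspondence_probs(image_emb, audio_emb, params):
    """Concatenate both embeddings and return, per pair, a probability
    distribution over two classes (assumed: not-corresponding, corresponding)."""
    fused = np.concatenate([image_emb, audio_emb], axis=-1)
    hidden = relu(fused @ params["w1"] + params["b1"])
    return softmax(hidden @ params["w2"] + params["b2"])


# Stand-in embeddings for a batch of 4 image/audio pairs (random, for shape only).
rng = np.random.default_rng(1)
image_emb = rng.normal(size=(4, 512))
audio_emb = rng.normal(size=(4, 512))

params = init_fusion_params()
probs = correspondence_probs(image_emb, audio_emb, params)
print(probs.shape)
```

With trained weights, the activations feeding this head would serve as the image and audio features; with the random weights used here, the output probabilities carry no meaning beyond demonstrating the data flow.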
Languages
- Jupyter Notebook 86.7%
- Python 13.3%

