Home / Deep Vision / Image Captioning > Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models Image Captioning > Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models arXiv:1411.2539. Package GitHub Back to Deep Vision