Menu
![]() Our model will treat CNN as the ‘image model’ and the RNN/LSTM as the ‘language model’ to encode the text sequences of varying length. Here our encoder model will combine both the encoded form of the image and the encoded form of the text caption and feed to the decoder. We will tackle this problem using an Encoder-Decoder model. Implementing the Image Caption Generator in Keras. ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |