Question: Make a neural network that can do image captioning. Network should take in an image and produce a string of text describing what is found
Make a neural network that can do image captioning. Network should take in an image and produce a string of text describing what is found in the image. Want the neural network to:
1. Use Convolutional Neural Networks (CNNs) to capture features from the images and condense it down to a vector of numbers that 2. a LSTM will use as its input, generating the caption in an auto-regressive manner
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
