Project to generate summary or caption for a given image using deep learning models.

darshandv/Image-Caption-and-Summary-Generation

Image Caption and Summary Generation

The project aims to generate a summary or caption for a given image.

There are two main parts to the project. The first is the image model, whose architecture outputs image features as non-linear activations of the image's pixel values.
These features are fed into the second part, the language model, which generates the summary sentence from the image model's output.
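
The data flow between the two parts can be sketched as below. This is only an illustration under stated assumptions: the README does not say how the language model decodes, so greedy decoding is used here as one common choice, and `generate_caption`, `toy_language_model`, and the `<start>`/`<end>` tokens are hypothetical names, not taken from this repository.

```python
from typing import Callable, Dict, List, Sequence

# Hypothetical sentence boundary tokens (assumption, not from the repository).
START, END = "<start>", "<end>"


def generate_caption(
    image_features: Sequence[float],
    next_word_scores: Callable[[Sequence[float], List[str]], Dict[str, float]],
    max_len: int = 20,
) -> str:
    """Greedy decoding: repeatedly append the highest-scoring next word."""
    words: List[str] = [START]
    for _ in range(max_len):
        # The language model sees the image features and the words so far.
        scores = next_word_scores(image_features, words)
        best = max(scores, key=scores.get)
        if best == END:
            break
        words.append(best)
    return " ".join(words[1:])  # drop the start token


def toy_language_model(features: Sequence[float], words: List[str]) -> Dict[str, float]:
    """Stand-in for a trained language model: it replays one fixed sentence
    and ignores the image features entirely."""
    sentence = ["a", "dog", "on", "grass", END]
    position = len(words) - 1  # number of words generated so far
    target = sentence[min(position, len(sentence) - 1)]
    return {word: (1.0 if word == target else 0.0) for word in sentence}
```

In a real setup, `image_features` would be the activations produced by the image model and `next_word_scores` would wrap the trained language model.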

The dataset used for the project is the MSCOCO dataset. You can download the training dataset by clicking here (please note that the dataset is around 19.3 GB).
You also need the corresponding captions, which can be downloaded by clicking here.
Both are downloaded from the official cocodataset website.

Extract both downloads before starting.

We have also used the Python API for the COCO dataset. Please install it by following the procedure for PythonAPI described here.
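
To get a feel for the caption data before installing anything, the annotation file can also be read with the standard library alone. The sketch below assumes the standard COCO captions JSON layout, a top-level `annotations` list whose entries carry `image_id` and `caption`. The function name `load_captions` and its path argument are hypothetical, and this is not how the project's scripts load the data.

```python
import json
from collections import defaultdict
from typing import Dict, List


def load_captions(annotation_path: str) -> Dict[int, List[str]]:
    """Group captions by image id from a COCO-style captions JSON file."""
    with open(annotation_path, encoding="utf-8") as f:
        data = json.load(f)

    captions = defaultdict(list)
    for ann in data["annotations"]:
        # Each image usually has several reference captions.
        captions[ann["image_id"]].append(ann["caption"].strip())
    return dict(captions)
```

Calling `load_captions` with the path to the extracted captions file returns a dictionary that maps each image id to its list of captions.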

Progress

To save training time, the activations from the image model are generated beforehand. This is possible because we use a pretrained model as the image model and do not modify its weights.

So after cloning the project, run:

python image_model_activations.py

This generates a file containing all the activations.
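
The idea of computing activations once and reusing them can be sketched as follows. This is a minimal illustration, not the contents of `image_model_activations.py`: the pickle format, the function names, and the `extract_features` callable are all assumptions, since the README does not describe the actual file layout.

```python
import pickle
from typing import Callable, Dict, Iterable, List, Sequence


def cache_activations(
    image_paths: Iterable[str],
    extract_features: Callable[[str], Sequence[float]],
    out_path: str,
) -> Dict[str, List[float]]:
    """Run the frozen image model once per image and save all activations."""
    # extract_features is a hypothetical wrapper around the pretrained image model.
    activations = {path: list(extract_features(path)) for path in image_paths}
    with open(out_path, "wb") as f:
        pickle.dump(activations, f)
    return activations


def load_activations(path: str) -> Dict[str, List[float]]:
    """Read the cached activations back, e.g. at the start of training."""
    with open(path, "rb") as f:
        return pickle.load(f)
```

Because the image model's weights never change, the cached values stay valid for every training run, so the expensive forward pass over the images happens only once.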

Then, to train the model, run:

python train.py

To generate captions, run:

python generate_caption.py

Some of the results from this model:

Image 1

Image 2

Image 3

Image 5

Image 6

Image 7

Image 8

Image 9
