med-khalil-jabri/vessel-id

Explainable Vessel Re-identification

Overview

This project introduces an explainable vessel re-identification system that aims to make maritime vessel recognition more interpretable. It uses a Vision Transformer (ViT) model together with two kinds of visualization: pairwise heatmaps computed with the Attention Flow Approximation proposed in [1,4], and GradCam++ visualizations [2,5]. The ViT model performs on par with traditional CNN architectures, while its results are more intuitive to interpret and its explanations are more effective than those of conventional methods such as GradCam.
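To give a feel for how per-layer attention can be aggregated into a single token-level relevance map, here is a minimal sketch of attention rollout, a simpler and widely used aggregation scheme. It is shown only to illustrate the general idea and is not the Attention Flow Approximation of [1,4]; the function names and the assumption that each layer's attention has already been averaged over heads are hypothetical choices made for this example.

```python
def matmul(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def attention_rollout(layers):
    """Aggregate per-layer attention into one token-to-token relevance matrix.

    layers: list of n x n attention matrices (first layer first), each
    assumed to be averaged over heads with rows summing to 1.
    """
    n = len(layers[0])
    # Start from the identity: before any layer, each token only "sees" itself.
    rollout = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for attn in layers:
        # Mix in the identity to account for the residual connection.
        mixed = [[0.5 * attn[i][j] + (0.5 if i == j else 0.0) for j in range(n)]
                 for i in range(n)]
        # Renormalize each row so it remains a probability distribution.
        mixed = [[v / sum(row) for v in row] for row in mixed]
        # Propagate relevance from this layer back through earlier layers.
        rollout = matmul(mixed, rollout)
    return rollout


# Toy example: 2 tokens, 2 layers.
toy_layers = [
    [[0.5, 0.5], [0.5, 0.5]],  # layer 1 attends uniformly
    [[1.0, 0.0], [0.0, 1.0]],  # layer 2 attends only to itself
]
relevance = attention_rollout(toy_layers)
```

In a ViT, the row of the resulting matrix that belongs to the class token would typically be reshaped to the patch grid and upsampled to produce a heatmap.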

Model Architecture: The model is a ViT with 12 layers, 12 attention heads per layer, 16x16-pixel patches, and 768-dimensional embeddings.
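The arithmetic behind these hyperparameters can be sketched as follows. The 224x224 input resolution, the RGB input, and the single prepended class token are assumptions made for illustration; the README does not state them.

```python
def vit_shapes(image_size=224, patch_size=16, embed_dim=768, num_heads=12,
               channels=3):
    """Derive token counts and per-head width for a ViT configuration.

    image_size and channels are assumed values, not taken from the project.
    """
    assert image_size % patch_size == 0, "image must tile evenly into patches"
    assert embed_dim % num_heads == 0, "embedding must split evenly over heads"
    patches_per_side = image_size // patch_size
    num_patches = patches_per_side ** 2
    return {
        "num_patches": num_patches,
        # One extra position for the class token (assumed).
        "seq_len": num_patches + 1,
        # Each head works on an equal slice of the embedding.
        "head_dim": embed_dim // num_heads,
        # Length of one flattened patch before the linear projection.
        "patch_vector_len": patch_size * patch_size * channels,
    }


shapes = vit_shapes()
```

Under these assumptions each image becomes 196 patch tokens plus one class token, and each of the 12 heads operates on a 64-dimensional slice of the embedding.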

Quantitative Results

The model is evaluated on two different test sets:

  • The first test set contains unseen images for vessels present in the training set.
  • The second test set contains images of completely unseen vessels.
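One way to build two such test sets is sketched below. This is an illustrative procedure, not the project's actual data pipeline: the function name, the `(image_id, vessel_id)` sample format, and the split fractions are all hypothetical.

```python
import random
from collections import defaultdict


def split_seen_unseen(samples, unseen_fraction=0.2, seen_test_fraction=0.2,
                      seed=0):
    """Split (image_id, vessel_id) pairs into train, seen-test, unseen-test.

    Unseen-test vessels never appear in train; seen-test images are held-out
    images of vessels that do appear in train.
    """
    rng = random.Random(seed)
    by_vessel = defaultdict(list)
    for image_id, vessel_id in samples:
        by_vessel[vessel_id].append(image_id)

    # Hold out whole vessels for the "unseen" test set.
    vessel_ids = sorted(by_vessel)
    rng.shuffle(vessel_ids)
    n_unseen = max(1, int(len(vessel_ids) * unseen_fraction))
    unseen_ids = set(vessel_ids[:n_unseen])

    train, seen_test, unseen_test = [], [], []
    for vessel_id in sorted(by_vessel):
        images = sorted(by_vessel[vessel_id])
        if vessel_id in unseen_ids:
            unseen_test += [(img, vessel_id) for img in images]
            continue
        # For remaining vessels, hold out a fraction of images per vessel.
        rng.shuffle(images)
        n_test = int(len(images) * seen_test_fraction)
        seen_test += [(img, vessel_id) for img in images[:n_test]]
        train += [(img, vessel_id) for img in images[n_test:]]
    return train, seen_test, unseen_test


# Toy data: 10 vessels with 5 images each.
samples = [(f"v{v}_img{i}", v) for v in range(10) for i in range(5)]
train, seen_test, unseen_test = split_seen_unseen(samples)
```

Splitting by vessel identity first is what guarantees that the "unseen" set measures generalization to new vessels rather than to new images of known ones.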

The table below summarizes the model's performance on both sets.

[Image: Seen Vessel]

Qualitative Results

Below are pairwise heatmaps for selected vessel images alongside the top three images retrieved by the model for each. Correct matches are highlighted in green, and incorrect ones in red.
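The retrieval step behind such a figure can be sketched as ranking gallery images by similarity to the query embedding and labeling the top three. The use of cosine similarity, the gallery tuple format, and the function names are assumptions for illustration; the README does not specify the similarity measure.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def top_k_matches(query_vec, query_vessel_id, gallery, k=3):
    """Return the k most similar gallery images with a display color.

    gallery: list of (image_name, vessel_id, embedding) tuples.
    A match is "green" if it shows the same vessel as the query, else "red".
    """
    ranked = sorted(gallery, key=lambda item: cosine(query_vec, item[2]),
                    reverse=True)
    return [(name, "green" if vessel_id == query_vessel_id else "red")
            for name, vessel_id, _ in ranked[:k]]


# Toy gallery with 2-dimensional embeddings.
gallery = [
    ("a.jpg", "A", [1.0, 0.0]),
    ("b.jpg", "B", [0.0, 1.0]),
    ("c.jpg", "A", [0.9, 0.1]),
    ("d.jpg", "C", [0.5, 0.5]),
]
matches = top_k_matches([1.0, 0.0], "A", gallery)
```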

"Seen" Test Set

Examples of correctly retrieved images:

[Image: Seen Vessel]

In the examples below, the model focuses on the relevant parts, but the differences between the vessels are very subtle.

[Image: Seen Vessel]

Again, the model focuses on the most distinctive parts of the vessels.

[Image: Seen Vessel]

In some instances, the model is distracted by the background.

[Image: Seen Vessel]

Here, the model focuses on the containers instead of the vessel itself, which leads to wrong predictions.

[Image: Seen Vessel]

"Unseen" Test Set

In the example below, the model focuses most on the ship's chimney.

[Image: Unseen Vessel 1]

Below are other examples in which the model focuses on distinctive parts of the vessels.

[Image: Unseen Vessel 1]

In the following examples, the model focuses on the relevant parts but fails because different vessels closely resemble each other.

[Image: Unseen Vessel 1]

Again, the model can be distracted by the background or the containers.

[Image: Unseen Vessel 1]

Comparison with GradCam++

Below are some examples of the visualizations produced by GradCam++.

[Image: Comparison Image 1]

[Image: Comparison Image 1]

References
