Multimodal-Product-Recommendation Cross Retrieval with Pinecone

Pinterest: Any-to-Image Product Recommendations with Cross-Modal Retrieval. You can search a collection of images using text, audio, or images as the query.

About

Code Structure

    ├── .github/workflows/
    │   └── push_to_s3.yml
    │
    ├── audio/
    │   └── (audio files)
    │
    ├── docs/
    │   └── app_info.md
    │
    ├── images/
    │   └── (sample images)
    │
    ├── data/
    │   └── fashion-cat.json
    │
    ├── doc/
    │   └── APP_README.md
    │
    ├── models/
    │   ├── __init__.py
    │   ├── data.py
    │   ├── helper.py
    │   ├── imagebind_model.py
    │   ├── model_utils.py
    │   ├── multimodal_preprocessors.py
    │   └── transformer.py
    │
    ├── Notebooks/
    │   ├── gemini-description-dataset.ipynb
    │   ├── imagebind-model-download.ipynb
    │   ├── model-inference.ipynb
    │   ├── pinecone-upsert-embeddings.ipynb
    │   └── synthetic-fashion-dataset-creation.ipynb
    │
    ├── sample-workflow/
    │   └── docker-image-build.yml
    │
    ├── .env
    ├── .gitignore
    ├── Dockerfile
    ├── gradio_app.py
    ├── README.md
    └── requirements.txt

TechStack

ImageBind Model
Generative AI
Vector Store: Pinecone
Image Embeddings
Any-to-Any Image Similarity Search
Vision Transformers
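
The idea behind any-to-image similarity search can be sketched in a few lines. Because ImageBind maps every modality into one shared embedding space, a query vector from text, audio, or an image can be compared directly against stored image vectors. The snippet below is only an illustration with NumPy and made-up 4-dimensional vectors; the function name `top_k_cosine` and the toy data are assumptions, and in this project the nearest-neighbour lookup is delegated to Pinecone rather than done in memory.

```python
import numpy as np

def top_k_cosine(query, gallery, k=3):
    """Return indices and cosine scores of the k gallery rows closest to query."""
    # Normalise so that a dot product equals cosine similarity.
    query = query / np.linalg.norm(query)
    gallery = gallery / np.linalg.norm(gallery, axis=1, keepdims=True)
    scores = gallery @ query
    # Sort by descending score and keep the first k positions.
    order = np.argsort(-scores)[:k]
    return order, scores[order]

# Toy "image" embeddings (hypothetical); real ImageBind vectors are much larger.
image_embeddings = np.array([
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.7, 0.7, 0.0, 0.0],
])

# A query embedding that could have come from text, audio, or another image.
query_embedding = np.array([0.9, 0.1, 0.0, 0.0])

indices, scores = top_k_cosine(query_embedding, image_embeddings, k=2)
print(indices, scores)
```

The first returned index is the stored image whose embedding points in nearly the same direction as the query, regardless of which modality produced the query.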

Data

The image descriptions dataset is available on Kaggle: https://www.kaggle.com/datasets/samikshakolhe/pinterest-fashion-dataset

Note: The image descriptions were created using the gemini-pro-vision model.

The image and text embeddings, created using the ImageBind model, can be downloaded as a pickle file from Kaggle: https://www.kaggle.com/datasets/samikshakolhe/pinterest-fashion-imagebind-multimodal-embed-data
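
As a rough sketch of how a downloaded embeddings pickle might be read and prepared for upserting, the snippet below loads a pickle and groups its vectors into batches of `{"id", "values"}` records, which is one of the record shapes Pinecone's upsert accepts. The layout of the real file is not documented here, so the dictionary of `id -> vector` used below, the file name `toy_embeddings.pkl`, and the helper `batched_records` are all assumptions for illustration; inspect the actual file before adapting this. Only unpickle files from sources you trust.

```python
import os
import pickle
import tempfile

def batched_records(embeddings, batch_size=100):
    """Yield lists of {"id", "values"} dicts, at most batch_size per list."""
    batch = []
    for item_id, vector in embeddings.items():
        batch.append({"id": str(item_id), "values": [float(x) for x in vector]})
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:  # emit the final, possibly smaller, batch
        yield batch

# Stand-in for the downloaded file (assumed layout): write a tiny pickle first.
toy = {"img_001": [0.1, 0.2], "img_002": [0.3, 0.4], "img_003": [0.5, 0.6]}
path = os.path.join(tempfile.mkdtemp(), "toy_embeddings.pkl")
with open(path, "wb") as f:
    pickle.dump(toy, f)

# Load the pickle back, as you would with the real embeddings file.
with open(path, "rb") as f:
    loaded = pickle.load(f)

batches = list(batched_records(loaded, batch_size=2))
print([len(b) for b in batches])
```

Each batch could then be passed to the index's upsert call; see the pinecone-upsert-embeddings.ipynb notebook for how this repository actually does it.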

Retrieval with Imagebind and Pinecone from scratch

Note: This code was built and tested on Python 3.8.19.

Create a virtual environment:

conda create --name multimodal python=3.8.19
conda activate multimodal

How to Run the App

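There are two ways to run the app: from a clone of the GitHub repository, or with the Dockerfile (see the next section).

To run from a clone of the GitHub repository, follow the steps below: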

git clone https://github.com/kolhesamiksha/Multimodal-Product-recommendation.git
cd Multimodal-Product-recommendation
pip install -r requirements.txt
python gradio_app.py -i <your-pinecone-index> -k <topk>
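
For readers wondering how the `-i` and `-k` flags are typically consumed, here is a minimal sketch using `argparse` from the standard library. It is not taken from `gradio_app.py`: the long option names `--index` and `--topk`, the default of 5, and the example index name `fashion-index` are assumptions made for illustration, and the real script may parse its arguments differently.

```python
import argparse

def parse_args(argv=None):
    """Parse the index name and the number of results to return."""
    parser = argparse.ArgumentParser(description="Launch the retrieval demo.")
    # -i: name of the Pinecone index to query (long name is hypothetical).
    parser.add_argument("-i", "--index", required=True, help="Pinecone index name")
    # -k: how many nearest images to return (default is an assumption).
    parser.add_argument("-k", "--topk", type=int, default=5, help="number of results")
    return parser.parse_args(argv)

# Equivalent to: python gradio_app.py -i fashion-index -k 3
args = parse_args(["-i", "fashion-index", "-k", "3"])
print(args.index, args.topk)
```

Passing `type=int` means `-k` arrives as an integer, so it can be handed straight to a top-k query without further conversion.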

How to Use Dockerfile

Build the Docker image and publish it to Docker Hub:
```
docker build -t multimodal-image . 
```

After publishing to Docker Hub, run the command below:
```
docker run multimodal-image
```

Sample Outputs with 3 modalities

Text to Image : Office coats and jackets
Image to Image: White Shirt
Audio to Image: Shoes in winters

For more info

Thank you for visiting the application. Connect with me on LinkedIn. I love writing blogs on ML/AI/GenAI, so do visit my blogs.
