take over video swin checkpoints #2448

innat · 2024-05-23T06:40:13Z

Keras-team

Could you please take over the Video Swin checkpoints and upload them to Kaggle so that they can be used on the Kaggle platform?

I have posted regarding weights. Add Video Swin Transformer #2369 (comment)
Weight verification. Add Video Swin Transformer #2369 (comment)
Also could you please answer this query? Add Video Swin Transformer #2369 (comment)

divyashreepathihalli · 2024-05-30T00:07:49Z

Hi Innat!! will do
Thanks!!

divyashreepathihalli · 2024-05-30T00:22:45Z

@innat the verification notebooks are not accessible. Can you please double-check the permissions?

what is the difference between the base and the backbone weights?
do you have weights for the video classifier task model that you added?
Also can you please use the keras_model.save_to_preset(checkpoint_name) to save to preset, this will generate a config, metadata and weights file that we can then upload to Kaggle
Can you also please add your checkpoint conversion file here - keras_cv/src/tools/checkpoint_conversion. We have added some in .ipynb format but this is a good example to follow - https://github.com/keras-team/keras-nlp/blob/master/tools/checkpoint_conversion/convert_pali_gemma_checkpoints.py

innat · 2024-05-30T07:49:26Z

@divyashreepathihalli

what is the difference between the base and the backbone weights?
do you have weights for the video classifier task model that you added?

Ignore everything else; you only need to look here.

- videoswin_tiny_kinetics400.weights.h5 <- for backbone
- videoswin_tiny_kinetics400_classifier.weights.h5 <- for classifier

- videoswin_small_kinetics400.weights.h5 <- for backbone
- videoswin_small_kinetics400_classifier.weights.h5 <- for classifier

- videoswin_base_kinetics400.weights.h5 <- for backbone
- videoswin_base_kinetics400_classifier.weights.h5 <- for classifier

Some other variations of this model. (also check this comment)

- videoswin_base_kinetics400_imagenet22k.weights.h5 <- for backbone
- videoswin_base_kinetics400_imagenet22k_classifier.weights.h5 <- for classifier

- videoswin_base_kinetics600_imagenet22k.weights.h5 <- for backbone
- videoswin_base_kinetics600_imagenet22k_classifier.weights.h5 <- for classifier

- videoswin_base_something_something_v2.weights.h5 <- for backbone
- videoswin_base_something_something_v2_classifier.weights.h5 <- for classifier

FYI, tiny, small, and base refer to the variants of the model. kinetics400 and kinetics600 refer to the Kinetics dataset with 400 and 600 classes. The _imagenet22k suffix refers to the pre-trained weights of the 2D Swin image model that were used to initialize the Video Swin model. You can also check the official repo for more clarification.
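The naming scheme described above can be sketched as a small parser. This is only an illustration of the convention in this comment: the `WeightFile` class and `parse_weight_filename` function are hypothetical names and not part of keras-cv.

```python
from dataclasses import dataclass

VARIANTS = ("tiny", "small", "base")
PREFIX = "videoswin_"
SUFFIX = ".weights.h5"


@dataclass(frozen=True)
class WeightFile:
    variant: str            # tiny, small, or base
    dataset: str            # e.g. kinetics400 or something_something_v2
    imagenet22k_init: bool  # True if initialized from the 2D Swin ImageNet-22K weights
    role: str               # "backbone" or "classifier"


def parse_weight_filename(name: str) -> WeightFile:
    """Split a file name from the list above into its parts (hypothetical helper)."""
    if not (name.startswith(PREFIX) and name.endswith(SUFFIX)):
        raise ValueError(f"unexpected file name: {name}")
    stem = name[len(PREFIX):-len(SUFFIX)]

    # A trailing "_classifier" marks the task model; otherwise it is the backbone.
    role = "backbone"
    if stem.endswith("_classifier"):
        role = "classifier"
        stem = stem[:-len("_classifier")]

    # A trailing "_imagenet22k" marks the 2D Swin initialization.
    imagenet22k_init = stem.endswith("_imagenet22k")
    if imagenet22k_init:
        stem = stem[:-len("_imagenet22k")]

    # What remains is "<variant>_<dataset>".
    variant, _, dataset = stem.partition("_")
    if variant not in VARIANTS or not dataset:
        raise ValueError(f"unexpected file name: {name}")
    return WeightFile(variant, dataset, imagenet22k_init, role)


info = parse_weight_filename(
    "videoswin_base_kinetics600_imagenet22k_classifier.weights.h5"
)
print(info)
```

A parser like this is mainly useful for generating preset names consistently from the weight files instead of typing them by hand.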

Also can you please use the keras_model.save_to_preset(checkpoint_name) to save to preset, this will generate a config, metadata and weights file that we can then upload to Kaggle

I see, new API. Let me check.

Can you also please add your checkpoint conversion file here - keras_cv/src/tools/checkpoint_conversion.

I have to clean up a lot of messy code. How about the following two files?

for backbone - keras-cv VS torch-vision
for classifier - keras-cv VS torch-vision
and more.

divyashreepathihalli · 2024-06-05T22:38:25Z

I will wait for your generated preset files. Also, I still cannot access the verification files.

innat · 2024-06-06T12:51:26Z

@divyashreepathihalli Do you have a Kaggle ID? If so, could you please share it?

divyashreepathihalli · 2024-06-07T16:45:35Z

@innat what do you mean by Kaggle ID?

are you talking about the Kaggle handle that needs to be added to the presets file?
or an authentication key that you need from Kaggle account?
or login id for a kaggle account? - If this is for uploading the preset, the team will have to be doing that for now.

innat · 2024-06-07T17:06:22Z

@divyashreepathihalli
Uh, sorry for the confusion. Actually, I prepared some Kaggle notebooks (currently private) that will help you evaluate or verify the model and weights. You can also run them out of the box in the Kaggle environment (plug-and-play). If you share your Kaggle ID, I can add you as a collaborator.

Here are the notebooks on Kaggle (currently private). I would rather not make these notebooks public (as I already did something similar before with my own code). The notebooks load keras-cv.video-swin from its latest release. After the takeover process (checking, verifying, and saving the presets or weights) is done on your side, I will remove these notebooks from my end. Hope it's clear now.

1. k400-logit-matching-torch-vs-keras-cv (classifier)
2. k400-logit-matching-torch-vs-keras-cv-backbone
3. k600-ssv2-logit-matching-torch-vs-keras-cv (classifier)
4. k600-ssv2-logit-matching-torch-vs-keras-cv-backbon
5. generate presets for keras-cv with save_to_preset

All of the above notebooks download the model weights with wget from here.

(Note: in notebooks 1 and 2, we load the official Video Swin model from torchvision (with their API), and in notebooks 3 and 4, we load the official Video Swin model from raw PyTorch code.)
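The notebooks themselves are private, so the following is not their code. It is a minimal sketch of what a logit-matching check usually amounts to, assuming both models were run on the same input and produced one logit per class. The function names, the tolerance, and the sample values are all hypothetical.

```python
def max_abs_diff(reference, candidate):
    """Largest element-wise absolute difference between two logit vectors."""
    if len(reference) != len(candidate):
        raise ValueError("logit vectors must have the same length")
    if not reference:
        raise ValueError("logit vectors must not be empty")
    return max(abs(r - c) for r, c in zip(reference, candidate))


def logits_match(reference, candidate, atol=1e-4):
    """True if every logit agrees within `atol` (an assumed tolerance)."""
    return max_abs_diff(reference, candidate) <= atol


# Made-up values standing in for the PyTorch and keras-cv outputs.
torch_logits = [0.1, 2.5, -1.0]
keras_logits = [0.10001, 2.49999, -1.0]
print(logits_match(torch_logits, keras_logits))
```

In practice the same comparison would be run on arrays with `numpy.allclose` or a similar routine; plain lists are used here only to keep the sketch dependency-free.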

divyashreepathihalli · 2024-06-10T18:29:18Z

Hi Innat!! here is my kaggle id - divyasss.

divyashreepathihalli · 2024-06-10T18:35:40Z

Also, here is the process for presets:

You would load your backbone with the necessary config and weights and then use your_backbone.save_to_preset('preset_name'). This will generate config.json, metadata.json, weights.h5, etc.
Similarly, you can load your task model with its weights and necessary config and call your_task_model.save_to_preset('preset_name'). This will generate config.json, metadata.json, weights.h5, etc.
You can add your presets file like this: https://github.com/keras-team/keras-cv/blob/master/keras_cv/src/models/segmentation/segment_anything/sam_presets.py
and update the Kaggle handle to be "kaggle_handle": "kaggle://keras/<model_name>/keras/<model_variation>/<version_number>",
You can upload this to your own Kaggle account to test it out and provide your own path for the Kaggle handle.
Once that is in, everything is good to go and I can upload the model weights to the Keras page.
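As a rough illustration of the handle format above, a presets entry could be assembled like this. The helper names, the `videoswin` model name, and the `description` metadata key are assumptions made for the example; the linked sam_presets.py is the reference for the fields keras-cv actually expects.

```python
def kaggle_handle(model_name, variation, version, owner="keras"):
    """Build a handle of the form kaggle://<owner>/<model_name>/keras/<variation>/<version>.

    `owner` defaults to "keras"; swap in your own Kaggle account while testing.
    """
    return f"kaggle://{owner}/{model_name}/keras/{variation}/{version}"


def preset_entry(model_name, variation, version, description, owner="keras"):
    """Return one presets-dict entry keyed by the variation name (hypothetical layout)."""
    return {
        variation: {
            "metadata": {"description": description},  # assumed minimal metadata
            "kaggle_handle": kaggle_handle(model_name, variation, version, owner),
        }
    }


entry = preset_entry(
    "videoswin",                   # assumed model name
    "videoswin_tiny_kinetics400",  # variation name taken from the weight files
    1,
    "Video Swin tiny backbone trained on Kinetics-400.",
)
print(entry["videoswin_tiny_kinetics400"]["kaggle_handle"])
```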

innat · 2024-06-18T08:12:53Z

@divyashreepathihalli

and update the kaggle handle to be "kaggle_handle": "kaggle://keras/<model_name>/keras/<model_variation>/<version_number>"

I would prefer not to do that because it would take a long time to upload all the weights to Kaggle. However, is it possible to test the weights with a local file path? Also, if I want to load the preset files manually, what are the loading APIs? For example, to load the .weights.h5 file, is that load_weights?

!ls video-swin-presets/videoswin_base_kinetics400
- config.json
- metadata.json
- model.weights.h5

def vswin_tiny():
    # Video Swin "tiny" backbone configuration.
    backbone = VideoSwinBackbone(
        input_shape=(32, 224, 224, 3),
        embed_dim=96,
        depths=[2, 2, 6, 2],
        num_heads=[3, 6, 12, 24],
        include_rescaling=False,
    )
    # Classifier for the 400 Kinetics classes, with no final activation.
    keras_model = VideoClassifier(
        backbone=backbone,
        num_classes=400,
        activation=None,
        pooling='avg',
    )

    # option 1: load the weights directly?
    keras_model.load_weights(
        'video-swin-presets/videoswin_tiny_kinetics400'
    )

    # option 2: or load the whole preset directory?
    keras_model.load_presets(
        'video-swin-presets/videoswin_tiny_kinetics400'
    )
    return keras_model
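Before trying either option, it can help to confirm that a local preset directory really contains the three files shown in the listing above. The following is a minimal sketch using only the standard library; `missing_preset_files` is a hypothetical helper, and it only checks file names, it does not load any weights.

```python
import tempfile
from pathlib import Path

# File names taken from the directory listing above.
EXPECTED_FILES = ("config.json", "metadata.json", "model.weights.h5")


def missing_preset_files(preset_dir):
    """Return the expected preset files that are absent from `preset_dir`."""
    root = Path(preset_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]


# Demo on a throwaway directory that lacks the weights file.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "config.json").write_text("{}")
    (Path(tmp) / "metadata.json").write_text("{}")
    print(missing_preset_files(tmp))
```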

divyashreepathihalli · 2024-07-23T18:19:06Z

Loading a preset is done with model.from_preset("preset_name or kaggle uri").

github-actions · 2024-08-07T01:54:41Z

This issue is stale because it has been open for 14 days with no activity. It will be closed if no further activity occurs. Thank you.

github-actions · 2024-08-22T01:55:55Z

This issue was closed because it has been inactive for 28 days. Please reopen if you'd like to work on this further.

github-actions bot assigned sachinprasadhs May 23, 2024

chunduriv added keras-team-review-pending labels May 23, 2024

divyashreepathihalli assigned divyashreepathihalli and unassigned sachinprasadhs May 30, 2024

divyashreepathihalli removed the keras-team-review-pending label Jun 6, 2024

sachinprasadhs added the stat:awaiting response from contributor label Jul 23, 2024

github-actions bot added the stale label Aug 7, 2024

github-actions bot closed this as completed Aug 22, 2024
