Attn #138

AnFreTh · 2024-09-28T21:10:25Z

MambAttn Class: Introduced a new model class MambAttn that alternates between Mamba blocks and attention layers, providing a flexible architecture for various deep learning tasks. (mambular/arch_utils/mambattn_arch.py)
ConvRNN Class: Added the ConvRNN class that combines convolutional layers with RNN layers, supporting various RNN types (RNN, LSTM, GRU) and optional residual connections. (mambular/arch_utils/rnn_utils.py)
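
As a rough illustration of the alternating layout described for MambAttn, the following is a minimal sketch of how such a layer schedule could be built. The function name `build_layer_schedule` and the parameter `mamba_per_attn` are hypothetical and are not taken from mambular/arch_utils/mambattn_arch.py; the sketch only produces the order of layer kinds, not the layers themselves.

```python
from typing import List


def build_layer_schedule(n_blocks: int, mamba_per_attn: int = 1) -> List[str]:
    """Return the order of layer kinds for an alternating Mamba/attention stack.

    Hypothetical helper: each of the n_blocks repetitions consists of
    mamba_per_attn Mamba blocks followed by one attention layer.
    """
    if n_blocks < 0 or mamba_per_attn < 1:
        raise ValueError("n_blocks must be >= 0 and mamba_per_attn must be >= 1")

    schedule: List[str] = []
    for _ in range(n_blocks):
        # A run of Mamba blocks, then a single attention layer.
        schedule.extend(["mamba"] * mamba_per_attn)
        schedule.append("attn")
    return schedule
```

In a real model, each entry in the returned list would be mapped to the corresponding module when the network is constructed.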

MambAttention Model: Implemented the MambAttention model that leverages the MambAttn architecture, with support for various normalization techniques and pooling methods. (mambular/base_models/mambattn.py)
Model Registration: Registered the MambAttn model in the __init__.py of base_models to ensure it's accessible within the module. (mambular/base_models/__init__.py)

Early Pruning and Optimizer Configuration: Enhanced lightning_wrapper.py to support early pruning based on validation loss and dynamic optimizer configuration, allowing for more flexible and efficient training. (mambular/base_models/lightning_wrapper.py)
Automatic Bayesian HPO: Added automatic Bayesian hyperparameter optimization for all models, using a config mapper that detects hyperparameter ranges automatically.
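
To make the two ideas above more concrete, here is a minimal sketch under stated assumptions. `ToyConfig`, `detect_ranges`, and `should_prune` are hypothetical names chosen for illustration, not the functions used in this PR, and the range heuristics (halving and doubling integers, scaling floats by a factor of ten) are assumptions. In practice, ranges like these would be handed to a Bayesian optimizer such as scikit-optimize's `gp_minimize`, which the commits mention.

```python
from dataclasses import dataclass, fields
from typing import Dict


@dataclass
class ToyConfig:
    """Hypothetical model config; not a mambular class."""
    lr: float = 1e-3
    n_layers: int = 4
    use_residual: bool = False


def detect_ranges(config) -> Dict[str, tuple]:
    """Derive a search range for each config field from its type and default."""
    ranges: Dict[str, tuple] = {}
    for f in fields(config):
        value = getattr(config, f.name)
        if isinstance(value, bool):
            # Check bool first: bool is a subclass of int in Python.
            ranges[f.name] = ("categorical", [True, False])
        elif isinstance(value, int):
            ranges[f.name] = ("integer", max(1, value // 2), value * 2)
        elif isinstance(value, float):
            ranges[f.name] = ("real", value / 10, value * 10)
    return ranges


def should_prune(val_loss: float, best_val_loss: float,
                 epoch: int, prune_epoch: int = 5) -> bool:
    """Prune a trial that is still worse than the best one after prune_epoch epochs."""
    return epoch >= prune_epoch and val_loss > best_val_loss
```

Checking the validation loss only from a fixed epoch onward gives each trial a short warm-up before it can be discarded, which avoids pruning configurations that merely start slowly.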

AnFreTh added 18 commits September 4, 2024 17:54

include MambAttn

9998d38

adapt mabattn config

de0886e

pruning for hpo

d39c056

include config mapper for hpo

c1467bd

include gp_minimize in sklearn base regressor

85adc31

fix bug in TabTransformer embedding_layer

6a6a51b

mlp basemodel convenience fix

29811d6

resnet convenience fix

1cb93c5

config convenience fix

4ef4b5d

add hpo to classifier and lss

797b624

minor pooling error in mambular

5b05664

add conv layer to rnn for positional invariance

14663e0

add convrnn to base class

4bd83e9

adjust config of RNN

b3327a1

include optimizer args in taskmodel

3d76805

adapt sklearn classes to allow optimizer kwargs

81ae17f

adjust default optimizer

bce26cb

include skopt in requirements

28d56b6

AnFreTh merged commit ed5a0f3 into develop Sep 28, 2024

AnFreTh deleted the attn branch November 5, 2024 14:58
