Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Moonshine #34784

Merged
merged 93 commits into from
Jan 10, 2025
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
Show all changes
93 commits
Select commit Hold shift + click to select a range
35434da
config draft
eustlb Nov 18, 2024
7e18038
full encoder forward
eustlb Dec 2, 2024
6517251
full decoder forward
eustlb Dec 3, 2024
b0efed1
fix sdpa and FA2
eustlb Dec 12, 2024
b4d18f9
fix sdpa and FA2
eustlb Dec 12, 2024
b3777e0
moonshine model
eustlb Dec 13, 2024
e313ab5
moonshine model forward
eustlb Dec 13, 2024
7a6935a
fix attention with past_key_values
eustlb Dec 13, 2024
8fda426
add MoonshineForConditionalGeneration
eustlb Dec 13, 2024
d0ed917
fix cache handling and causality for cross attention
eustlb Dec 13, 2024
461f210
no causal attention mask for the encoder
eustlb Dec 15, 2024
22dbaae
model addition (imports etc)
eustlb Dec 15, 2024
72ba8c4
small nit
eustlb Dec 15, 2024
f548504
nits
eustlb Dec 16, 2024
3d52b1e
Update src/transformers/models/moonshine/convert_usefulsensors_to_hf.py
eustlb Dec 16, 2024
8f82a40
add rope_theta
eustlb Dec 16, 2024
fc73b37
nits
eustlb Dec 16, 2024
aedccf5
model doc
eustlb Dec 16, 2024
0954133
Update src/transformers/models/auto/configuration_auto.py
eustlb Dec 17, 2024
b1f0909
imports
eustlb Dec 17, 2024
f647a9f
add MODEL_FOR_SPEECH_SEQ_2_SEQ_MAPPING_NAMES
eustlb Dec 17, 2024
2da255d
updates modular
eustlb Dec 17, 2024
407cd3e
make
eustlb Dec 17, 2024
b3da73d
make fix-copies
eustlb Dec 17, 2024
8cf195d
ruff check examples fix
eustlb Dec 17, 2024
f78db3d
fix check_modular_conversion
eustlb Dec 17, 2024
5927846
Merge branch 'main' into add-moonshine
eustlb Dec 17, 2024
b680f0d
nit
eustlb Dec 17, 2024
01a4ea1
nits
eustlb Dec 17, 2024
5124cbd
nits
eustlb Dec 17, 2024
a528bd3
copied from -> imports
eustlb Dec 18, 2024
5648c38
Merge branch 'main' into add-moonshine
eustlb Dec 20, 2024
338c7c0
imports fix
eustlb Dec 20, 2024
2ec366a
integrate attention refacto
eustlb Dec 20, 2024
5261886
modular edge case
eustlb Dec 20, 2024
467b8e4
remove encoder
eustlb Jan 2, 2025
445dfcb
convolutions params in config
eustlb Jan 2, 2025
a656e8c
run modular_model_converter
eustlb Jan 2, 2025
6547634
make
eustlb Jan 2, 2025
9eec03b
Update docs/source/en/model_doc/moonshine.md
eustlb Jan 2, 2025
53b9b9d
MoonshineModelTest
eustlb Jan 2, 2025
fb2258f
correct typo
eustlb Jan 7, 2025
259c6b3
make style
eustlb Jan 7, 2025
59dd57f
integration tests
eustlb Jan 7, 2025
8fe2cdc
Merge branch 'main' into add-moonshine
eustlb Jan 7, 2025
02c4343
make
eustlb Jan 7, 2025
6c7dc79
modular convert
eustlb Jan 7, 2025
89904f9
Merge branch 'main' into add-moonshine
eustlb Jan 8, 2025
829a2b8
name conversion update (up_proj -> fc1 etc)
eustlb Jan 9, 2025
02a1f1f
update config
eustlb Jan 9, 2025
80c39cd
update MLP
eustlb Jan 9, 2025
55ac7e2
update attention
eustlb Jan 9, 2025
4d540af
update encoder layer
eustlb Jan 9, 2025
4f52158
update decoder layer
eustlb Jan 9, 2025
6568576
update convolutions parameters
eustlb Jan 9, 2025
495bd73
update encoder
eustlb Jan 9, 2025
48492c2
remove INPUTS_DOCSTRING
eustlb Jan 9, 2025
21a743f
update decoder
eustlb Jan 9, 2025
3b7af33
update conditional generation
eustlb Jan 9, 2025
563a360
update pretrained model
eustlb Jan 9, 2025
0ebd9a5
imports
eustlb Jan 9, 2025
d0233a7
modular converted
eustlb Jan 9, 2025
f9a6cac
update doc
eustlb Jan 9, 2025
b8a6e3b
fix
eustlb Jan 9, 2025
bfea59a
typo
eustlb Jan 9, 2025
ea150d3
update doc
eustlb Jan 9, 2025
8170cc8
update license
eustlb Jan 9, 2025
9bc7b35
update init
eustlb Jan 9, 2025
2bbb8ee
split config in file
eustlb Jan 9, 2025
b42d7f6
two classes for MLP
eustlb Jan 9, 2025
08272b9
attention from GLM
eustlb Jan 9, 2025
61462db
from GlmRotaryEmbedding
eustlb Jan 9, 2025
c866e58
split MLP
eustlb Jan 9, 2025
b0183be
apply arthur's review suggestions
eustlb Jan 9, 2025
3aa4f8d
apply arthur's review suggestions
eustlb Jan 9, 2025
d64190c
apply arthur's review suggestions
eustlb Jan 9, 2025
0bfb6bc
auto feature extractor
eustlb Jan 9, 2025
1268e13
convert modular
eustlb Jan 9, 2025
141a70b
fix + make
eustlb Jan 9, 2025
70f16dc
Merge branch 'main' into add-moonshine
eustlb Jan 9, 2025
1f3cc63
convert modular
eustlb Jan 9, 2025
77ca19a
make
eustlb Jan 9, 2025
e6b19db
unsplit config
eustlb Jan 9, 2025
7155fbd
use correct checkpoint
eustlb Jan 9, 2025
15b2552
wrap generate
eustlb Jan 9, 2025
89ceca8
Merge branch 'main' into add-moonshine
eustlb Jan 9, 2025
fc4febc
update tests
eustlb Jan 10, 2025
3e2a84b
typos
eustlb Jan 10, 2025
9a8f91d
make
eustlb Jan 10, 2025
834c364
typo
eustlb Jan 10, 2025
b491d95
update doc
eustlb Jan 10, 2025
d357fff
Merge branch 'main' into add-moonshine
eustlb Jan 10, 2025
2a6c59e
Merge branch 'main' into add-moonshine
eustlb Jan 10, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
make
  • Loading branch information
eustlb committed Jan 10, 2025
commit 9a8f91d95504252c08766214dcbe684894e60110
2 changes: 1 addition & 1 deletion docs/source/en/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -234,8 +234,8 @@ Flax), PyTorch, and/or TensorFlow.
| [MobileNetV2](model_doc/mobilenet_v2) | ✅ | ❌ | ❌ |
| [MobileViT](model_doc/mobilevit) | ✅ | ✅ | ❌ |
| [MobileViTV2](model_doc/mobilevitv2) | ✅ | ❌ | ❌ |
| [Moonshine](model_doc/moonshine) | ✅ | ❌ | ❌ |
| [ModernBERT](model_doc/modernbert) | ✅ | ❌ | ❌ |
| [Moonshine](model_doc/moonshine) | ✅ | ❌ | ❌ |
| [Moshi](model_doc/moshi) | ✅ | ❌ | ❌ |
| [MPNet](model_doc/mpnet) | ✅ | ✅ | ❌ |
| [MPT](model_doc/mpt) | ✅ | ❌ | ❌ |
Expand Down
16 changes: 8 additions & 8 deletions src/transformers/utils/dummy_pt_objects.py
Original file line number Diff line number Diff line change
Expand Up @@ -6488,56 +6488,56 @@ def __init__(self, *args, **kwargs):
requires_backends(self, ["torch"])


class MoonshineForConditionalGeneration(metaclass=DummyObject):
class ModernBertForMaskedLM(metaclass=DummyObject):
_backends = ["torch"]

def __init__(self, *args, **kwargs):
requires_backends(self, ["torch"])


class ModernBertForMaskedLM(metaclass=DummyObject):
class ModernBertForSequenceClassification(metaclass=DummyObject):
_backends = ["torch"]

def __init__(self, *args, **kwargs):
requires_backends(self, ["torch"])


class MoonshineModel(metaclass=DummyObject):
class ModernBertForTokenClassification(metaclass=DummyObject):
_backends = ["torch"]

def __init__(self, *args, **kwargs):
requires_backends(self, ["torch"])


class ModernBertForSequenceClassification(metaclass=DummyObject):
class ModernBertModel(metaclass=DummyObject):
_backends = ["torch"]

def __init__(self, *args, **kwargs):
requires_backends(self, ["torch"])


class MoonshinePreTrainedModel(metaclass=DummyObject):
class ModernBertPreTrainedModel(metaclass=DummyObject):
_backends = ["torch"]

def __init__(self, *args, **kwargs):
requires_backends(self, ["torch"])


class ModernBertForTokenClassification(metaclass=DummyObject):
class MoonshineForConditionalGeneration(metaclass=DummyObject):
_backends = ["torch"]

def __init__(self, *args, **kwargs):
requires_backends(self, ["torch"])


class ModernBertModel(metaclass=DummyObject):
class MoonshineModel(metaclass=DummyObject):
_backends = ["torch"]

def __init__(self, *args, **kwargs):
requires_backends(self, ["torch"])


class ModernBertPreTrainedModel(metaclass=DummyObject):
class MoonshinePreTrainedModel(metaclass=DummyObject):
_backends = ["torch"]

def __init__(self, *args, **kwargs):
Expand Down
Loading