Add GroupQueryAttention with KV-Cache #3425
Changes from 78 commits
Codecov / codecov/patch: check warning on line 77 in src/onnx/parse_group_query_attention.cpp