[Bugfix][Quantization]Fix support for non quantized visual layers in otherwise quantized mllama model, including missing scaling factors #26152
Job | Run time |
---|---|
2m 33s | |
2m 16s | |
2m 15s | |
1m 39s | |
1m 33s | |
10m 16s |
Job | Run time |
---|---|
2m 33s | |
2m 16s | |
2m 15s | |
1m 39s | |
1m 33s | |
10m 16s |