b3181 #170

Nexesenex · 2024-06-18T20:15:22Z


* whisper : use ggml_backend_sched (wip)
* use sched in whisper_allocr
* whisper : single backend in whisper_context
* whisper : remove whisper_state->backends_used
* whisper : remove whisper_context->backend
* whisper : reset scheduler after init
* whisper : fix external encoder (e.g. CoreML)
* whisper : cleanup
* whisper : handle null GPU buffer types + fix sycl

---------

Co-authored-by: slaren <[email protected]>

Signed-off-by: thxCode <[email protected]>

On hosts that are not set up to run CUDA code, llama.cpp can still be compiled with CUDA support by installing only the development packages. What such hosts lack are the runtime libraries, such as /usr/lib64/libcuda.so*, so the link step currently fails. The CUDA development environment provides for this situation: stub libraries for all the CUDA libraries are available in the $(CUDA_PATH)/lib64/stubs directory. Adding this directory to the end of the library search path changes nothing for environments that already work, but it allows llama.cpp to be compiled when the runtime libraries are not installed.
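The ordering argument can be illustrated with a small sketch. It is a simplified model, not the linker's actual algorithm and not the change made in this commit: it assumes a library is resolved by taking the first directory in the search path that contains the file. The helper `resolve_library` and the directory names `usr_lib64` and `cuda_lib64_stubs` are hypothetical stand-ins for the runtime and stubs directories.

```python
import os
import tempfile


def resolve_library(filename, search_dirs):
    """Return the first path in search_dirs that contains filename, else None.

    Simplified first-match model of a library search path (an assumption,
    not the real linker logic).
    """
    for directory in search_dirs:
        candidate = os.path.join(directory, filename)
        if os.path.isfile(candidate):
            return candidate
    return None


def demo():
    """Resolve libcuda.so with and without a runtime copy present."""
    with tempfile.TemporaryDirectory() as root:
        # Hypothetical stand-ins for /usr/lib64 and $(CUDA_PATH)/lib64/stubs.
        runtime_dir = os.path.join(root, "usr_lib64")
        stubs_dir = os.path.join(root, "cuda_lib64_stubs")
        os.makedirs(runtime_dir)
        os.makedirs(stubs_dir)

        # The stub library always ships with the development packages.
        open(os.path.join(stubs_dir, "libcuda.so"), "w").close()

        # The stubs directory goes at the END of the search path.
        search_path = [runtime_dir, stubs_dir]

        # Build-only host: no runtime library, so the stub is the fallback.
        without_runtime = resolve_library("libcuda.so", search_path)

        # CUDA-capable host: the runtime library is found first, as before.
        open(os.path.join(runtime_dir, "libcuda.so"), "w").close()
        with_runtime = resolve_library("libcuda.so", search_path)

        return (
            os.path.basename(os.path.dirname(without_runtime)),
            os.path.basename(os.path.dirname(with_runtime)),
        )


if __name__ == "__main__":
    print(demo())
```

Under this model, the stub is picked only when no earlier directory provides the library, which is why appending the stubs directory should leave working environments unaffected.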

* Only use FIM middle if it exists

* Random test: add_bos_token, add_eos_token
* Random test: add BPE models for testing
* Custom regex split fails with codepoint 0
* Fix falcon punctuation regex
* Refactor llm_tokenizer_bpe: move code to constructor
* Move 'add_special_bos/eos' logic to llm_tokenizer_bpe
* Move tokenizer flags to vocab structure.
* Default values for special_add_bos/eos
* Build vocab.special_tokens_cache using vocab token types
* Generalize 'jina-v2' per token attributes
* Fix unicode whitespaces (deepseek-coder, deepseek-llm)
* Skip missing byte tokens (falcon)
* Better unicode data generation
* Replace char32_t with uint32_t

ggerganov and others added 8 commits June 18, 2024 09:50

ggml : sync

5326bcc

readme : update UI list (#7943)

1193778

chore: clean useless beam search param (#7985)

b96f9af

Signed-off-by: thxCode <[email protected]>

Fix no gcc pragma on Windows (#7751)

84f6de1

Only use FIM middle token if it exists (#7648)

91c188d

* Only use FIM middle if it exists

Nexesenex merged commit 0136b3f into Nexesenex:skystream Jun 18, 2024
37 of 40 checks passed

github-actions bot added the testing, examples, python, server, ggml, and script labels on Jun 18, 2024
