pls refer to:
project kantv: https://github.com/kantv-ai/kantv (AI subtitle for English online-TV, on-device LLM inference, powered by whisper.cpp + FFmpeg)
project ggml-qnn: https://github.com/kantv-ai/ggml-qnn/discussions/2 (implementation of ggml-qnn in llama.cpp)