
feat: more embedded models, coqui fixes, add model usage and description #1556

Merged

mudler merged 13 commits into master from docs_updates_4 on Jan 7, 2024

Conversation


@mudler mudler commented Jan 6, 2024

Description

This PR adds whisper, embeddings, and TTS to the one-click install. I still need to test all of them, so it's still a WIP. It removes the hardcoded model gallery but, most importantly, it adds description and usage fields to the model YAMLs so they are displayed during model preloading. This allows, for instance, informing the user how a specific loaded model is meant to be used.
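As a sketch of the idea, the new fields might sit alongside the usual model config keys like this (a hypothetical fragment: the surrounding keys and values are illustrative, only the `description` and `usage` fields are what this PR adds):

```yaml
# Hypothetical model YAML sketch: `description` and `usage` are the new
# fields from this PR; the other keys and values are illustrative only.
name: bert-cpp
backend: bert-embeddings
parameters:
  model: bert-MiniLM-L6-v2q4_0.bin
description: |
  Sentence embedding model for computing text embeddings locally.
usage: |
  curl http://localhost:8080/embeddings -H "Content-Type: application/json" \
    -d '{"model": "bert-cpp", "input": "Your text here"}'
```

Both fields are free-form text, so a gallery entry can carry a human-readable summary and a copy-pasteable example together.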

Unrelated to the above, this PR also improves the UX and output when trying to connect to backend services. It also fixes #1549

Tested embedded models configs:

  • bert-cpp
  • rhasspy-tts
  • all-minilm-l6-v2
  • whisper-base
  • coqui
  • vall-e-x
  • rhasspy

Testing with, for example:

docker run -ti --rm -p 8080:8080 quay.io/go-skynet/local-ai:master-ffmpeg-core https://gist.githubusercontent.com/mudler/43fd618e6acf57f9b15467b0e60c1f3e/raw/edc8bc8ddd6344bd6e17747d99cc91e72147cc8b/bert-cpp.yaml

The usage example is printed out during startup.
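Once the container from the command above is running, the printed usage for an embeddings model can be exercised with a request like the following (a sketch: it assumes LocalAI's OpenAI-compatible `/embeddings` endpoint, and that the embedded config registers the model under the name `bert-cpp`):

```shell
# Query the embeddings endpoint of the container started above.
# Assumptions: LocalAI's OpenAI-compatible API on port 8080, and a model
# named "bert-cpp" loaded from the embedded config.
curl http://localhost:8080/embeddings \
  -H "Content-Type: application/json" \
  -d '{"model": "bert-cpp", "input": "A long time ago in a galaxy far, far away"}'
```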


netlify bot commented Jan 6, 2024

Deploy Preview for localai ready!

Name Link
🔨 Latest commit c4b2885
🔍 Latest deploy log https://app.netlify.com/sites/localai/deploys/659b1ed7c9439600086c446e
😎 Deploy Preview https://deploy-preview-1556--localai.netlify.app


mudler commented Jan 6, 2024

I'd like to have an example for each backend in the one-click setup section, and update the docs while at it, as several backends are missing from them entirely.

@mudler mudler changed the title feat: more embedded models, add model usage and description feat: more embedded models, coqui fixes, add model usage and description Jan 7, 2024

mudler commented Jan 7, 2024

I've tried coqui and added a fix for it in this PR (#1549). Will follow up with an e2e example as well.


mudler commented Jan 7, 2024

examples missing for:

  • stablediffusion
  • vllm
  • exllama(2)
  • autogptq

@mudler mudler merged commit e19d722 into master Jan 7, 2024
24 checks passed
@mudler mudler deleted the docs_updates_4 branch January 7, 2024 23:37
@mudler mudler added the enhancement New feature or request label Jan 8, 2024
truecharts-admin referenced this pull request in truecharts/public Jan 8, 2024
….0 by renovate (#17044)

This PR contains the following updates:

| Package | Update | Change |
|---|---|---|
| [docker.io/localai/localai](https://github.com/mudler/LocalAI) | minor | `v2.4.1-cublas-cuda11-ffmpeg-core` -> `v2.5.0-cublas-cuda11-ffmpeg-core` |

---

> [!WARNING]
> Some dependencies could not be looked up. Check the Dependency Dashboard for more information.

---

### Release Notes

<details>
<summary>mudler/LocalAI (docker.io/localai/localai)</summary>

### [`v2.5.0`](https://github.com/mudler/LocalAI/releases/tag/v2.5.0)

[Compare
Source](https://github.com/mudler/LocalAI/compare/v2.4.1...v2.5.0)

<!-- Release notes generated using configuration in .github/release.yml
at master -->

##### What's Changed

This release adds more embedded models and shrinks image sizes.

You can now run `phi-2` (see [here](https://localai.io/basics/getting_started/#running-popular-models-one-click) for the full list) locally by starting LocalAI with:

    docker run -ti -p 8080:8080 localai/localai:v2.5.0-ffmpeg-core phi-2

LocalAI now accepts as arguments a list of model short-hands and/or URLs pointing to valid YAML files. A popular way to host those files is GitHub gists.

For instance, you can run `llava` by starting `local-ai` with:

```bash
docker run -ti -p 8080:8080 localai/localai:v2.5.0-ffmpeg-core https://raw.githubusercontent.com/mudler/LocalAI/master/embedded/models/llava.yaml
```

##### Exciting New Features 🎉

- feat: more embedded models, coqui fixes, add model usage and
description by [@&#8203;mudler](https://github.com/mudler) in
[https://github.com/mudler/LocalAI/pull/1556](https://github.com/mudler/LocalAI/pull/1556)

##### 👒 Dependencies

- deps(conda): use transformers-env with vllm,exllama(2) by
[@&#8203;mudler](https://github.com/mudler) in
[https://github.com/mudler/LocalAI/pull/1554](https://github.com/mudler/LocalAI/pull/1554)
- deps(conda): use transformers environment with autogptq by
[@&#8203;mudler](https://github.com/mudler) in
[https://github.com/mudler/LocalAI/pull/1555](https://github.com/mudler/LocalAI/pull/1555)
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://github.com/localai-bot) in
[https://github.com/mudler/LocalAI/pull/1558](https://github.com/mudler/LocalAI/pull/1558)

##### Other Changes

- ⬆️ Update docs version mudler/LocalAI by
[@&#8203;localai-bot](https://github.com/localai-bot) in
[https://github.com/mudler/LocalAI/pull/1557](https://github.com/mudler/LocalAI/pull/1557)

**Full Changelog**:
mudler/LocalAI@v2.4.1...v2.5.0

</details>

---

### Configuration

📅 **Schedule**: Branch creation - "before 10pm on monday" in timezone
Europe/Amsterdam, Automerge - At any time (no schedule defined).

🚦 **Automerge**: Enabled.

♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the
rebase/retry checkbox.

🔕 **Ignore**: Close this PR and you won't be reminded about this update
again.

---

- [ ] <!-- rebase-check -->If you want to rebase/retry this PR, check
this box

---

This PR has been generated by [Renovate
Bot](https://github.com/renovatebot/renovate).

<!--renovate-debug:eyJjcmVhdGVkSW5WZXIiOiIzNy4xMjcuMCIsInVwZGF0ZWRJblZlciI6IjM3LjEyNy4wIiwidGFyZ2V0QnJhbmNoIjoibWFzdGVyIn0=-->
@mudler mudler mentioned this pull request Jan 10, 2024
1 task
GabrielBarzen referenced this pull request in GabrielBarzen/charts Feb 2, 2024
….0 by renovate (truecharts#17044)

Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

TTS with coqui: Examples missing and Error 404: sendfile: file /tmp/generated/audio/piper.wav not found
1 participant