Add fastembed integration #210

NirantK · 2023-07-12T08:41:51Z

This PR adds two new functions, `add` and `query`, and a new return type, `QueryResponse`. This makes it much easier for people coming from NLP to use state-of-the-art embeddings (beating OpenAI's) that are also several times faster.

Improvements:

Configurable ONNX runtime: lets users choose GPU, Apple Metal (M1/M2), CPU, and other runtimes for creating batch embeddings at insertion time. This goes through ONNX Runtime, and we default to the CPU runtime.
Quantized model using optimum: this makes the model ~2x faster than the PyTorch runtime on CPU.
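
To make the intended call shape concrete, here is a toy, dependency-free sketch: documents go in as plain text, get embedded at insertion time, and a query returns response objects carrying the document and a score. `ToyClient`, `toy_embed`, and the fields on this `QueryResponse` are hypothetical stand-ins for illustration, not the code in this PR; a real setup would use a fastembed model instead of the hashed bag-of-words below.

```python
import math
import zlib
from dataclasses import dataclass
from typing import Any, Dict, List, Optional, Tuple

DIM = 256  # arbitrary size for the toy embedding (assumption)


@dataclass
class QueryResponse:
    # Hypothetical fields; the QueryResponse added by the PR may differ.
    id: int
    document: str
    metadata: Dict[str, Any]
    score: float


def toy_embed(text: str, dim: int = DIM) -> List[float]:
    """Stand-in for a real embedding model: hashed bag of words, L2-normalized."""
    vec = [0.0] * dim
    for token in text.lower().split():
        vec[zlib.crc32(token.encode("utf-8")) % dim] += 1.0
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


class ToyClient:
    """In-memory illustration of an add/query interface (not QdrantClient)."""

    def __init__(self) -> None:
        self._points: List[Tuple[int, str, Dict[str, Any], List[float]]] = []

    def add(
        self,
        documents: List[str],
        metadata: Optional[List[Dict[str, Any]]] = None,
    ) -> List[int]:
        # Embed each document at insertion time and store it with its metadata.
        metadata = metadata if metadata is not None else [{} for _ in documents]
        ids = []
        for doc, meta in zip(documents, metadata):
            point_id = len(self._points)
            self._points.append((point_id, doc, meta, toy_embed(doc)))
            ids.append(point_id)
        return ids

    def query(self, query_text: str, limit: int = 3) -> List[QueryResponse]:
        # Embed the query, score by cosine similarity, return the best matches.
        q = toy_embed(query_text)
        hits = [
            QueryResponse(
                id=pid,
                document=doc,
                metadata=meta,
                score=sum(a * b for a, b in zip(q, vec)),
            )
            for pid, doc, meta, vec in self._points
        ]
        hits.sort(key=lambda h: h.score, reverse=True)
        return hits[:limit]
```

The point of the shape is that callers never handle vectors directly: they pass text to `add` and `query`, and read `document`, `metadata`, and `score` off each returned object.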

netlify · 2023-07-12T08:41:55Z

✅ Deploy Preview for poetic-froyo-8baba7 ready!

Name	Link
🔨 Latest commit	`8835f08`
🔍 Latest deploy log	https://app.netlify.com/sites/poetic-froyo-8baba7/deploys/64df3112e39d5c0008a60076
😎 Deploy Preview	https://deploy-preview-210--poetic-froyo-8baba7.netlify.app/qdrant_client.conversions.conversion

qdrant_client/qdrant_client.py

…rom fastembed instead of fastvector * fix(qdrant_client.py): remove unnecessary print statement

* feat(README.md): add section for Fast Embeddings + Simpler API * fix(README.md): fix formatting of code block and update code example for Qdrant Client usage *

qdrant_client/qdrant_client.py

generall · 2023-08-01T08:30:01Z

Overall it would be nice to have tests for this integration

* feat(qdrant_client.py): add support for adding and querying documents with fastembed installed * test(qdrant_client.py): add tests for adding and querying documents with and without fastembed installed

…ntAPIExtensions for upsert_docs and search_docs methods

* feat(qdrant_client.py): add return type hint to QdrantClient.search_docs method

…in qdrant_client.py

qdrant_client/qdrant_client.py

tests/test_qdrant_client.py

…lass

…ixture * test(test_qdrant_client.py): add test for client_close function

* chore(test_qdrant_client.py): reformat import statements for better readability

… DefaultEmbedding instead of FlagEmbedding * refactor(qdrant_client.py): refactor code to remove unnecessary loop

* feat(qdrant_client.py): add support for search parameters in search method * refactor(qdrant_client.py): refactor indexing logic to handle embeddings correctly

…s in collection * test(test_fast_embed.py): remove unused code * test(test_fast_embed.py): add TODO comment for future assertions

…dencies * feat(pyproject.toml): add fastembed dependency to fastembed group

…rantClient constructor

* fix(test_fast_embed.py): add default values for test_no_install parameters * fix(test_fast_embed.py): skip test if FastEmbed is installed
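
The "with and without fastembed installed" behavior mentioned in the commits above can be sketched as an optional-dependency guard. This is a minimal illustration under assumptions: the helper names `fastembed_installed` and `require_fastembed` and the error message are hypothetical, not taken from the PR.

```python
import importlib.util


def fastembed_installed() -> bool:
    """Return True if the optional `fastembed` package is importable."""
    return importlib.util.find_spec("fastembed") is not None


def require_fastembed() -> None:
    """Fail with a clear message when an embedding-backed method is called
    without the optional dependency (hypothetical helper)."""
    if not fastembed_installed():
        raise ImportError(
            "fastembed is not installed. "
            "Install the optional fastembed dependency to use add/query."
        )
```

A test for the missing-dependency path could then be skipped when the package is present, for example with `pytest.mark.skipif(fastembed_installed(), reason="fastembed is installed")`.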

joein · 2023-08-16T21:32:59Z

I haven't looked deeply into the PR yet, but all the packages inside qdrant_client.http are auto-generated and should not be modified manually.

If they really do need to be modified, https://github.com/qdrant/pydantic_openapi_v3 should be upgraded first.

NirantK · 2023-08-17T03:37:58Z

@joein for return types which are exclusive to the Python client for now, is the OpenAPI spec where the changes should be made?

Since it is auto-generated from the Rust models directly, I would prefer not to do that. An alternative proposal, 4ff71e3, is to keep this in the Python client itself.

* feat(qdrant_client.py): add QueryResponse class

…or QueryResponse class

generall · 2023-08-17T16:25:42Z

Hi @NirantK, I made some changes to the PR:

Moved the fastembed function into middleware, so it is easier to navigate
Dropped support for Python 3.7, as fastembed requires >3.7. If that is not actually the case, please change it in the fastembed manifest and revert my changes related to it. We wanted to drop 3.7 sooner or later either way
Made the interface a bit more compatible and consistent with existing functions. I hope it didn't compromise the "pythonistic" approach too much

I am going to suggest some changes in fastembed as well

NirantK · 2023-08-18T03:27:09Z