-
LMI
- Pittsburgh PA / DC
- https://hughesadam87.medium.com/
- in/adam-hughes-80882734
Stars
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
A high-throughput and memory-efficient inference and serving engine for LLMs
Unified framework for building enterprise RAG pipelines with small, specialized models
RAG architecture: index and query any data using LLM and natural language, track sources, show citations, asynchronous memory patterns.
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
Sample project showing how to set up DI with Cucumber using Spring Boot
Extensions to the StreamBase Unit testing API
hughesadam87 / scipy2012
Forked from scopatz/scipy2012Possible SciPy 2012 Talks
Tools used to generate the SciPy conference proceedings