llm
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.
Build Conversational AI in minutes ⚡️
A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
GGML implementation of BERT model with Python bindings and quantization.
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
lightweight, standalone C++ inference engine for Google's Gemma models.
Interact with your documents using the power of GPT, 100% privately, no data leaks
Developer APIs to Accelerate LLM Projects
This project aims to enable language model inference on FPGAs, supporting AI applications in edge devices and environments with limited resources.
[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personali…
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
👾 Open source implementation of the ChatGPT Code Interpreter
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curatio…
Distributed LLM and StableDiffusion inference for mobile, desktop and server.
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
A Comprehensive Toolkit for High-Quality PDF Content Extraction
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
Start building LLM-empowered multi-agent applications in an easier way.
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents
How to use bounding boxes with the Gemini API
LLaVA-JP is a Japanese VLM trained by LLaVA method
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.