- San Francisco
- https://huyenchip.com
- @chipro
- in/chiphuyen
Highlights
- Pro
Cool LLM repos
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
MTEB: Massive Text Embedding Benchmark
Faster Whisper transcription with CTranslate2
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…
DSPy: The framework for programming—not prompting—language models
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
Turn expensive prompts into cheap fine-tuned models
A CLI that writes your git commit messages for you with AI
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Running large language models on a single GPU for throughput-oriented scenarios.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
Making large AI models cheaper, faster and more accessible
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…
Machine Learning Engineering Open Book
đź“‹ A list of open LLMs available for commercial use.
The goal of this project is to enable users to create cool web demos using the newly released OpenAI GPT-3 API with just a few lines of Python.
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
YaRN: Efficient Context Window Extension of Large Language Models
Reference implementation for DPO (Direct Preference Optimization)
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)