chiphuyen

Stars

Cool LLM repos

323 repositories

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,408 343 Updated Nov 3, 2024

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 10,006 837 Updated Feb 7, 2025

Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"

Python 2,267 165 Updated Dec 11, 2024

Inference code for CodeLlama models

Python 16,182 1,893 Updated Aug 12, 2024

MTEB: Massive Text Embedding Benchmark

Jupyter Notebook 2,148 313 Updated Feb 7, 2025

Faster Whisper transcription with CTranslate2

Python 13,926 1,165 Updated Jan 1, 2025

Minimalist ML framework for Rust

Rust 16,494 1,020 Updated Feb 4, 2025

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Python 4,592 486 Updated Feb 7, 2025

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image genera…

Jupyter Notebook 7,049 1,068 Updated Aug 6, 2024

DSPy: The framework for programming, not prompting, language models

Python 21,699 1,640 Updated Feb 7, 2025

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Python 1,907 143 Updated Jan 15, 2025

Turn expensive prompts into cheap fine-tuned models

TypeScript 2,535 135 Updated May 25, 2024

A CLI that writes your git commit messages for you with AI

TypeScript 8,220 406 Updated Aug 15, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,404 163 Updated Jun 25, 2024

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Python 7,368 520 Updated Sep 18, 2024

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 10,141 798 Updated Feb 6, 2025

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,254 558 Updated Oct 28, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,332 1,089 Updated Feb 7, 2025

Making large AI models cheaper, faster and more accessible

Python 39,054 4,366 Updated Feb 6, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 121,786 9,770 Updated Feb 7, 2025

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…

Go 30,102 2,259 Updated Feb 6, 2025

Machine Learning Engineering Open Book

Python 12,675 774 Updated Feb 3, 2025

📋 A list of open LLMs available for commercial use.

11,619 796 Updated Feb 3, 2025

LLM Frontend for Power Users.

JavaScript 10,591 2,728 Updated Feb 7, 2025

The goal of this project is to enable users to create cool web demos using the newly released OpenAI GPT-3 API with just a few lines of Python.

JavaScript 2,888 875 Updated Oct 4, 2023

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,785 102 Updated Jan 21, 2024

Mamba SSM architecture

Python 13,901 1,200 Updated Jan 18, 2025

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,410 119 Updated Apr 17, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,352 196 Updated Aug 11, 2024

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Jupyter Notebook 2,887 261 Updated May 3, 2024