- Melbourne
-
18:47
(UTC +11:00) - www.linkedin.com/in/shamane-siriwardhana
- @gshamane
Stars
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
Democratizing Reinforcement Learning for LLMs
A list of awesome papers and resources of recommender system on large language model (LLM).
Tools for merging pretrained large language models.
Flexible and powerful framework for managing multiple AI agents and handling complex conversations
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
The repository contains all the set-up required to execute trainium training jobs.
Robust recipes to align language models with human and AI preferences
Domain Adapted Language Modeling Toolkit - E2E RAG
Codebase for KDD 2023 paper, Text Is All You Need: Learning Language Representations for Sequential Recommendation
The project targets to explore the use of Large Language models in education and develop an intelligent tutor.
A python package for benchmarking interpretability techniques on Transformers.
[WWW'23] PyTorch implementation for "Learning Vector-Quantized Item Representation for Transferable Sequential Recommenders".
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses with the possibility to training/evaluating on real human responses.
Weakly-supervised BART-based autobiographical text summarization model.
PyTorch reimplementation of REALM and ORQA
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)