NVIDIA NIM APIs
Experience the leading models to build enterprise generative AI apps now.
Syllabi - Open Source AI Chatbot Platform with RAG
Source-code: https://github.com/Achu-shankar/Syllabi
LLMLingua Series | Effectively Deliver Information to LLMs via Prompt Compression
Source-code: https://github.com/microsoft/LLMLingua
Amazon Q - AWS
Stability-AI/StableSwarmUI: StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility. - Stability-AI/StableSwarmUI
Significant-Gravitas/AutoGPT · GitHub
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters. - GitHub - Significant-Gravitas/AutoGPT: Aut...
cumulo-autumn/StreamDiffusion · GitHub
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation - cumulo-autumn/StreamDiffusion: StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
alibaba/MNN · GitHub
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ...
AMD-AIG-AIMA/Instella: Fully Open Language Models with Stellar Performance
Fully Open Language Models with Stellar Performance - AMD-AIG-AIMA/Instella
microsoft/semantic-kernel: Integrate cutting-edge LLM technology quickly and easily into your apps
Integrate cutting-edge LLM technology quickly and easily into your apps - microsoft/semantic-kernel
BentoML: Build, Ship, Scale AI Applications
Source-code: https://github.com/bentoml/OpenLLM
Memori – The memory fabric for enterprise AI
Memori keeps context alive, helping your AI applications deliver smarter, faster answers without wasting tokens.
OpenRouter
The unified interface for LLMs. Find the best models & prices for your prompts
sgl-project/sglang: SGLang is a fast serving framework for large language models and vision language models.
SGLang is a fast serving framework for large language models and vision language models. - sgl-project/sglang
vllm-project/vllm: A high-throughput and memory-efficient inference and serving engine for LLMs
A high-throughput and memory-efficient inference and serving engine for LLMs - vllm-project/vllm