Baseten
Baseten provides a platform for deploying and managing AI models in production, offering performant model runtimes, cross-cloud high availability, and seamless developer workflows.
Websites:
Current Valuation
as of January 20, 2026 (Source)
Funding Summary
Total reported funding
Key People
Core OSS Projects
A unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.
Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.
Ready-to-use ML training recipes to help you build and deploy models on Baseten.
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
Renderer for the harmony response format to be used with gpt-oss
Business Information
Industries
Technologies
Sectors
Licenses
Socials and Communities
Cossmology Badge
COSS Weekly Newsletter
Stay up to date with the latest news, funding rounds, and announcements from the COSS universe.
Check out COSS Weekly on the web
