v1.80.14 - Manus API Support

January 10, 2026

Krrish Dholakia

CEO, LiteLLM

Ishaan Jaff

CTO, LiteLLM

Deploy this version

Docker
Pip

docker run litellm
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
docker.litellm.ai/berriai/litellm:v1.80.14-stable

pip install litellm
pip install litellm==1.80.14

Key Highlights

Manus API Support - New provider support for Manus API on /responses and GET /responses endpoints
MiniMax Provider - Full support for MiniMax chat completions, TTS, and Anthropic native endpoint
AWS Polly TTS - New TTS provider using AWS Polly API
SSO Role Mapping - Configure role mappings for SSO providers directly in the UI
Cost Estimator - New UI tool for estimating costs across multiple models and requests
MCP Global Mode - Configure MCP servers globally with visibility controls
Interactions API Bridge - Use all LiteLLM providers with the Interactions API
RAG Query Endpoint - New RAG Search/Query endpoint for retrieval-augmented generation
92.7% Faster Provider Config Lookup - Major performance improvement for provider configuration
UI Usage - Endpoint Activity - Users can now see Endpoint Activity Metrics in the UI

UI Usage - Endpoint Activity

Users can now see Endpoint Activity Metrics in the UI.

New Providers and Endpoints

New Providers (11 new providers)

Provider	Supported LiteLLM Endpoints	Description
Manus	`/responses`	Manus API for agentic workflows
Manus	`GET /responses`	Manus API for retrieving responses
Manus	`/files`	Manus API for file management
MiniMax	`/chat/completions`	MiniMax chat completions
MiniMax	`/audio/speech`	MiniMax text-to-speech
AWS Polly	`/audio/speech`	AWS Polly text-to-speech API
GigaChat	`/chat/completions`	GigaChat provider for Russian language AI
LlamaGate	`/chat/completions`	LlamaGate chat completions
LlamaGate	`/embeddings`	LlamaGate embeddings
Abliteration AI	`/chat/completions`	Abliteration.ai provider support
Bedrock	`/v1/messages/count_tokens`	Bedrock as new provider for token counting

New LLM API Endpoints (3 new endpoints)

Endpoint	Method	Description	Documentation
`/responses/compact`	POST	Compact responses API endpoint	Docs
`/rag/query`	POST	RAG Search/Query endpoint	Docs
`/containers/{id}/files`	POST	Upload files to containers	Docs

New Models / Updated Models

New Model Support (100+ new models)

Provider	Model	Context Window	Input ($/1M tokens)	Output ($/1M tokens)	Features
Azure	`azure/gpt-5.2`	400K	$1.75	$14.00	Reasoning, vision, caching
Azure	`azure/gpt-5.2-chat`	128K	$1.75	$14.00	Reasoning, vision
Azure	`azure/gpt-5.2-pro`	400K	$21.00	$168.00	Reasoning, vision, web search
Azure	`azure/gpt-image-1.5`	-	Token-based	Token-based	Image generation/editing
Azure AI	`azure_ai/gpt-oss-120b`	131K	$0.15	$0.60	Function calling
Azure AI	`azure_ai/flux.2-pro`	-	-	$0.04/image	Image generation
Azure AI	`azure_ai/deepseek-v3.2`	164K	$0.58	$1.68	Reasoning, function calling
Bedrock	`amazon.nova-2-multimodal-embeddings-v1:0`	8K	$0.135	-	Multimodal embeddings
Bedrock	`writer.palmyra-x4-v1:0`	128K	$2.50	$10.00	Function calling, PDF
Bedrock	`writer.palmyra-x5-v1:0`	1M	$0.60	$6.00	Function calling, PDF
Bedrock	`moonshot.kimi-k2-v1:0`	-	-	-	Kimi K2 model
Cerebras	`cerebras/zai-glm-4.6`	128K	$2.25	$2.75	Reasoning, function calling
GigaChat	`gigachat/GigaChat-2-Lite`	-	-	-	Chat completions
GigaChat	`gigachat/GigaChat-2-Max`	-	-	-	Chat completions
GigaChat	`gigachat/GigaChat-2-Pro`	-	-	-	Chat completions
Gemini	`gemini/veo-3.1-generate-001`	-	-	-	Video generation
Gemini	`gemini/veo-3.1-fast-generate-001`	-	-	-	Video generation
GitHub Copilot	25+ models	Various	-	-	Chat completions
LlamaGate	15+ models	Various	-	-	Chat, vision, embeddings
MiniMax	`minimax/abab7-chat-preview`	-	-	-	Chat completions
Novita	80+ models	Various	Various	Various	Chat, vision, embeddings
OpenRouter	`openrouter/google/gemini-3-flash-preview`	-	-	-	Chat completions
Together AI	Multiple models	Various	Various	Various	Response schema support
Vertex AI	`vertex_ai/zai-glm-4.7`	-	-	-	GLM 4.7 support

Features

Gemini
- Add image tokens in chat completion - PR #18327
- Add usage object in image generation - PR #18328
- Add thought signature support via tool call id - PR #18374
- Add thought signature for non tool call requests - PR #18581
- Preserve system instructions - PR #18585
- Fix Gemini 3 images in tool response - PR #18190
- Support snake_case for google_search tool parameters - PR #18451
- Google GenAI adapter inline data support - PR #18477
- Add deprecation_date for discontinued Google models - PR #18550
Vertex AI
- Add centralized get_vertex_base_url() helper for global location support - PR #18410
- Convert image URLs to base64 for Vertex AI Anthropic - PR #18497
- Separate Tool objects for each tool type per API spec - PR #18514
- Add thought_signatures to VertexGeminiConfig - PR #18853
- Add support for Vertex AI API keys - PR #18806
- Add zai glm-4.7 model support - PR #18782
Azure
- Add Azure gpt-image-1.5 pricing to cost map - PR #18347
- Add azure/gpt-5.2-chat model - PR #18361
- Add support for image generation via Azure AD token - PR #18413
- Add logprobs support for Azure OpenAI GPT-5.2 model - PR #18856
- Add Azure BFL Flux 2 models for image generation and editing - PR #18764, PR #18766
Bedrock
- Add Bedrock Kimi K2 model support - PR #18797
- Add support for model id in bedrock passthrough - PR #18800
- Fix Nova model detection for Bedrock provider - PR #18250
- Ensure toolUse.input is always a dict when converting from OpenAI format - PR #18414
Databricks
- Add enhanced authentication, security features, and custom user-agent support - PR #18349
MiniMax
- Add MiniMax chat completion support - PR #18380
- Add Anthropic native endpoint support for MiniMax - PR #18377
- Add support for MiniMax TTS - PR #18334
- Add MiniMax provider support to UI dashboard - PR #18496
Together AI
- Add supports_response_schema to all supported Together AI models - PR #18368
OpenRouter
- Add OpenRouter embeddings API support - PR #18391
Anthropic
- Pass server_tool_use and tool_search_tool_result blocks - PR #18770
- Add Anthropic cache control option to image tool call results - PR #18674
Ollama
- Add dimensions for ollama embedding - PR #18536
- Extract pure base64 data from data URLs for Ollama - PR #18465
Watsonx
- Add Watsonx fields support - PR #18569
- Fix Watsonx Audio Transcription - filter model field - PR #18810
SAP
- Add SAP creds for list in proxy UI - PR #18375
- Pass through extra params from allowed_openai_params - PR #18432
- Add client header for SAP AI Core Tracking - PR #18714
Fireworks AI
- Correct deepseek-v3p2 pricing - PR #18483
ZAI
- Add GLM-4.7 model with reasoning support - PR #18476
Codestral
- Correctly route codestral chat and FIM endpoints - PR #18467
Azure AI
- Fix authentication errors at messages API via azure_ai - PR #18500

New Provider Support

AWS Polly - Add AWS Polly API for TTS - PR #18326
GigaChat - Add GigaChat provider support - PR #18564
LlamaGate - Add LlamaGate as a new provider - PR #18673
Abliteration AI - Add abliteration.ai provider - PR #18678
Manus - Add Manus API support on /responses, GET /responses - PR #18804
5 AI Providers via openai_like - Add 5 AI providers using openai_like - PR #18362

Bug Fixes

Gemini
- Properly catch context window exceeded errors - PR #18283
- Remove prompt caching headers as support has been removed - PR #18579
- Fix generate content request with audio file id - PR #18745
- Fix google_genai streaming adapter provider handling - PR #18845
Groq
- Remove deprecated Groq models and update model registry - PR #18062
Vertex AI
- Handle unsupported region for Vertex AI count tokens endpoint - PR #18665
General
- Fix request body for image embedding request - PR #18336
- Fix lost tool_calls when streaming has both text and tool_calls - PR #18316
- Add all resolution for gpt-image-1.5 - PR #18586
- Fix gpt-image-1 cost calculation using token-based pricing - PR #17906
- Fix response_format leaking into extra_body - PR #18859
- Align max_tokens with max_output_tokens for consistency - PR #18820

LLM API Endpoints

Features

Responses API
- Add new compact endpoint (v1/responses/compact) - PR #18697
- Support more streaming callback hooks - PR #18513
- Add mapping for reasoning effort to summary param - PR #18635
- Add output_text property to ResponsesAPIResponse - PR #18491
- Add annotations to completions responses API bridge - PR #18754
Interactions API
- Allow using all LiteLLM providers (interactions -> responses API bridge) - PR #18373
RAG Search API
- Add RAG Search/Query endpoint - PR #18376
CountTokens API
- Add Bedrock as a new provider for /v1/messages/count_tokens - PR #18858
Generate Content
- Add generate content in LLM route - PR #18405
General
- Enable async_post_call_failure_hook to transform error responses - PR #18348
- Calculate total_tokens manually if missing and can be calculated - PR #18445
- Add custom llm provider to get_llm_provider when sent via UI - PR #18638

Bugs

General
- Handle empty error objects in response conversion - PR #18493
- Preserve client error status codes in streaming mode - PR #18698
- Return json error response instead of SSE format for initial streaming errors - PR #18757
- Fix auth header for custom api base in generateContent request - PR #18637
- Tool content should be string for Deepinfra - PR #18739
- Fix incomplete usage in response object passed - PR #18799
- Unify model names to provider-defined names - PR #18573

Management Endpoints / UI

Features

SSO Configuration
- Add SSO Role Mapping feature - PR #18090
- Add SSO Settings Page - PR #18600
- Allow adding role mappings for SSO - PR #18593
- SSO Settings Page Add Role Mappings - PR #18677
- SSO Settings Loading State + Deprecate Previous SSO Flow - PR #18617
Virtual Keys
- Allow deleting key expiry - PR #18278
- Add optional query param "expand" to /key/list - PR #18502
- Key Table Loading Skeleton - PR #18527
- Allow column resizing on Keys Table - PR #18424
- Virtual Keys Table Loading State Between Pages - PR #18619
- Key and Team Router Setting - PR #18790
- Allow router_settings on Keys and Teams - PR #18675
- Use timedelta to calculate key expiry on generate - PR #18666
Models + Endpoints
- Add Model Clearer Flow For Team Admins - PR #18532
- Model Page Loading State - PR #18574
- Model Page Model Provider Select Performance - PR #18425
- Model Page Sorting Sorts Entire Set - PR #18420
- Refactor Model Hub Page - PR #18568
- Add request provider form on UI - PR #18704
Organizations & Teams
- Allow Organization Admins to See Organization Tab - PR #18400
- Resolve Organization Alias on Team Table - PR #18401
- Resolve Team Alias in Organization Info View - PR #18404
- Allow Organization Admins to View Their Organization Info - PR #18417
- Allow editing team_member_budget_duration in /team/update - PR #18735
- Reusable Duration Select + Team Update Member Budget Duration - PR #18736
Usage & Spend
- Add Error Code Filtering on Spend Logs - PR #18359
- Add Error Code Filtering on UI - PR #18366
- Usage Page User Max Budget fix - PR #18555
- Add endpoint to Daily Activity Tables - PR #18729
- Endpoint Activity in Usage - PR #18798
Cost Estimator
- Add Cost Estimator for AI Gateway - PR #18643
- Add view for estimating costs across requests - PR #18645
- Allow selecting many models for cost estimator - PR #18653
CloudZero
- Improve Create and Delete Path for CloudZero - PR #18263
- Add CloudZero UI Docs - PR #18350
Playground
- Add MCP test support to completions on Playground - PR #18440
- Add selectable MCP servers to the playground - PR #18578
- Add custom proxy base URL support to Playground - PR #18661
General UI
- UI styling improvements and fixes - PR #18310
- Add reusable "New" badge component for feature highlights - PR #18537
- Hide New Badges - PR #18547
- Change Budget page to Have Tabs - PR #18576
- Clicking on Logo Directs to Correct URL - PR #18575
- Add UI support for configuring meta URLs - PR #18580
- Expire Previous UI Session Tokens on Login - PR #18557
- Add license endpoint - PR #18311
- Router Fields Endpoint + React Query for Router Fields - PR #18880

Bugs

UI Fixes
- Fix Key Creation MCP Settings Submit Form Unintentionally - PR #18355
- Fix UI Disappears in Development Environments - PR #18399
- Fix Disable Admin UI Flag - PR #18397
- Remove Model Analytics From Model Page - PR #18552
- Useful Links Remove Modal on Adding Links - PR #18602
- SSO Edit Modal Clear Role Mapping Values on Provider Change - PR #18680
- UI Login Case Sensitivity fix - PR #18877
API Fixes
- Fix User Invite & Key Generation Email Notification Logic - PR #18524
- Normalize Proxy Config Callback - PR #18775
- Return empty data array instead of 500 when no models configured - PR #18556
- Enforce org level max budget - PR #18813

AI Integrations

New Integrations (4 new integrations)

Integration	Type	Description
Focus	Logging	Focus export support for observability - PR #18802
SigNoz	Logging	SigNoz integration for observability - PR #18726
Qualifire	Guardrails	Qualifire guardrails and eval webhook - PR #18594
Levo AI	Guardrails	Levo AI integration for security - PR #18529

Logging

DataDog
- Fix span kind fallback when parent_id missing - PR #18418
Langfuse
- Map Gemini cached_tokens to Langfuse cache_read_input_tokens - PR #18614
Prometheus
- Align prometheus metric names with DEFINED_PROMETHEUS_METRICS - PR #18463
- Add Prometheus metrics for request queue time and guardrails - PR #17973
- Add caching metrics for cache hits, misses, and tokens - PR #18755
- Skip metrics for invalid API key requests - PR #18788
Braintrust
- Pass span_attributes in async logging and skip tags on non-root spans - PR #18409
CloudZero
- Add user email to CloudZero - PR #18584
OpenTelemetry
- Use already configured opentelemetry providers - PR #18279
- Prevent LiteLLM from closing external OTEL spans - PR #18553
- Allow configuring arize project name for OpenTelemetry service name - PR #18738
LangSmith
- Add support for LangSmith organization-scoped API keys with tenant ID - PR #18623
Generic API Logger
- Add log_format option to GenericAPILogger - PR #18587

Guardrails

Content Filter
- Add content filter logs page - PR #18335
- Log actual event type for guardrails - PR #18489
Qualifire
- Add Qualifire eval webhook - PR #18836
Lasso Security
- Add Lasso guardrail API docs - PR #18652
Noma Security
- Add MCP guardrail support for Noma - PR #18668
Bedrock Guardrails
- Remove redundant Bedrock guardrail block handling - PR #18634
General
- Generic guardrail API update - PR #18647
- Prevent proxy startup failures from case-sensitive tool permission guardrail validation - PR #18662
- Extend case normalization to ALL guardrail types - PR #18664
- Fix MCP handling in unified guardrail - PR #18630
- Fix embeddings calltype for guardrail precallhook - PR #18740

Spend Tracking, Budgets and Rate Limiting

Platform Fee / Margins - Add support for Platform Fee / Margins - PR #18427
Negative Budget Validation - Add validation for negative budget - PR #18583
Cost Calculation Fixes
- Correct cost calculation when reasoning_tokens are without text_tokens - PR #18607
- Fix background cost tracking tests - PR #18588
Tag Routing - Support toggling tag matching between ANY and ALL - PR #18776

MCP Gateway

MCP Global Mode - Add MCP global mode - PR #18639
MCP Server Visibility - Add configurable MCP server visibility - PR #18681
MCP Registry - Add MCP registry - PR #18850
MCP Stdio Header - Support MCP stdio header env overrides - PR #18324
Parallel Tool Fetching - Parallelize tool fetching from multiple MCP servers - PR #18627
Optimize MCP Server Listing - Separate health checks for optimized listing - PR #18530
Auth Improvements
- Require auth for MCP connection test endpoint - PR #18290
- Fix MCP gateway OAuth2 auth issues and ClosedResourceError - PR #18281
Bug Fixes
- Fix MCP server health status reporting - PR #18443
- Fix OpenAPI to MCP tool conversion - PR #18597
- Remove exec() usage and handle invalid OpenAPI parameter names for security - PR #18480
- Fix MCP error when using multiple servers simultaneously - PR #18855
Migrate MCP Fetching Logic to React Query - PR #18352

Performance / Loadbalancing / Reliability improvements

92.7% Faster Provider Config Lookup - LiteLLM now stresses LLM providers 2.5x more - PR #18867
Lazy Loading Improvements
- Consolidate lazy import handlers with registry pattern - PR #18389
- Complete lazy loading migration for all 180+ LLM config classes - PR #18392
- Lazy load additional components (types, callbacks, utilities) - PR #18396
- Add lazy loading for get_llm_provider - PR #18591
- Lazy-load heavy audio library and loggers - PR #18592
- Lazy load 9 heavy imports in litellm/utils.py - PR #18595
- Lazy load heavy imports to improve import time and memory usage - PR #18610
- Implement lazy loading for provider configs, model info classes, streaming handlers - PR #18611
- Lazy load 15 additional imports - PR #18613
- Lazy load 15+ unused imports - PR #18616
- Lazy load DatadogLLMObsInitParams - PR #18658
- Migrate utils.py lazy imports to registry pattern - PR #18657
- Lazy load get_llm_provider and remove_index_from_tool_calls - PR #18608
Router Improvements
- Validate routing_strategy at startup to fail fast with helpful error - PR #18624
- Correct num_retries tracking in retry logic - PR #18712
- Improve error messages and validation for wildcard routing with multiple credentials - PR #18629
Memory Improvements
- Add memory pattern detection test and fix bad memory patterns - PR #18589
- Add unbounded data structure detection to memory test - PR #18590
- Add memory leak detection tests with CI integration - PR #18881
Database
- Add idx on LOWER(user_email) for faster duplicate email checks - PR #18828
- Proactive RDS IAM token refresh to prevent 15-min connection failed - PR #18795
- Clarify database_connection_pool_limit applies per worker - PR #18780
- Make base_connection_pool_limit default value the same - PR #18721
Docker
- Add libsndfile to database Docker image for audio processing - PR #18612
- Add line_profiler support for performance analysis and fix Windows CRLF issues - PR #18773
Helm
- Add lifecycle support to Helm charts - PR #18517
Authentication
- Add Kubernetes ServiceAccount JWT authentication support - PR #18055
- Use async anthropic client to prevent event loop blocking - PR #18435
Logging Worker
- Handle event loop changes in multiprocessing - PR #18423
Security
- Prevent expired key plaintext leak in error response - PR #18860
- Mask extra header secrets in model info - PR #18822
- Prevent duplicate User-Agent tags in request_tags - PR #18723
- Properly use litellm api keys - PR #18832
Misc
- Remove double imports in main.py - PR #18406
- Add LITELLM_DISABLE_LAZY_LOADING env var to fix VCR cassette creation issue - PR #18725
- Add xiaomi_mimo to LlmProviders enum to fix router support - PR #18819
- Allow installation with current grpcio on old Python - PR #18473
- Add Custom CA certificates to boto3 clients - PR #18852
- Fix bedrock_cache, metadata and max_model_budget - PR #18872
- Fix LiteLLM SDK embedding headers missing field - PR #18844
- Put automatic reasoning summary inclusion behind feat flag - PR #18688
- turn_off_message_logging Does Not Redact Request Messages in proxy_server_request Field - PR #18897

Documentation Updates

Provider Documentation
- Update MiniMax docs to be in proper format - PR #18403
- Add docs for 5 AI providers - PR #18388
- Fix gpt-5-mini reasoning_effort supported values - PR #18346
- Fix PDF documentation inconsistency in Anthropic page - PR #18816
- Update OpenRouter docs to include embedding support - PR #18874
- Add LITELLM_REASONING_AUTO_SUMMARY in doc - PR #18705
MCP Documentation
- Agentcore MCP server docs - PR #18603
- Mention MCP prompt/resources types in overview - PR #18669
- Add Focus docs - PR #18837
Guardrails Documentation
- Qualifire docs hotfix - PR #18724
Infrastructure Documentation
- IAM Roles Anywhere docs - PR #18559
- Fix formatting in proxy configs documentation - PR #18498
- Fix GCS cache docs missing for proxy mode - PR #13328
- Fix how to execute cloudzero sql - PR #18841
General
- LiteLLM adopters section - PR #18605
- Remove redundant comments about setting litellm.callbacks - PR #18711
- Update header to be markdown bold by removing space - PR #18846
- Manus docs - new provider - PR #18817

New Contributors

@prasadkona made their first contribution in PR #18349
@lucasrothman made their first contribution in PR #18283
@aggeentik made their first contribution in PR #18317
@mihidumh made their first contribution in PR #18361
@Prazeina made their first contribution in PR #18498
@systec-dk made their first contribution in PR #18500
@xuan07t2 made their first contribution in PR #18514
@RensDimmendaal made their first contribution in PR #18190
@yurekami made their first contribution in PR #18483
@agertz7 made their first contribution in PR #18556
@yudelevi made their first contribution in PR #18550
@smallp made their first contribution in PR #18536
@kevinpauer made their first contribution in PR #18569
@cansakiroglu made their first contribution in PR #18517
@dee-walia20 made their first contribution in PR #18432
@luxinfeng made their first contribution in PR #18477
@cantalupo555 made their first contribution in PR #18476
@andersk made their first contribution in PR #18473
@majiayu000 made their first contribution in PR #18467
@amangupta-20 made their first contribution in PR #18529
@hamzaq453 made their first contribution in PR #18480
@ktsaou made their first contribution in PR #18627
@FlibbertyGibbitz made their first contribution in PR #18624
@drorIvry made their first contribution in PR #18594
@urainshah made their first contribution in PR #18524
@mangabits made their first contribution in PR #18279
@0717376 made their first contribution in PR #18564
@nmgarza5 made their first contribution in PR #17330
@wileykestner made their first contribution in PR #18445
@minijeong-log made their first contribution in PR #14440
@Isaac4real made their first contribution in PR #18710
@marukaz made their first contribution in PR #18711
@rohitravirane made their first contribution in PR #18712
@lizzzcai made their first contribution in PR #18714
@hkd987 made their first contribution in PR #18673
@Mr-Pepe made their first contribution in PR #18674
@gkarthi-signoz made their first contribution in PR #18726
@Tianduo16 made their first contribution in PR #18723
@wilsonjr made their first contribution in PR #18721
@abliteration-ai made their first contribution in PR #18678
@danialkhan02 made their first contribution in PR #18770
@ihower made their first contribution in PR #18409
@elkkhan made their first contribution in PR #18391
@runixer made their first contribution in PR #18435
@choby-shun made their first contribution in PR #18776
@jutaz made their first contribution in PR #18853
@sjmatta made their first contribution in PR #18250
@andres-ortizl made their first contribution in PR #18856
@gauthiermartin made their first contribution in PR #18844
@mel2oo made their first contribution in PR #18845
@DominikHallab made their first contribution in PR #18846
@ji-chuan-che made their first contribution in PR #18540
@raghav-stripe made their first contribution in PR #18858
@akraines made their first contribution in PR #18629
@otaviofbrito made their first contribution in PR #18665
@chetanchoudhary-sumo made their first contribution in PR #18587
@pascalwhoop made their first contribution in PR #13328
@orgersh92 made their first contribution in PR #18652
@DevajMody made their first contribution in PR #18497
@matt-greathouse made their first contribution in PR #18247
@emerzon made their first contribution in PR #18290
@Eric84626 made their first contribution in PR #18281
@LukasdeBoer made their first contribution in PR #18055
@LingXuanYin made their first contribution in PR #18513
@krisxia0506 made their first contribution in PR #18698
@LouisShark made their first contribution in PR #18414

Full Changelog

View complete changelog on GitHub

Deploy this version​

Key Highlights​

UI Usage - Endpoint Activity​

New Providers and Endpoints​

New Providers (11 new providers)​

New LLM API Endpoints (3 new endpoints)​

New Models / Updated Models​

New Model Support (100+ new models)​

Features​

New Provider Support​

Bug Fixes​

LLM API Endpoints​

Features​

Bugs​

Management Endpoints / UI​

Features​

Bugs​

AI Integrations​

New Integrations (4 new integrations)​

Logging​

Guardrails​

Spend Tracking, Budgets and Rate Limiting​

MCP Gateway​

Performance / Loadbalancing / Reliability improvements​

Documentation Updates​

New Contributors​

Full Changelog​

Deploy this version

Key Highlights

UI Usage - Endpoint Activity

New Providers and Endpoints

New Providers (11 new providers)

New LLM API Endpoints (3 new endpoints)

New Models / Updated Models

New Model Support (100+ new models)

Features

New Provider Support

Bug Fixes

LLM API Endpoints

Features

Bugs

Management Endpoints / UI

Features

Bugs

AI Integrations

New Integrations (4 new integrations)

Logging

Guardrails

Spend Tracking, Budgets and Rate Limiting

MCP Gateway

Performance / Loadbalancing / Reliability improvements

Documentation Updates

New Contributors

Full Changelog