Skip to main content

v1.80.14 - Manus API Support

Krrish Dholakia
CEO, LiteLLM
Ishaan Jaff
CTO, LiteLLM

Deploy this version​

docker run litellm
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
docker.litellm.ai/berriai/litellm:v1.80.14-stable

Key Highlights​


UI Usage - Endpoint Activity​

Users can now see Endpoint Activity Metrics in the UI.


New Providers and Endpoints​

New Providers (11 new providers)​

ProviderSupported LiteLLM EndpointsDescription
Manus/responsesManus API for agentic workflows
ManusGET /responsesManus API for retrieving responses
Manus/filesManus API for file management
MiniMax/chat/completionsMiniMax chat completions
MiniMax/audio/speechMiniMax text-to-speech
AWS Polly/audio/speechAWS Polly text-to-speech API
GigaChat/chat/completionsGigaChat provider for Russian language AI
LlamaGate/chat/completionsLlamaGate chat completions
LlamaGate/embeddingsLlamaGate embeddings
Abliteration AI/chat/completionsAbliteration.ai provider support
Bedrock/v1/messages/count_tokensBedrock as new provider for token counting

New LLM API Endpoints (3 new endpoints)​

EndpointMethodDescriptionDocumentation
/responses/compactPOSTCompact responses API endpointDocs
/rag/queryPOSTRAG Search/Query endpointDocs
/containers/{id}/filesPOSTUpload files to containersDocs

New Models / Updated Models​

New Model Support (100+ new models)​

ProviderModelContext WindowInput ($/1M tokens)Output ($/1M tokens)Features
Azureazure/gpt-5.2400K$1.75$14.00Reasoning, vision, caching
Azureazure/gpt-5.2-chat128K$1.75$14.00Reasoning, vision
Azureazure/gpt-5.2-pro400K$21.00$168.00Reasoning, vision, web search
Azureazure/gpt-image-1.5-Token-basedToken-basedImage generation/editing
Azure AIazure_ai/gpt-oss-120b131K$0.15$0.60Function calling
Azure AIazure_ai/flux.2-pro--$0.04/imageImage generation
Azure AIazure_ai/deepseek-v3.2164K$0.58$1.68Reasoning, function calling
Bedrockamazon.nova-2-multimodal-embeddings-v1:08K$0.135-Multimodal embeddings
Bedrockwriter.palmyra-x4-v1:0128K$2.50$10.00Function calling, PDF
Bedrockwriter.palmyra-x5-v1:01M$0.60$6.00Function calling, PDF
Bedrockmoonshot.kimi-k2-v1:0---Kimi K2 model
Cerebrascerebras/zai-glm-4.6128K$2.25$2.75Reasoning, function calling
GigaChatgigachat/GigaChat-2-Lite---Chat completions
GigaChatgigachat/GigaChat-2-Max---Chat completions
GigaChatgigachat/GigaChat-2-Pro---Chat completions
Geminigemini/veo-3.1-generate-001---Video generation
Geminigemini/veo-3.1-fast-generate-001---Video generation
GitHub Copilot25+ modelsVarious--Chat completions
LlamaGate15+ modelsVarious--Chat, vision, embeddings
MiniMaxminimax/abab7-chat-preview---Chat completions
Novita80+ modelsVariousVariousVariousChat, vision, embeddings
OpenRouteropenrouter/google/gemini-3-flash-preview---Chat completions
Together AIMultiple modelsVariousVariousVariousResponse schema support
Vertex AIvertex_ai/zai-glm-4.7---GLM 4.7 support

Features​

  • Gemini
    • Add image tokens in chat completion - PR #18327
    • Add usage object in image generation - PR #18328
    • Add thought signature support via tool call id - PR #18374
    • Add thought signature for non tool call requests - PR #18581
    • Preserve system instructions - PR #18585
    • Fix Gemini 3 images in tool response - PR #18190
    • Support snake_case for google_search tool parameters - PR #18451
    • Google GenAI adapter inline data support - PR #18477
    • Add deprecation_date for discontinued Google models - PR #18550
  • Vertex AI
    • Add centralized get_vertex_base_url() helper for global location support - PR #18410
    • Convert image URLs to base64 for Vertex AI Anthropic - PR #18497
    • Separate Tool objects for each tool type per API spec - PR #18514
    • Add thought_signatures to VertexGeminiConfig - PR #18853
    • Add support for Vertex AI API keys - PR #18806
    • Add zai glm-4.7 model support - PR #18782
  • Azure
    • Add Azure gpt-image-1.5 pricing to cost map - PR #18347
    • Add azure/gpt-5.2-chat model - PR #18361
    • Add support for image generation via Azure AD token - PR #18413
    • Add logprobs support for Azure OpenAI GPT-5.2 model - PR #18856
    • Add Azure BFL Flux 2 models for image generation and editing - PR #18764, PR #18766
  • Bedrock
    • Add Bedrock Kimi K2 model support - PR #18797
    • Add support for model id in bedrock passthrough - PR #18800
    • Fix Nova model detection for Bedrock provider - PR #18250
    • Ensure toolUse.input is always a dict when converting from OpenAI format - PR #18414
  • Databricks
    • Add enhanced authentication, security features, and custom user-agent support - PR #18349
  • MiniMax
    • Add MiniMax chat completion support - PR #18380
    • Add Anthropic native endpoint support for MiniMax - PR #18377
    • Add support for MiniMax TTS - PR #18334
    • Add MiniMax provider support to UI dashboard - PR #18496
  • Together AI
    • Add supports_response_schema to all supported Together AI models - PR #18368
  • OpenRouter
    • Add OpenRouter embeddings API support - PR #18391
  • Anthropic
    • Pass server_tool_use and tool_search_tool_result blocks - PR #18770
    • Add Anthropic cache control option to image tool call results - PR #18674
  • Ollama
    • Add dimensions for ollama embedding - PR #18536
    • Extract pure base64 data from data URLs for Ollama - PR #18465
  • Watsonx
    • Add Watsonx fields support - PR #18569
    • Fix Watsonx Audio Transcription - filter model field - PR #18810
  • SAP
    • Add SAP creds for list in proxy UI - PR #18375
    • Pass through extra params from allowed_openai_params - PR #18432
    • Add client header for SAP AI Core Tracking - PR #18714
  • Fireworks AI
  • ZAI
    • Add GLM-4.7 model with reasoning support - PR #18476
  • Codestral
    • Correctly route codestral chat and FIM endpoints - PR #18467
  • Azure AI
    • Fix authentication errors at messages API via azure_ai - PR #18500

New Provider Support​

Bug Fixes​

  • Gemini
    • Properly catch context window exceeded errors - PR #18283
    • Remove prompt caching headers as support has been removed - PR #18579
    • Fix generate content request with audio file id - PR #18745
    • Fix google_genai streaming adapter provider handling - PR #18845
  • Groq
    • Remove deprecated Groq models and update model registry - PR #18062
  • Vertex AI
    • Handle unsupported region for Vertex AI count tokens endpoint - PR #18665
  • General
    • Fix request body for image embedding request - PR #18336
    • Fix lost tool_calls when streaming has both text and tool_calls - PR #18316
    • Add all resolution for gpt-image-1.5 - PR #18586
    • Fix gpt-image-1 cost calculation using token-based pricing - PR #17906
    • Fix response_format leaking into extra_body - PR #18859
    • Align max_tokens with max_output_tokens for consistency - PR #18820

LLM API Endpoints​

Features​

Bugs​

  • General
    • Handle empty error objects in response conversion - PR #18493
    • Preserve client error status codes in streaming mode - PR #18698
    • Return json error response instead of SSE format for initial streaming errors - PR #18757
    • Fix auth header for custom api base in generateContent request - PR #18637
    • Tool content should be string for Deepinfra - PR #18739
    • Fix incomplete usage in response object passed - PR #18799
    • Unify model names to provider-defined names - PR #18573

Management Endpoints / UI​

Features​

  • SSO Configuration
    • Add SSO Role Mapping feature - PR #18090
    • Add SSO Settings Page - PR #18600
    • Allow adding role mappings for SSO - PR #18593
    • SSO Settings Page Add Role Mappings - PR #18677
    • SSO Settings Loading State + Deprecate Previous SSO Flow - PR #18617
  • Virtual Keys
    • Allow deleting key expiry - PR #18278
    • Add optional query param "expand" to /key/list - PR #18502
    • Key Table Loading Skeleton - PR #18527
    • Allow column resizing on Keys Table - PR #18424
    • Virtual Keys Table Loading State Between Pages - PR #18619
    • Key and Team Router Setting - PR #18790
    • Allow router_settings on Keys and Teams - PR #18675
    • Use timedelta to calculate key expiry on generate - PR #18666
  • Models + Endpoints
    • Add Model Clearer Flow For Team Admins - PR #18532
    • Model Page Loading State - PR #18574
    • Model Page Model Provider Select Performance - PR #18425
    • Model Page Sorting Sorts Entire Set - PR #18420
    • Refactor Model Hub Page - PR #18568
    • Add request provider form on UI - PR #18704
  • Organizations & Teams
    • Allow Organization Admins to See Organization Tab - PR #18400
    • Resolve Organization Alias on Team Table - PR #18401
    • Resolve Team Alias in Organization Info View - PR #18404
    • Allow Organization Admins to View Their Organization Info - PR #18417
    • Allow editing team_member_budget_duration in /team/update - PR #18735
    • Reusable Duration Select + Team Update Member Budget Duration - PR #18736
  • Usage & Spend
    • Add Error Code Filtering on Spend Logs - PR #18359
    • Add Error Code Filtering on UI - PR #18366
    • Usage Page User Max Budget fix - PR #18555
    • Add endpoint to Daily Activity Tables - PR #18729
    • Endpoint Activity in Usage - PR #18798
  • Cost Estimator
    • Add Cost Estimator for AI Gateway - PR #18643
    • Add view for estimating costs across requests - PR #18645
    • Allow selecting many models for cost estimator - PR #18653
  • CloudZero
    • Improve Create and Delete Path for CloudZero - PR #18263
    • Add CloudZero UI Docs - PR #18350
  • Playground
    • Add MCP test support to completions on Playground - PR #18440
    • Add selectable MCP servers to the playground - PR #18578
    • Add custom proxy base URL support to Playground - PR #18661
  • General UI
    • UI styling improvements and fixes - PR #18310
    • Add reusable "New" badge component for feature highlights - PR #18537
    • Hide New Badges - PR #18547
    • Change Budget page to Have Tabs - PR #18576
    • Clicking on Logo Directs to Correct URL - PR #18575
    • Add UI support for configuring meta URLs - PR #18580
    • Expire Previous UI Session Tokens on Login - PR #18557
    • Add license endpoint - PR #18311
    • Router Fields Endpoint + React Query for Router Fields - PR #18880

Bugs​

  • UI Fixes
    • Fix Key Creation MCP Settings Submit Form Unintentionally - PR #18355
    • Fix UI Disappears in Development Environments - PR #18399
    • Fix Disable Admin UI Flag - PR #18397
    • Remove Model Analytics From Model Page - PR #18552
    • Useful Links Remove Modal on Adding Links - PR #18602
    • SSO Edit Modal Clear Role Mapping Values on Provider Change - PR #18680
    • UI Login Case Sensitivity fix - PR #18877
  • API Fixes
    • Fix User Invite & Key Generation Email Notification Logic - PR #18524
    • Normalize Proxy Config Callback - PR #18775
    • Return empty data array instead of 500 when no models configured - PR #18556
    • Enforce org level max budget - PR #18813

AI Integrations​

New Integrations (4 new integrations)​

IntegrationTypeDescription
FocusLoggingFocus export support for observability - PR #18802
SigNozLoggingSigNoz integration for observability - PR #18726
QualifireGuardrailsQualifire guardrails and eval webhook - PR #18594
Levo AIGuardrailsLevo AI integration for security - PR #18529

Logging​

Guardrails​


Spend Tracking, Budgets and Rate Limiting​

  • Platform Fee / Margins - Add support for Platform Fee / Margins - PR #18427
  • Negative Budget Validation - Add validation for negative budget - PR #18583
  • Cost Calculation Fixes
    • Correct cost calculation when reasoning_tokens are without text_tokens - PR #18607
    • Fix background cost tracking tests - PR #18588
  • Tag Routing - Support toggling tag matching between ANY and ALL - PR #18776

MCP Gateway​

  • MCP Global Mode - Add MCP global mode - PR #18639
  • MCP Server Visibility - Add configurable MCP server visibility - PR #18681
  • MCP Registry - Add MCP registry - PR #18850
  • MCP Stdio Header - Support MCP stdio header env overrides - PR #18324
  • Parallel Tool Fetching - Parallelize tool fetching from multiple MCP servers - PR #18627
  • Optimize MCP Server Listing - Separate health checks for optimized listing - PR #18530
  • Auth Improvements
    • Require auth for MCP connection test endpoint - PR #18290
    • Fix MCP gateway OAuth2 auth issues and ClosedResourceError - PR #18281
  • Bug Fixes
    • Fix MCP server health status reporting - PR #18443
    • Fix OpenAPI to MCP tool conversion - PR #18597
    • Remove exec() usage and handle invalid OpenAPI parameter names for security - PR #18480
    • Fix MCP error when using multiple servers simultaneously - PR #18855
  • Migrate MCP Fetching Logic to React Query - PR #18352

Performance / Loadbalancing / Reliability improvements​

  • 92.7% Faster Provider Config Lookup - LiteLLM now stresses LLM providers 2.5x more - PR #18867
  • Lazy Loading Improvements
    • Consolidate lazy import handlers with registry pattern - PR #18389
    • Complete lazy loading migration for all 180+ LLM config classes - PR #18392
    • Lazy load additional components (types, callbacks, utilities) - PR #18396
    • Add lazy loading for get_llm_provider - PR #18591
    • Lazy-load heavy audio library and loggers - PR #18592
    • Lazy load 9 heavy imports in litellm/utils.py - PR #18595
    • Lazy load heavy imports to improve import time and memory usage - PR #18610
    • Implement lazy loading for provider configs, model info classes, streaming handlers - PR #18611
    • Lazy load 15 additional imports - PR #18613
    • Lazy load 15+ unused imports - PR #18616
    • Lazy load DatadogLLMObsInitParams - PR #18658
    • Migrate utils.py lazy imports to registry pattern - PR #18657
    • Lazy load get_llm_provider and remove_index_from_tool_calls - PR #18608
  • Router Improvements
    • Validate routing_strategy at startup to fail fast with helpful error - PR #18624
    • Correct num_retries tracking in retry logic - PR #18712
    • Improve error messages and validation for wildcard routing with multiple credentials - PR #18629
  • Memory Improvements
    • Add memory pattern detection test and fix bad memory patterns - PR #18589
    • Add unbounded data structure detection to memory test - PR #18590
    • Add memory leak detection tests with CI integration - PR #18881
  • Database
    • Add idx on LOWER(user_email) for faster duplicate email checks - PR #18828
    • Proactive RDS IAM token refresh to prevent 15-min connection failed - PR #18795
    • Clarify database_connection_pool_limit applies per worker - PR #18780
    • Make base_connection_pool_limit default value the same - PR #18721
  • Docker
    • Add libsndfile to database Docker image for audio processing - PR #18612
    • Add line_profiler support for performance analysis and fix Windows CRLF issues - PR #18773
  • Helm
    • Add lifecycle support to Helm charts - PR #18517
  • Authentication
    • Add Kubernetes ServiceAccount JWT authentication support - PR #18055
    • Use async anthropic client to prevent event loop blocking - PR #18435
  • Logging Worker
    • Handle event loop changes in multiprocessing - PR #18423
  • Security
    • Prevent expired key plaintext leak in error response - PR #18860
    • Mask extra header secrets in model info - PR #18822
    • Prevent duplicate User-Agent tags in request_tags - PR #18723
    • Properly use litellm api keys - PR #18832
  • Misc
    • Remove double imports in main.py - PR #18406
    • Add LITELLM_DISABLE_LAZY_LOADING env var to fix VCR cassette creation issue - PR #18725
    • Add xiaomi_mimo to LlmProviders enum to fix router support - PR #18819
    • Allow installation with current grpcio on old Python - PR #18473
    • Add Custom CA certificates to boto3 clients - PR #18852
    • Fix bedrock_cache, metadata and max_model_budget - PR #18872
    • Fix LiteLLM SDK embedding headers missing field - PR #18844
    • Put automatic reasoning summary inclusion behind feat flag - PR #18688
    • turn_off_message_logging Does Not Redact Request Messages in proxy_server_request Field - PR #18897

Documentation Updates​

  • Provider Documentation
    • Update MiniMax docs to be in proper format - PR #18403
    • Add docs for 5 AI providers - PR #18388
    • Fix gpt-5-mini reasoning_effort supported values - PR #18346
    • Fix PDF documentation inconsistency in Anthropic page - PR #18816
    • Update OpenRouter docs to include embedding support - PR #18874
    • Add LITELLM_REASONING_AUTO_SUMMARY in doc - PR #18705
  • MCP Documentation
  • Guardrails Documentation
  • Infrastructure Documentation
    • IAM Roles Anywhere docs - PR #18559
    • Fix formatting in proxy configs documentation - PR #18498
    • Fix GCS cache docs missing for proxy mode - PR #13328
    • Fix how to execute cloudzero sql - PR #18841
  • General
    • LiteLLM adopters section - PR #18605
    • Remove redundant comments about setting litellm.callbacks - PR #18711
    • Update header to be markdown bold by removing space - PR #18846
    • Manus docs - new provider - PR #18817

New Contributors​

  • @prasadkona made their first contribution in PR #18349
  • @lucasrothman made their first contribution in PR #18283
  • @aggeentik made their first contribution in PR #18317
  • @mihidumh made their first contribution in PR #18361
  • @Prazeina made their first contribution in PR #18498
  • @systec-dk made their first contribution in PR #18500
  • @xuan07t2 made their first contribution in PR #18514
  • @RensDimmendaal made their first contribution in PR #18190
  • @yurekami made their first contribution in PR #18483
  • @agertz7 made their first contribution in PR #18556
  • @yudelevi made their first contribution in PR #18550
  • @smallp made their first contribution in PR #18536
  • @kevinpauer made their first contribution in PR #18569
  • @cansakiroglu made their first contribution in PR #18517
  • @dee-walia20 made their first contribution in PR #18432
  • @luxinfeng made their first contribution in PR #18477
  • @cantalupo555 made their first contribution in PR #18476
  • @andersk made their first contribution in PR #18473
  • @majiayu000 made their first contribution in PR #18467
  • @amangupta-20 made their first contribution in PR #18529
  • @hamzaq453 made their first contribution in PR #18480
  • @ktsaou made their first contribution in PR #18627
  • @FlibbertyGibbitz made their first contribution in PR #18624
  • @drorIvry made their first contribution in PR #18594
  • @urainshah made their first contribution in PR #18524
  • @mangabits made their first contribution in PR #18279
  • @0717376 made their first contribution in PR #18564
  • @nmgarza5 made their first contribution in PR #17330
  • @wileykestner made their first contribution in PR #18445
  • @minijeong-log made their first contribution in PR #14440
  • @Isaac4real made their first contribution in PR #18710
  • @marukaz made their first contribution in PR #18711
  • @rohitravirane made their first contribution in PR #18712
  • @lizzzcai made their first contribution in PR #18714
  • @hkd987 made their first contribution in PR #18673
  • @Mr-Pepe made their first contribution in PR #18674
  • @gkarthi-signoz made their first contribution in PR #18726
  • @Tianduo16 made their first contribution in PR #18723
  • @wilsonjr made their first contribution in PR #18721
  • @abliteration-ai made their first contribution in PR #18678
  • @danialkhan02 made their first contribution in PR #18770
  • @ihower made their first contribution in PR #18409
  • @elkkhan made their first contribution in PR #18391
  • @runixer made their first contribution in PR #18435
  • @choby-shun made their first contribution in PR #18776
  • @jutaz made their first contribution in PR #18853
  • @sjmatta made their first contribution in PR #18250
  • @andres-ortizl made their first contribution in PR #18856
  • @gauthiermartin made their first contribution in PR #18844
  • @mel2oo made their first contribution in PR #18845
  • @DominikHallab made their first contribution in PR #18846
  • @ji-chuan-che made their first contribution in PR #18540
  • @raghav-stripe made their first contribution in PR #18858
  • @akraines made their first contribution in PR #18629
  • @otaviofbrito made their first contribution in PR #18665
  • @chetanchoudhary-sumo made their first contribution in PR #18587
  • @pascalwhoop made their first contribution in PR #13328
  • @orgersh92 made their first contribution in PR #18652
  • @DevajMody made their first contribution in PR #18497
  • @matt-greathouse made their first contribution in PR #18247
  • @emerzon made their first contribution in PR #18290
  • @Eric84626 made their first contribution in PR #18281
  • @LukasdeBoer made their first contribution in PR #18055
  • @LingXuanYin made their first contribution in PR #18513
  • @krisxia0506 made their first contribution in PR #18698
  • @LouisShark made their first contribution in PR #18414

Full Changelog​

View complete changelog on GitHub