LibreChat

mirror of https://github.com/danny-avila/LibreChat.git synced 2026-06-25 17:06:24 +00:00

History

Danny Avila 06a6c42435 ♊ feat: Model-Aware Max Output Tokens for Google/Gemini (#13390 ) * 📤 feat: Model-Aware Max Output Tokens for Google/Gemini Resolves #13384. Current Gemini text models (2.5 and 3+, including Gemini 3.5 Flash) support 64K output tokens, but LibreChat defaulted every Google model to the legacy 8K value — most visibly in the Agents model-parameter panel. - Add model-aware `reset`/`set` to `googleSettings.maxOutputTokens`, mirroring the Anthropic pattern: Gemini 2.5/3+ -> 65536, legacy (2.0 and earlier) and Gemma -> 8192. - Resolve the default server-side in `getGoogleConfig` and in the Agents, preset, and standard Google settings panels via a shared `applyModelAwareDefaults` helper. - Make `compactGoogleSchema` and `generateGoogleSchema` model-aware so explicit user values are preserved and not overwritten. * 🛡️ fix: Cap Google max output at Vertex-safe limits Addresses Codex review (P1) on #13390. Vertex AI caps current Gemini text models at 65,535 output tokens (vs 65,536 on AI Studio) and image models at 32,768, so an unconditional 65,536 default could make otherwise-default Vertex requests fail validation. - Lower the modern text default/ceiling to 65535 (valid on both Vertex and AI Studio). - Resolve Gemini image models (e.g. gemini-2.5-flash-image) to 32768. - Add reset/set + getGoogleConfig tests for image models and the Vertex default path. * 🧮 fix: Respect configured Google defaults and legacy image caps Addresses Codex review round 2 on #13390 (one P2, two P3). - P2 (llm.ts): apply the model-aware maxOutputTokens default as the final fallback instead of pre-filling it, so an explicit value, `defaultParams`, and `addParams` all take precedence and `dropParams` is honored. Empty-string values stay stripped (preserves prior Gemini empty-payload handling). - P3 (panels): pass the resolved params endpoint (`overriddenEndpointKey`) to `applyModelAwareDefaults`, so custom endpoints with `defaultParamsEndpoint: 'google'` also surface the model-aware default. - P3 (schemas): nest the image-model check inside the 2.5+/3+ version check, so legacy image IDs (e.g. gemini-2.0-flash-preview-image-generation) keep the 8K cap instead of being treated as 32K models. - Add tests for defaultParams precedence, dropParams, legacy image models, and the Vertex default path. * 🧭 fix: Base Google defaults on final model and configured overrides Addresses Codex review round 3 on #13390 (two P2). - llm.ts: resolve the model-aware maxOutputTokens default from the final `llmConfig.model` (after defaultParams/addParams) instead of the model captured from modelOptions, so a model forced via addParams/paramDefinitions on a Google-compatible custom endpoint gets its correct limit. - Panels: apply model-aware defaults to the built-in settings first, then overlay `customParams.paramDefinitions`, so an admin-configured maxOutputTokens default wins in the UI (consistent with backend precedence). - Add parameterSettings.spec for applyModelAwareDefaults (incl. override precedence) and a getGoogleConfig final-model test.		2026-05-29 08:09:32 -07:00
..
Agents	🧪 ci: Stabilize Virtualized Agent Grid Tests (#13214 )	2026-05-20 14:41:36 -04:00
Artifacts	🪡 fix: Artifact Edit Saves (#13358 )	2026-05-27 22:03:42 -07:00
Audio	♿️ fix: Accessibility, UI consistency, dialog & avatar refactors (#9975 )	2025-10-07 14:12:49 -04:00
Auth	🪃 fix: Prevent Recursive Login Redirect Loop (#11964 )	2026-02-26 16:10:14 -05:00
Banners	🛡️ fix: Sanitize HTML In Admin Banner And MCP Config Dialog (#12927 )	2026-05-04 11:17:44 +09:00
Bookmarks	🪄 refactor: UI Polish and Admin Dialog Unification (#11108 )	2025-12-28 11:01:25 -05:00
Chat	🛡️ fix: Harden Model Spec Icon Rendering (#13356 )	2026-05-27 22:01:58 -07:00
Conversations	⏱️ refactor: Optimistically Show New Chats In Sidebar (#13298 )	2026-05-24 20:03:46 -04:00
Endpoints	♊ feat: Model-Aware Max Output Tokens for Google/Gemini (#13390 )	2026-05-29 08:09:32 -07:00
Files	🔇 fix: Hide Button Icons from Screen Readers (#10776 )	2025-12-11 16:35:17 -05:00
Input	🪙 feat: Add AWS Bedrock API key support (#8690 )	2026-05-23 12:41:38 -04:00
MCP	🔏 fix: Prevent Browser Autofill From Silently Dropping MCP CustomUserVars on Save (#12770 )	2026-04-22 08:35:14 -07:00
MCPUIResource	🪝 fix: Safe Hook Fallbacks for Tool-Call Components in Search Route (#12423 )	2026-03-26 16:40:37 -04:00
Messages	🧯 fix: Harden Data Retention Semantics (#13049 )	2026-05-19 21:58:42 -04:00
Nav	🆔 feat: Built-in Build Metadata for Support Triage (#12756 )	2026-05-23 09:41:13 -04:00
OAuth	🪟 feat: DataTable update + Various UI enhancements (#9698 )	2025-12-11 16:39:06 -05:00
Plugins/Store	💎 fix: Gemini Image Gen Tool Vertex AI Auth and File Storage (#11923 )	2026-02-24 08:21:02 -05:00
Prompts	📜 feat: Skills UI + Initial E2E CRUD / Sharing (#12580 )	2026-04-25 04:02:00 -04:00
Share	🧬 refactor: Derive Latest Message From Cache (#13294 )	2026-05-24 16:20:27 -04:00
SharePoint	🎨 style: Theming in SharePointPickerDialog, PrincipalAvatar, and PeoplePickerSearchItem	2025-08-13 16:24:23 -04:00
Sharing	📜 feat: Skills UI + Initial E2E CRUD / Sharing (#12580 )	2026-04-25 04:02:00 -04:00
SidePanel	♊ feat: Model-Aware Max Output Tokens for Google/Gemini (#13390 )	2026-05-29 08:09:32 -07:00
Skills	📜 feat: Add Explicit new Skill Route from Agent Builder (#13119 )	2026-05-23 08:58:15 -04:00
System	☕ feat: Prevent Screen Sleep During Response Generation (#10597 )	2025-11-21 09:14:32 -05:00
Tools	♻️ refactor: On-demand MCP connections: remove proactive reconnect, default to available (#11839 )	2026-02-17 22:33:57 -05:00
ui	🔨 fix: Custom Role Permissions (#12528 )	2026-04-03 13:24:11 -04:00
UnifiedSidebar	🗂️ feat: Sidebar Icon Toggle & New Chat History Switch (#12642 )	2026-04-13 09:46:38 -04:00
Web	🧭 fix: Navigate Signed CDN Downloads (#12998 )	2026-05-07 13:36:57 -04:00
index.ts	🔧 refactor: Organize Sharing/Agent Components and Improve Type Safety	2025-08-13 16:24:20 -04:00