LibreChat

mirror of https://github.com/danny-avila/LibreChat.git synced 2026-07-01 20:01:35 +00:00

History

Danny Avila 0789a04d11 Some checks are pending Docker Dev Branch Images Build / build (Dockerfile, lc-dev, node) (push) Waiting to run Details Docker Dev Branch Images Build / build (Dockerfile.multi, lc-dev-api, api-build) (push) Waiting to run Details GitNexus Index / index (push) Waiting to run Details GitNexus Index / post-index (push) Blocked by required conditions Details 🪟 feat: Faithful Over-Window Context Estimate via Prune Mirror and Overhead Reserve (#13959 ) * ✨ feat: Mirror send-path pruning in the over-window context estimate For a snapshot-less branch whose tokens exceed the window, the send path prunes oldest-first (getMessagesWithinTokenLimit), so the next call can sit well under the window. The gauge previously clamped the full sum to 100%, hiding that headroom. Add prunedBranchTokens — a newest->oldest walk that keeps messages until the next would overflow the message budget (max minus the summary baseline), mirroring the pruner — and use it on the estimate path in place of the clamp. Approximation: omits the instruction/tool overhead and tool-call pairing the real pruner accounts for (unknowable for a snapshot-less branch); superseded by an exact snapshot once the branch is generated. * ✨ feat: Reserve cached instruction/tool overhead in the snapshot-less estimate The over-window prune mirror and the gauge couldn't account for the fixed instruction + tool-schema overhead the next call always sends, because a snapshot-less branch has no breakdown. The backend already emits that overhead in the ON_CONTEXT_USAGE breakdown, so cache it per agent/model (keyed endpoint::model::agentId, already inclusive of tool schemas) from the live usage events, then reserve it from the prune budget and add it to used so the estimate is consistent with snapshots. Falls back to message-only until the agent has run once this session. Surfaced as a System row in the estimate breakdown. * 🩹 fix: Address Codex review on the over-window estimate - Key the overhead cache by agentId when present. useTokenLimits resolves an agent to its real provider/model, so the reader keyed `provider::model::agent` while the writer stored `agents::::agent` — a cache miss for the main agents case. Both sides now resolve to `agent:<id>` (non-agent configs: endpoint:model). - Skip the overhead reserve when a summary baseline exists: computeSummaryUsedTokens already folds instruction/tool overhead into that marker, so adding it again double-counted on summarized branches. - Collapse the breakdown's input/output/estimated rows into one pruned Messages row when over-window pruning ran, so the popover matches the gauge instead of summing to the discarded pre-prune history.		2026-06-25 17:12:53 -04:00
..
public	🎨 chore: Update Agent Tool with new SVG assets (#12065 )	2026-03-04 09:28:19 -05:00
scripts
src	🪟 feat: Faithful Over-Window Context Estimate via Prune Mirror and Overhead Reserve (#13959 )	2026-06-25 17:12:53 -04:00
sw	🛟 fix: Auto-Recover from Stale Service Worker Assets After Deploys (#13686 )	2026-06-11 11:57:06 -04:00
test	🧑‍🎨 refactor: Prompts/Sidebar styles for improved UI Consistency (#12426 )	2026-04-09 00:02:31 -04:00
babel.config.cjs	🧑‍🎨 refactor: Prompts/Sidebar styles for improved UI Consistency (#12426 )	2026-04-09 00:02:31 -04:00
check_updates.sh
index.html	🛟 fix: Auto-Recover from Stale Service Worker Assets After Deploys (#13686 )	2026-06-11 11:57:06 -04:00
jest.config.cjs	⬆️ chore: Migrate off deprecated @ariakit/react-core to @ariakit/react-components (#13940 )	2026-06-24 23:13:57 -04:00
jest.resolver.cjs	⬆️ chore: Migrate off deprecated @ariakit/react-core to @ariakit/react-components (#13940 )	2026-06-24 23:13:57 -04:00
nginx.conf	🐳 feat: Bundle Admin Panel in Docker Compose Stacks (#13876 )	2026-06-22 16:59:08 -04:00
package.json	⬆️ chore: Migrate off deprecated @ariakit/react-core to @ariakit/react-components (#13940 )	2026-06-24 23:13:57 -04:00
postcss.config.cjs
tailwind.config.cjs	⌨️ feat: Keyboard Shortcuts (#12425 )	2026-06-22 17:02:46 -04:00
tsconfig.json	👷 ci: Type-check the Client Workspace (#13560 )	2026-06-06 18:40:31 -04:00
vite.config.ts	🛟 fix: Auto-Recover from Stale Service Worker Assets After Deploys (#13686 )	2026-06-11 11:57:06 -04:00