codeburn

mirror of https://github.com/AgentSeal/codeburn.git synced 2026-05-20 17:47:19 +00:00

Author	SHA1	Message	Date
ozymandiashh	5a837c94e9	Track OpenCode child sessions (#343 ) Some checks are pending CI / semgrep (push) Waiting to run Details	2026-05-18 05:51:08 -07:00
ozymandiashh	2013ecbfd9	Track agent calls across providers (#340 )	2026-05-18 05:51:01 -07:00
Resham Joshi	303c9458cb	Fix OpenCode/Goose returning 0 sessions on fresh install (#347 ) * Add CodeBurn Pro Mac App Store app SwiftUI MenuBarExtra with litellm-snapshot pricing, Claude/Codex/Copilot parsers, session discovery, auto-refresh timer, and dashboard UI matching the real menubar design. * Add appstore/ to .gitignore Private Mac App Store build, not for the public repo. * Fix OpenCode/Goose returning 0 sessions on fresh install SQLite-based providers use compound paths (db.path:sessionId) which caused fingerprintFile to fail stat() and silently skip all sessions. Fall back to stat on the DB file when the full compound path fails. Fixes #346	2026-05-18 05:45:20 -07:00
iamtoruk	1317af2acf	Bump to 0.9.10, fix per-provider chart history for today period Some checks are pending CI / semgrep (push) Waiting to run Details The fallback daily history path (used when rangeStartStr = todayStr) was missing cache-based historical data, showing only today's bar in the trend chart. Now includes allCacheDays filtered by provider.	2026-05-17 14:06:13 -07:00
Resham Joshi	58ccf84f02	Fix menubar per-provider performance (24s → 2s) and session cache safety (#344 ) Some checks are pending CI / semgrep (push) Waiting to run Details Per-provider menubar calls now use loadDailyCache() instead of hydrateCache(), splitting history into cache-based and fallback paths. Fixes session cache wipe when scanProjectDirs/parseProviderSources receive empty dirs. Strips _dirty flag before session cache serialization to prevent unnecessary 132MB rewrites. Removes dead keychain code from both credential stores.	2026-05-17 13:54:38 -07:00
iamtoruk	b0131f698c	Store credential cache in file instead of keychain, use cache for per-provider menubar Credential cache: switched from keychain to file-based storage under Application Support. Ad-hoc signed builds invalidate keychain ACLs on every rebuild, causing repeated macOS password prompts. Existing keychain entries are migrated to file on first read, then deleted. Per-provider menubar: the Codex/Claude/etc tabs previously re-parsed all sessions from scratch (22s). Now parses only today with the provider filter and uses the daily cache for historical days, matching the fast path the All tab already uses. Daily cache bumped to v7 to force a clean rebuild after pricing and provider changes since v6.	2026-05-17 07:18:58 -07:00
iamtoruk	dcbf6dcfbf	Merge pull request #300 : Track multiple provider plans Some checks are pending CI / semgrep (push) Waiting to run Details # Conflicts: # CHANGELOG.md # src/main.ts	2026-05-16 10:49:05 -07:00
iamtoruk	257e8924dd	Merge main into feat/multi-provider-plans, remove dead tautology	2026-05-16 10:42:03 -07:00
iamtoruk	06c90d0293	Merge main into feat/codebuff-provider to resolve conflicts	2026-05-16 08:02:27 -07:00
iamtoruk	adfbd0bd04	Merge main into feat/mistral-vibe-provider, add malformed input tests	2026-05-16 07:35:22 -07:00
iamtoruk	5696a74571	Merge main into feat/kimi-code-provider to resolve conflicts	2026-05-16 07:25:12 -07:00
iamtoruk	59a4d95b18	Merge main into feat/cline-provider to resolve conflicts	2026-05-16 05:58:10 -07:00
Resham Joshi	c56f7ea7db	Merge pull request #324 from ozymandiashh/fix/antigravity-windows-discovery Fix Antigravity Windows discovery	2026-05-16 05:16:05 -07:00
iamtoruk	bd41fa3962	Add persistent disk cache for parsed session data Some checks are pending CI / semgrep (push) Waiting to run Details Cache normalized turns/calls to ~/.cache/codeburn/session-cache.json so the CLI skips re-parsing unchanged JSONL files on subsequent runs. File reconciliation uses dev+ino+mtime+size fingerprinting; cost, classification, and summaries are recomputed at query time. Atomic writes via temp+fsync+rename, deep structural validation on load, per-provider env fingerprinting, and best-effort save so cache failures never break the CLI. ~6x speedup on warm cache.	2026-05-16 01:04:13 -07:00
iamtoruk	2fb078bdfb	Fix V8 OOM crash on 30-day period with Buffer-based line reader and large-line parser Three-layer fix for V8 heap exhaustion when parsing heavy session data: 1. Buffer-based readSessionLines (fs-utils.ts): Replace readline with raw Buffer streaming using Buffer.indexOf(0x0a). Eliminates ConsString trees that caused OOM when regex-flattening 100MB+ lines. Two-state machine (ACCUMULATING/SCANNING) skips old lines at ~2KB cost instead of 200MB. 2. Large-line streaming parser (parser.ts): Hand-written JSON scanner for lines >32KB extracts only cost/token/tool fields without JSON.parse, avoiding full object graph allocation. Dual string/Buffer paths. 3. Dashboard memory management (dashboard.tsx): Disable auto-refresh for heavy periods (30d/month/all), clear old dataset before reload via nextTick to allow GC, prevent overlapping reloads with mutex, lazy optimize scanning on keypress instead of useEffect. Also fixes three race conditions in dashboard reload deduplication: - Early return after nextTick bypassing finally block (permanent mutex lock) - A->B->A period switching dropping final reload (stale pending) - Stale pendingReloadRef not cleared when in-flight matches request	2026-05-15 23:15:26 -07:00
iamtoruk	57e48e6e55	Stop eager daily-cache hydration on CLI commands that never consume it Only `status --format menubar-json` uses getDaysInRange from the hydrated cache. The other 9 call sites (report, today, month, export, optimize, compare, models, yield) parse their own date ranges directly via parseAllSessions. Removing hydrateCache from these paths avoids a 365-day backfill parse that was the primary OOM multiplier on large session dirs.	2026-05-15 14:23:20 -07:00
iamtoruk	a7bb780618	Reduce Claude parser OOM risk via entry compaction (0.9.9) Strip heavy fields from JournalEntry immediately after JSON.parse in the JSONL hot loop. Keeps only what downstream consumers need: type, timestamp, sessionId, cwd, compacted user text (2000 char total cap), assistant model/usage/id, tool_use names with Skill and Bash inputs, and MCP inventory attachments. Text, thinking, and tool_result blocks are dropped. Also removes redundant hydrateCache() from status --format json and terminal status paths, and clears the session cache between period parses to avoid pinning both today and month result sets. This is a mitigation, not a full fix. Very large month ranges still materialize full ProjectSummary.turns arrays. The real fix is the streaming single-pass parser refactor.	2026-05-15 13:20:50 -07:00
iamtoruk	d342f2205b	Fix menubar all-provider refresh OOM	2026-05-14 21:57:51 -07:00
iamtoruk	909efcf989	Harden menubar refresh and installer	2026-05-14 18:32:15 -07:00
iamtoruk	403efd4727	Merge remote-tracking branch 'origin/main' into fix/menubar-version-prefix # Conflicts: # package.json # src/parser.ts	2026-05-13 20:32:22 -07:00
ozymandiashh	929c66e7d1	Fix Antigravity Windows discovery	2026-05-13 00:51:39 +03:00
Resham Joshi	fe2e622038	Skip Cursor bubble rows that lack a createdAt timestamp (#321 ) Bubble rows without createdAt were defaulting to new Date(), which misattributed historical or undated usage to Today and inflated the daily chart. Now filtered at the SQL level and skipped in application code. Based on the bubble-side fix from #262 by @darthrevanyunka.	2026-05-11 22:16:00 -07:00
Resham Joshi	3b71650f24	Fix mangled project paths in dashboard (#320 ) * Fix mangled project paths in By Project and Top Sessions panels shortProject() decoded Claude Code slugs by splitting on '-', which broke directory names containing dashes ('foo-bar' became 'foo/bar'). Switch the dashboard to consume ProjectSummary.projectPath (the canonical cwd already extracted by parser.ts) and rewrite shortProject to operate on a real absolute path. * shortProject: cache homedir, normalize Windows backslashes, fix stale test helper --------- Co-authored-by: Abdallah Meghraoui <abdallah.meghraoui@outlook.com>	2026-05-11 22:02:38 -07:00
Resham Joshi	38e41e93c3	Add Node version guard for unsupported runtimes (#319 ) Split CLI into a tiny launcher (src/cli.ts) that checks for Node >= 22.13.0 before dynamically importing the full CLI (src/main.ts). Users on Node 18 now get a clear upgrade message instead of a cryptic regex parse error from string-width. Closes #232.	2026-05-11 21:50:17 -07:00
AgentSeal	a1b5e4bd00	Fix OpenCode MCP usage reporting (#318 ) * Fix OpenCode MCP usage reporting * Move OpenCode MCP changelog entry to Unreleased section --------- Co-authored-by: ozymandiashh <234437643+ozymandiashh@users.noreply.github.com> Co-authored-by: iamtoruk <hello@agentseal.org>	2026-05-11 21:30:27 -07:00
AgentSeal	c85beeaeae	Fix Claude 1-hour cache write pricing (#317 ) Co-authored-by: ozymandiashh <234437643+ozymandiashh@users.noreply.github.com> Co-authored-by: iamtoruk <hello@agentseal.org>	2026-05-11 21:23:04 -07:00
AgentSeal	03e22ecb80	Add IBM Bob provider with workspace extraction (#316 ) Some checks are pending CI / semgrep (push) Waiting to run Details * Add IBM Bob provider * Add workspace extraction for Cline-family providers Extract project name from workspace directory in api_conversation_history.json so sessions show actual folder names instead of the provider display name. Thread projectPath through ParsedProviderCall to avoid unsanitizePath mangling hyphenated folder names. --------- Co-authored-by: ozymandiashh <234437643+ozymandiashh@users.noreply.github.com> Co-authored-by: iamtoruk <hello@agentseal.org>	2026-05-11 20:54:13 -07:00
iamtoruk	b4b28becc8	Harden menubar refresh recovery	2026-05-11 20:44:06 -07:00
ozymandiashh	9187bc590a	Add Cline provider	2026-05-12 00:31:41 +03:00
iamtoruk	469d956312	Preserve menubar bundle seal during install	2026-05-11 11:11:37 -07:00
AgentSeal	2301577e03	Merge pull request #310 from getagentseal/fix/menubar-wake-recovery Fix menubar wake recovery and release asset selection	2026-05-11 10:58:33 -07:00
iamtoruk	1149ab6e43	Fix menubar wake recovery and release asset selection	2026-05-11 10:57:02 -07:00
ozymandiashh	78cbfd3798	Add Kimi provider	2026-05-11 19:02:28 +03:00
ozymandiashh	e2d4e565f8	Add Mistral Vibe provider	2026-05-11 16:54:28 +03:00
ozymandiashh	a04c0cbfaa	Track multiple provider plans	2026-05-11 16:33:33 +03:00
Resham Joshi	02f4635cec	Fix node:sqlite V8 crash on invalid UTF-8 in text columns (#272 ) node:sqlite calls v8::String::NewFromUtf8 with kAbort on TEXT columns. Cursor chat blobs often contain truncated multi-byte chars from streaming boundaries, which triggers a V8 CHECK abort (not a JS exception). Select all text-content columns as CAST(col AS BLOB) so node:sqlite returns Uint8Array instead. Decode in JS with TextDecoder fatal:false which replaces bad bytes with U+FFFD. Covers all three SQLite providers (Cursor, Goose, OpenCode). Removes the version blocklist (MIN_NODE_22_PATCH) and lowers engines requirement from >=22.20 to >=22 since the BLOB cast approach works on all Node 22.x versions. Closes #264 Closes #250	2026-05-10 17:05:08 -07:00
Resham Joshi	d142bd97ef	daily-cache: discard pre-v5 caches (fixes menubar providers regression) (#297 ) PR #296 (Cursor per-project breakdown) bumped DAILY_CACHE_VERSION from 4 to 5 but left MIN_SUPPORTED_VERSION at 2. The migration path (isMigratableCache + migrateDays) only fills in missing default fields; it does NOT recompute the providers / categories / models rollups from session data, because raw sessions are not retained in the cache. So a v4 cache migrated to v5 carried forward its old per-day provider totals (single 'cursor' bucket) for the full retention window. Effect on users post-#296: the macOS menubar's `current.providers.cursor` would show the orphan-bucket subtotal instead of the full Cursor cost for any historical day whose daily entry was computed before #296 landed. Live-test on my machine showed cursor=$3.78 against a migrated v4 cache vs cursor=$4.08 (correct) after the daily cache was discarded — the $0.30 gap was the workspace projects whose costs were no longer aggregated under the 'cursor' label by the new code. Fix: raise MIN_SUPPORTED_VERSION to 5 so any cache with version < DAILY_CACHE_VERSION is renamed to `.bak` and the cache is recomputed from scratch on next run. The recompute is the same operation that backfills the cache for a new user, so the cost is a one-time cold-path hit (~3s on the test machine). Test for the migration case updated to assert the new discard-and-bak behavior. Full suite: 46 files / 654 tests pass.	2026-05-10 16:05:59 -07:00
Resham Joshi	810b214476	Cursor: per-project breakdown by workspace (closes per-project half of #196 ) (#296 ) Cursor's chat history showed as a single row labeled 'cursor' in the dashboard because the global state.vscdb has no workspace field on individual bubbles. The fix joins through Cursor's per-workspace storage: 1. Walk ~/Library/Application Support/Cursor/User/workspaceStorage/* 2. For each hash dir, read workspace.json -> folder URI 3. Open that dir's state.vscdb, read ItemTable['composer.composerData'] -> allComposers list 4. Build Map<composerId, folder URI> 5. emit one SessionSource per workspace plus a catch-all 'cursor' source for composers that did not register against any workspace (multi-root workspaces, no-folder-open windows, deleted workspaces with surviving global rows) The parser decodes source.path's #cursor-ws= tag, filters the parsed bubbles to the composerIds that belong to this workspace, and yields only those. The orphan-tag source negates the filter so it captures every composer not in any workspace. In passing, fix a real bug in the old code: parseBubbles set `sessionId: row.conversation_id ?? 'unknown'`, but the JSON `conversationId` field is empty in current Cursor builds, so every call shipped with `sessionId: 'unknown'`. We now derive the composer id from the row key (`bubbleId:<composerId>:<bubbleUuid>`) which is what the workspace map joins on. The old behavior masked the bug because every call went into a single 'cursor' project anyway; with per-workspace bucketing the bug becomes load-bearing. Cache version bumped 2 -> 3 to invalidate caches that still record 'unknown' as the session id. Live-tested against my real 1.9 GB Cursor DB: the single 'cursor' row with 1904 calls / $4.08 now breaks into 5 workspaces plus an orphan bucket, totals reconcile exactly. 8 fixture-based tests cover multi-workspace routing, orphan filtering, legacy bare DB path backwards compat, multi-root workspace skip, vscode-remote URI slugification, and total reconciliation across all sources. Full suite: 46 files, 653 tests passing.	2026-05-10 15:35:57 -07:00
Resham Joshi	cdf7169a89	Cursor model aliases: cover every variant so non-Auto sessions price (#159 ) (#290 ) Cursor emits model names in a `claude-<dot-version>-<tier>` shape (`claude-4.6-sonnet`, `claude-4.5-opus`, `claude-4.5-opus-high-thinking`, etc.) plus its own `composer-1` house model. None of these match the canonical LiteLLM pricing keys (`claude-sonnet-4-6`, `claude-opus-4-5`). The alias map in `src/models.ts` filled some of these in v0.9.4 but missed: - plain no-suffix forms: `claude-4.5-opus`, `claude-4.5-sonnet`, `claude-4.6-opus` - haiku tier: `claude-4.5-haiku`, `claude-4.6-haiku` - forward-looking: `claude-4.7-opus` - Cursor's house model: `composer-1` The dashboard rendered $0 for sessions that used any unaliased model — visible in the screenshots posted in #159 even after the v0.9.4 fix that added the `-thinking` variants. This PR fills the gaps and adds 16 regression tests under `Cursor model variants resolve to pricing` that assert every model name in `src/providers/cursor.ts:modelDisplayNames` plus the additional plain forms resolves to a non-null pricing entry with `inputCostPerToken > 0` and `outputCostPerToken > 0`. So a future LiteLLM snapshot bump or a typo in the alias map will fail the test before users see $0. Direct hits in the snapshot (no alias needed): `gpt-5`, `gpt-5.2`, `grok-code-fast-1`, `gemini-3-pro` (already aliased). These are covered in the test suite as well so a snapshot that drops them would also be caught. Tests: 45 files, 617 passing locally (16 new). Closes #159.	2026-05-10 03:27:44 -07:00
Resham Joshi	7a878f4d19	Classifier: feature verb wins over debug keyword (part of #196 ) (#289 ) Some checks are pending CI / semgrep (push) Waiting to run Details Messages like "add error handling", "create an issue tracker", or "implement the 404 page" were landing in the Debugging bucket because the classifier checked DEBUG_KEYWORDS (which matches `error`, `issue`, `404`) before FEATURE_KEYWORDS in both `refineByKeywords` (tool-bearing turns) and `classifyConversation` (chat-only turns). The position of the matched word in the sentence is a much stronger intent signal than the order of the checks in code, so we now pick whichever pattern matches earliest. The new helper `firstMatchingCategory` runs each candidate regex once with `RegExp.exec` and keeps the match with the lowest `index`. Ties (rare in practice — same start position) break by the order the candidates were listed, which is `refactoring > feature > debugging` for coding turns. That ordering preserves existing behavior for plain bug reports (e.g. "login is broken, traceback below") while flipping mislabeled feature work to its correct category. 8 regression tests in `tests/classifier.test.ts` cover the mislabel cases from #196 plus tie-break / chat-only cases. Full suite: 45 files / 609 tests, all green. Closes the activity-misattribution half of #196. The Cursor provider attribution half (single 'cursor' project for all sessions) is addressed in a separate PR.	2026-05-09 22:48:11 -07:00
Resham Joshi	b72e51e538	Support CLAUDE_CONFIG_DIRS for scanning multiple Claude data dirs (#208 ) (#288 ) Adds an OS-delimited list env var so a user with more than one Claude account or profile can scan all of them in a single run. Sessions across every configured dir merge into one ProjectSummary per project, matching the option-1 design agreed on the issue thread (no per-account splitting in the data model or the UI). Format: `CLAUDE_CONFIG_DIRS=~/.claude-work:~/.claude-personal` on POSIX, `;`-separated on Windows. Precedence is CLAUDE_CONFIG_DIRS > CLAUDE_CONFIG_DIR > ~/.claude. Empty entries in the list are skipped, duplicates are deduped on resolved path, and a missing or unreadable dir does not abort the scan of the others. If the user explicitly set CLAUDE_CONFIG_DIRS but every listed entry is unreadable, a one-line stderr hint identifies the attempted paths and the platform's expected delimiter, so a Windows user typing the POSIX `:` does not get a silent zero-row result. `~` is now also expanded in CLAUDE_CONFIG_DIR for consistency. Implementation is intentionally narrow: only `claude.ts` changes, plus a small parser-cache key update so a stale cache from one config does not bleed into a run with a different config (matters for the macOS menubar and GNOME extension which run as long-lived processes). The merge happens for free in `src/parser.ts:scanProjectDirs`, which keys ProjectSummary entries by canonical cwd (or the sanitized slug as a fallback). Two SessionSource entries with the same `project` field land under the same key and combine their sessions, regardless of which dir they came from. No new fields on SessionSource / SessionSummary / ProjectSummary, and no UI changes. Tests: 12 fixture-based cases covering the unset path (default ~/.claude), single-dir override via CLAUDE_CONFIG_DIR, multi-dir override via CLAUDE_CONFIG_DIRS, ~ expansion, dedup of repeated entries, leading/trailing/doubled delimiters, missing dir tolerated, file-not-directory entry tolerated, empty CLAUDE_CONFIG_DIRS falls back to single-dir env, and two parser-level integration tests asserting (a) two sessions from two dirs sharing one cwd produce one ProjectSummary with combined totals and no `account`/`accountPath` fields anywhere, and (b) two sessions sharing a slug but with different canonical cwds still merge by slug at the project-rollup layer (option 1 behavior pinned so a future refactor cannot quietly swap to cwd-aware merging without an explicit opt-in). Supersedes the alternative implementation in #227, which builds per-account attribution (option 2) instead.	2026-05-09 22:04:45 -07:00
Resham Joshi	d1eb13fb91	Expose per-day one-shot data in daily JSON output (#279 ) (#280 ) * Expose per-day one-shot data in daily JSON output Closes #279. Adds turns, editTurns, oneShotTurns, oneShotRate to each entry of the `daily[]` array in `codeburn report --format json` output. The data was already computed internally for activity-level rollups; this just buckets it by date so consumers building daily-resolution efficiency dashboards (streak tracking, heatmaps, rolling-window charts) don't have to re-derive the rate from period-level activities. Counting matches parser.ts categoryBreakdown semantics: - every turn counts toward `turns` - turns with hasEdits=true count toward `editTurns` - edit turns with retries=0 count toward `oneShotTurns` - oneShotRate is null (not 0) when editTurns=0 — a chat-only day's rate is undefined, and reading it as 0% would be misleading Real consumer named in the issue: a 10-developer internal usage tracker that scores days by cache hit + cost/call + (now) one-shot rate. * Strengthen daily/activities reconciliation + CHANGELOG entry - Fall back to turn.assistantCalls[0]?.timestamp when turn.timestamp is missing so daily aggregate doesn't drop turns that activities[] keeps. Previously sum(daily[].editTurns) could be < sum(activities[].editTurns) for sessions starting with assistant entries before any user line. - Add Unreleased CHANGELOG entry for the daily one-shot fields.	2026-05-09 21:01:05 -07:00
Resham Joshi	4c29f6b880	Add Crush provider plus per-provider icon column in README (#286 ) Closes #278. Adds Charmbracelet Crush as a lazy-loaded provider: - src/providers/crush.ts: walks ~/.local/share/crush/projects.json (XDG_DATA_HOME and CRUSH_GLOBAL_DATA aware), opens each project's crush.db read-only, queries root sessions where parent_session_id IS NULL. Emits one ParsedProviderCall per session with real prompt_tokens, completion_tokens, cost (dollars), and the dominant model resolved from messages.model. - src/providers/index.ts: register crush alongside cursor, goose, opencode, antigravity, cursor-agent in the lazy import path. - tests/providers/crush.test.ts: 10 fixture-based tests covering discovery, parsing, missing-registry, malformed JSON, missing db, child session exclusion, dominant model selection, dedup, and array-shaped legacy registry. Schema source: charmbracelet/crush@v0.66.1 internal/db/migrations/20250424200609_initial.sql, verified by spawning a research agent against upstream. The schema comments in that migration claim millisecond timestamps but every actual INSERT/UPDATE uses strftime('%s', 'now') which returns Unix seconds; the parser treats values as seconds. Tokscale's parser (junhoyeo/tokscale#346) gets this wrong and is off by 1000x, plus its parser misses the prompt_tokens/completion_tokens columns that exist in Crush's schema. Our integration uses both, so Crush sessions get real per-model attribution. Menubar: - mac/Sources/CodeBurnMenubar/AppStore.swift: add .crush case to ProviderFilter and its cliArg switch. - mac/Sources/CodeBurnMenubar/Views/AgentTabStrip.swift: add Crush color to the per-tab color extension. The visibleFilters computed property already filters by detected providers, so the Crush tab appears automatically when a user has Crush data. README: - Replace the provider table with an icon-led layout. Icons live under assets/providers/<name>.<ext>. 14 icons sourced from junhoyeo/tokscale (MIT) under nominative fair use, 4 sourced separately: codex (OpenAI org avatar), cursor-agent (reuses the Cursor icon), kiro (kiro.dev favicon, ico->png via sips), omp (can1357/oh-my-pi icon.svg, MIT). Attribution line added. - Add Crush row. Docs: - docs/providers/crush.md: full per-provider doc with verified schema excerpt, the seconds-vs-milliseconds quirk, and a "when fixing a bug here" checklist. - docs/architecture.md: provider count 17 -> 18, test count 41 -> 42, and crush in the lazy list. - docs/providers/README.md: add Crush row to the lazy index. - CONTRIBUTING.md: bump test count to 568 (was 558). All 568 tests pass locally; swift build clean.	2026-05-09 20:47:56 -07:00
Resham Joshi	b4ed98cfa4	Add `codeburn models` per-model + per-task breakdown command (#287 ) A single dense table of every (provider, model) you have used in the selected period, sorted by cost. Inspired by tokscale's per-model output and ccusage's responsive cli-table3 layout, ported to plain Node with no new runtime dependency. Default view: one row per (provider, model) with a Top Task cell showing the dominant task category and its cost share, e.g. `Coding (42%)`. `--by-task` explodes each model into one row per task type, with provider/model cells blanked on subsequent rows of the same group and a horizontal divider between groups so the sections read as distinct units. Output formats: table (Unicode box-drawn, default), markdown (GitHub-flavored, copy-paste friendly), json, csv. Filters: --period (today/week/30days/month/all, default 30days), --from/--to, --provider, --task, --top, --min-cost, --no-totals. The table renderer auto-sizes every column to its content (no fixed widths leaving trailing whitespace) and drops cache columns as a pair when the terminal is narrow, then input/output, then top-task, in that order. Provider, model, total, and cost stay regardless. Visible-width math uses strip-ansi (already a dependency) so styled cells pad correctly. Cyan headers, yellow totals, dim provider name. The aggregator walks every parsed turn and attributes each assistant call to its (provider, model, task) bucket, computing real input / output / cache_write / cache_read tokens and cost. Output tokens include reasoning. Cached input tokens are folded into cache_read so the column matches what users intuitively expect. 19 fixture-based tests cover aggregation correctness, byTask grouping, taskFilter, topN/minCost filters, reasoning-as-output, all four renderers (table/markdown/json/csv), narrow-terminal column dropping, CSV/markdown escaping, totals row toggle, and visible-width math under styled cells.	2026-05-09 20:45:21 -07:00
Resham Joshi	46e43a0ec3	Label optimize suggestions by destination (#281 ) Some checks are pending CI / semgrep (push) Waiting to run Details Closes #277. Every paste-style fix now declares an explicit `destination` so users can tell at a glance whether a suggestion belongs in CLAUDE.md as a permanent rule, in a one-time session opener, in the current chat as an ask, or in a shell config file. Previously the prompts had no labeled home and users were dropping one-time session openers into CLAUDE.md as permanent rules. Type changes: - New `PasteDestination` union: `claude-md` / `session-opener` / `prompt` / `shell-config` - `WasteAction.paste` gains `destination?: PasteDestination` Renderer changes: - CLI `optimize` command (renderOptimize → renderFinding) prints a section header above each fix block: -- Suggested CLAUDE.md addition (permanent rule) ─── -- One-time session opener (do NOT add to CLAUDE.md) ─── -- Ask Claude in the current session ─── -- Add to your shell config ─── -- Run this command ─── - Interactive dashboard (FindingAction in dashboard.tsx) gets the same treatment so the in-popover findings list reads identically. Existing fixes retagged appropriately. Two existing prompts that lacked destination context altogether ("Set a delivery checkpoint at the start of the next expensive thread", "Start the next expensive thread with a fresh-context constraint") now read as one-time session openers with a clear "do not add to CLAUDE.md" hint — the exact failure mode the reporter described. Tests: - Existing `detectJunkReads` test extended to assert the destination tag. - New regression block walks every detector that emits a paste-style fix and asserts each one declares a destination — future detectors that ship without one get caught here.	2026-05-08 23:30:53 -07:00
Resham Joshi	8208cf8ff5	Quiet routine pricing warnings + menubar recovery from stuck-loading (#266 ) * Quiet routine pricing warnings + menubar recovery from stuck-loading CLI: - Default `codeburn` invocation no longer prints "no pricing data for model" warnings on every run. Greeting a fresh user with three lines of stderr before the dashboard even draws looked like the tool was broken on first launch. The warning now requires --verbose, and the suppressed pricing miss still results in $0 cost (correct for unmapped models). - Local-model heuristic skips the warning entirely for Ollama tags (`qwen3.6:35b-a3b-bf16`), GGUF/quantized fingerprints, and similar names that will never have public pricing. The "update codeburn" hint was actively misleading there. - When the warning does fire (with --verbose), it points users at `codeburn model-alias <model> <known-model>` as the actual escape hatch alongside the package update suggestion. Menubar: - Replace perpetual "Loading…" spinner with a FetchErrorOverlay when the per-key fetch fails and the cache is empty. User sees the error and a Retry button instead of an infinite hang. - Add diagnostic breadcrumbs (NSLog, invisible to normal users — Console.app / `log stream --process CodeBurnMenubar` only) for the four states that produce a stuck loading overlay: - subprocess timeout after 45s - fetch result dropped due to Task cancellation (rapid tab switch) - fetch result dropped due to mid-fetch calendar rollover - retry attempt where the last successful fetch is >2 min stale - Track lastSuccessByKey separately from cache freshness so the staleness diagnostic survives day-rollover cache wipes. * Stop flashing the compare-view loading screen on background refresh When the 30s CLI tick updated `projects` while the user was reading the model comparison results, the projects-watching effect always fired setLoadTrigger, which flipped phase to 'loading' and re-ran the slow scanSelfCorrections walk over every provider's session directory. The user lost their scroll position and saw a loading flash mid-read. Recompute the comparison rows in place when: - the user is already on the results phase, AND - both picked models still exist in the new aggregate. Skip the corrections rescan on these in-place refreshes — corrections drift slowly enough that holding the previous value until the user re-enters compare is acceptable, and the rescan is the slow part of the load. Initial selection and post-selection load still run the full pipeline.	2026-05-08 20:33:48 -07:00
iamtoruk	b777730174	Wait for old menubar process to exit before launching new one killRunningApp sent SIGTERM but returned immediately. The subsequent open call raced against the dying process, failing with error -600 on --force reinstalls. Poll up to 5 seconds for exit.	2026-05-08 13:08:34 -07:00
Resham Joshi	04aeda71b6	Refuse to load node:sqlite on known-buggy Node 22.x patch versions (#265 ) Some checks are pending CI / semgrep (push) Waiting to run Details Reported in #264 as a V8 CHECK abort with `Check failed: (location_) != nullptr` inside `node::sqlite::StatementSync::ColumnToValue`. The crash happens when SQLite returns a TEXT column whose bytes V8's String::NewFromUtf8 rejects (invalid UTF-8 — common for Cursor's stored chat text where multi-byte chars are truncated at streaming boundaries). Node 22.x prior to 22.20 does not check the resulting MaybeLocal<String> for empty before dereferencing, aborting the whole process with a trace trap. A try/catch in JS can't recover — the abort runs in the C++ extension before the V8 exception handler. So we refuse to load node:sqlite at all when we detect a buggy Node version, surface a clear "upgrade Node" diagnostic, and let the rest of the CLI run with the file-based providers (Claude, Codex, Copilot, Gemini, etc.) instead of taking the whole tool down. - engines.node bumped to >=22.20 so npm warns at install time - src/sqlite.ts: checkBuggyNodeVersion() detects Node 22.x < 22.20 and routes through the existing isSqliteAvailable() / loadError diagnostic path	2026-05-07 09:48:57 -07:00
Resham Joshi	fb8f25fb97	Reject invalid --format and --period values instead of silently falling back (#258 ) getDateRange() silently fell back to week on unknown periods, and no command validated --format. A typo like --period mounth or --format yaml produced wrong output with exit 0. Now all 6 format-accepting commands and all period-accepting paths reject unknown values with a clear message and exit 1. Also fixes the status description (said today+week+month, only shows today+month).	2026-05-06 23:03:41 -07:00
Resham Joshi	daa673449c	Menubar and CLI hardening from multi-agent audit (#257 ) Some checks are pending CI / semgrep (push) Waiting to run Details Two passes of validators across CLI accuracy, dashboard UX, menubar Swift, performance, security, and end-to-end smoke tests on real session data. Data-correctness fixes: - parseLocalDate rejects month/day overflow. JS Date silently rolled Feb 31 to Mar 3, so --from 2026-02-31 --to 2026-03-15 quietly dropped sessions on Feb 28 - Mar 2. Now throws "Invalid date" with a clear reason. Leap-day case covered (2024-02-29 valid, 2025-02-29 rejected). - CSV/JSON exports use the active currency's natural decimal places. The previous round2 helper produced ¥412.37 in CSV while the dashboard rendered ¥412 — finance teams comparing the two surfaces saw a discrepancy. New roundForActiveCurrency consults Intl.NumberFormat for the right precision (0 for JPY/KRW/CLP, 2 for USD/EUR, etc). - Copilot toolRequests is Array.isArray-guarded in both modern and legacy event branches. Previously a corrupt session with toolRequests=null or a string aborted the whole file's parse loop and silently dropped every legitimate call after it. - Codex token_count dedup uses a null sentinel for prevCumulativeTotal so the first event is never confused with a duplicate. Sessions that emit only last_token_usage (no total_token_usage) report cumulativeTotal=0 on every event; with the previous 0-initialized prev, the first event matched the dedup guard and was dropped. - LiteLLM pricing values are clamped to [0, 1] per token via safePerTokenRate. Defense in depth against a tampered upstream JSON shipping negative or absurdly large per-token costs that would otherwise propagate into all cost totals. Performance: - Cursor SQLite parse no longer pegs at minutes on multi-GB DBs. Two changes: per-conversation user-message buffer uses an index pointer instead of Array.shift() (which was O(n) per call); and a real ROWID cutoff via subquery limits the scan to the most recent 250k bubbles with a stderr warning so power users get a partial report rather than a stalled CLI. - Spawned codeburn CLI subprocesses are terminated when the calling Task is cancelled. Without this, rapid period/provider tab clicks in the menubar cancelled the Task but left the subprocess running to completion, piling up zombie processes. UX: - Dashboard period switch flips to loading and clears projects synchronously before reloadData runs, eliminating the frame where the new period label rendered over the old period's projects. - Optimize findings tab paginates 3-at-a-time with j/k scroll. With 4 new detectors plus 7 originals, 8-10 findings * 6 lines was scrolling the StatusBar off the alt buffer top. - Custom --from/--to ranges hide the period tab strip and disable the 1-5 / arrow keys so a stray period press no longer abandons the user's explicit range. A "Custom range: X to Y" banner replaces the tab strip. - OpenCode storage-format warning is per-table-set, rate-limited to once per process, and points the user at OpenCode's migration step or the issue tracker. The previous all-or-nothing check fired the generic "format not recognized" string for any schema mismatch. Menubar / OAuth: - Both Claude and Codex bootstrap (Reconnect button) now honour the usageBlockedUntil 429 backoff that refreshIfBootstrapped respects. Spamming Reconnect during sustained rate-limit windows previously hammered the upstream endpoint on every click. - Codex Retry-After HTTP header is parsed (delta-seconds plus IMF-fixdate fallback) so we don't over-back-off when ChatGPT tells us a shorter window than our 5-minute floor. - Both credential cache files are written via SafeFile.write (O_CREAT \| O_EXCL \| O_NOFOLLOW with explicit 0600) so there is no race window where the temp file briefly exists at default umask, and a symlink at the destination cannot redirect the write. Reads now route through SafeFile.read with a 64 KiB cap, closing the symlink-follow gap on Data(contentsOf:). CI signal: - TypeScript strict typecheck (tsc --noEmit) is now zero errors. The six errors in src/providers/copilot.ts came from a discriminated-union catch-all branch whose `data: Record<string, unknown>` shape TS picked over the specific event branches when narrowing on `type`. Removed the catch-all; runtime falls through unknown event types via the existing if/else chain. Tests added: 16 new (now 555 total) - date-range-filter: month/day/year overflow rejection, leap-day correctness - currency-rounding: convertCost no-rounding contract, roundForActiveCurrency for USD/JPY/KRW/EUR - providers/copilot: malformed toolRequests does not abort the parse - providers/cursor-bubble-dedup: re-parse after token mutation does not double-count, single parse yields one call per bubble - providers/codex: first event with cumulativeTotal=0 not dropped, consecutive zero-cumulative duplicates still deduped	2026-05-06 22:15:11 -07:00

1 2 3 4 5 ...

301 commits