codeburn

mirror of https://github.com/AgentSeal/codeburn.git synced 2026-05-19 16:13:56 +00:00

Author	SHA1	Message	Date
Resham Joshi	38e41e93c3	Add Node version guard for unsupported runtimes (#319 ) Split CLI into a tiny launcher (src/cli.ts) that checks for Node >= 22.13.0 before dynamically importing the full CLI (src/main.ts). Users on Node 18 now get a clear upgrade message instead of a cryptic regex parse error from string-width. Closes #232.	2026-05-11 21:50:17 -07:00
Resham Joshi	d1eb13fb91	Expose per-day one-shot data in daily JSON output (#279 ) (#280 ) * Expose per-day one-shot data in daily JSON output Closes #279. Adds turns, editTurns, oneShotTurns, oneShotRate to each entry of the `daily[]` array in `codeburn report --format json` output. The data was already computed internally for activity-level rollups; this just buckets it by date so consumers building daily-resolution efficiency dashboards (streak tracking, heatmaps, rolling-window charts) don't have to re-derive the rate from period-level activities. Counting matches parser.ts categoryBreakdown semantics: - every turn counts toward `turns` - turns with hasEdits=true count toward `editTurns` - edit turns with retries=0 count toward `oneShotTurns` - oneShotRate is null (not 0) when editTurns=0 — a chat-only day's rate is undefined, and reading it as 0% would be misleading Real consumer named in the issue: a 10-developer internal usage tracker that scores days by cache hit + cost/call + (now) one-shot rate. * Strengthen daily/activities reconciliation + CHANGELOG entry - Fall back to turn.assistantCalls[0]?.timestamp when turn.timestamp is missing so daily aggregate doesn't drop turns that activities[] keeps. Previously sum(daily[].editTurns) could be < sum(activities[].editTurns) for sessions starting with assistant entries before any user line. - Add Unreleased CHANGELOG entry for the daily one-shot fields.	2026-05-09 21:01:05 -07:00
Resham Joshi	b4ed98cfa4	Add `codeburn models` per-model + per-task breakdown command (#287 ) A single dense table of every (provider, model) you have used in the selected period, sorted by cost. Inspired by tokscale's per-model output and ccusage's responsive cli-table3 layout, ported to plain Node with no new runtime dependency. Default view: one row per (provider, model) with a Top Task cell showing the dominant task category and its cost share, e.g. `Coding (42%)`. `--by-task` explodes each model into one row per task type, with provider/model cells blanked on subsequent rows of the same group and a horizontal divider between groups so the sections read as distinct units. Output formats: table (Unicode box-drawn, default), markdown (GitHub-flavored, copy-paste friendly), json, csv. Filters: --period (today/week/30days/month/all, default 30days), --from/--to, --provider, --task, --top, --min-cost, --no-totals. The table renderer auto-sizes every column to its content (no fixed widths leaving trailing whitespace) and drops cache columns as a pair when the terminal is narrow, then input/output, then top-task, in that order. Provider, model, total, and cost stay regardless. Visible-width math uses strip-ansi (already a dependency) so styled cells pad correctly. Cyan headers, yellow totals, dim provider name. The aggregator walks every parsed turn and attributes each assistant call to its (provider, model, task) bucket, computing real input / output / cache_write / cache_read tokens and cost. Output tokens include reasoning. Cached input tokens are folded into cache_read so the column matches what users intuitively expect. 19 fixture-based tests cover aggregation correctness, byTask grouping, taskFilter, topN/minCost filters, reasoning-as-output, all four renderers (table/markdown/json/csv), narrow-terminal column dropping, CSV/markdown escaping, totals row toggle, and visible-width math under styled cells.	2026-05-09 20:45:21 -07:00
Resham Joshi	fb8f25fb97	Reject invalid --format and --period values instead of silently falling back (#258 ) getDateRange() silently fell back to week on unknown periods, and no command validated --format. A typo like --period mounth or --format yaml produced wrong output with exit 0. Now all 6 format-accepting commands and all period-accepting paths reject unknown values with a clear message and exit 1. Also fixes the status description (said today+week+month, only shows today+month).	2026-05-06 23:03:41 -07:00
Resham Joshi	daa673449c	Menubar and CLI hardening from multi-agent audit (#257 ) Some checks are pending CI / semgrep (push) Waiting to run Details Two passes of validators across CLI accuracy, dashboard UX, menubar Swift, performance, security, and end-to-end smoke tests on real session data. Data-correctness fixes: - parseLocalDate rejects month/day overflow. JS Date silently rolled Feb 31 to Mar 3, so --from 2026-02-31 --to 2026-03-15 quietly dropped sessions on Feb 28 - Mar 2. Now throws "Invalid date" with a clear reason. Leap-day case covered (2024-02-29 valid, 2025-02-29 rejected). - CSV/JSON exports use the active currency's natural decimal places. The previous round2 helper produced ¥412.37 in CSV while the dashboard rendered ¥412 — finance teams comparing the two surfaces saw a discrepancy. New roundForActiveCurrency consults Intl.NumberFormat for the right precision (0 for JPY/KRW/CLP, 2 for USD/EUR, etc). - Copilot toolRequests is Array.isArray-guarded in both modern and legacy event branches. Previously a corrupt session with toolRequests=null or a string aborted the whole file's parse loop and silently dropped every legitimate call after it. - Codex token_count dedup uses a null sentinel for prevCumulativeTotal so the first event is never confused with a duplicate. Sessions that emit only last_token_usage (no total_token_usage) report cumulativeTotal=0 on every event; with the previous 0-initialized prev, the first event matched the dedup guard and was dropped. - LiteLLM pricing values are clamped to [0, 1] per token via safePerTokenRate. Defense in depth against a tampered upstream JSON shipping negative or absurdly large per-token costs that would otherwise propagate into all cost totals. Performance: - Cursor SQLite parse no longer pegs at minutes on multi-GB DBs. Two changes: per-conversation user-message buffer uses an index pointer instead of Array.shift() (which was O(n) per call); and a real ROWID cutoff via subquery limits the scan to the most recent 250k bubbles with a stderr warning so power users get a partial report rather than a stalled CLI. - Spawned codeburn CLI subprocesses are terminated when the calling Task is cancelled. Without this, rapid period/provider tab clicks in the menubar cancelled the Task but left the subprocess running to completion, piling up zombie processes. UX: - Dashboard period switch flips to loading and clears projects synchronously before reloadData runs, eliminating the frame where the new period label rendered over the old period's projects. - Optimize findings tab paginates 3-at-a-time with j/k scroll. With 4 new detectors plus 7 originals, 8-10 findings * 6 lines was scrolling the StatusBar off the alt buffer top. - Custom --from/--to ranges hide the period tab strip and disable the 1-5 / arrow keys so a stray period press no longer abandons the user's explicit range. A "Custom range: X to Y" banner replaces the tab strip. - OpenCode storage-format warning is per-table-set, rate-limited to once per process, and points the user at OpenCode's migration step or the issue tracker. The previous all-or-nothing check fired the generic "format not recognized" string for any schema mismatch. Menubar / OAuth: - Both Claude and Codex bootstrap (Reconnect button) now honour the usageBlockedUntil 429 backoff that refreshIfBootstrapped respects. Spamming Reconnect during sustained rate-limit windows previously hammered the upstream endpoint on every click. - Codex Retry-After HTTP header is parsed (delta-seconds plus IMF-fixdate fallback) so we don't over-back-off when ChatGPT tells us a shorter window than our 5-minute floor. - Both credential cache files are written via SafeFile.write (O_CREAT \| O_EXCL \| O_NOFOLLOW with explicit 0600) so there is no race window where the temp file briefly exists at default umask, and a symlink at the destination cannot redirect the write. Reads now route through SafeFile.read with a 64 KiB cap, closing the symlink-follow gap on Data(contentsOf:). CI signal: - TypeScript strict typecheck (tsc --noEmit) is now zero errors. The six errors in src/providers/copilot.ts came from a discriminated-union catch-all branch whose `data: Record<string, unknown>` shape TS picked over the specific event branches when narrowing on `type`. Removed the catch-all; runtime falls through unknown event types via the existing if/else chain. Tests added: 16 new (now 555 total) - date-range-filter: month/day/year overflow rejection, leap-day correctness - currency-rounding: convertCost no-rounding contract, roundForActiveCurrency for USD/JPY/KRW/EUR - providers/copilot: malformed toolRequests does not abort the parse - providers/cursor-bubble-dedup: re-parse after token mutation does not double-count, single parse yields one call per bubble - providers/codex: first event with cumulativeTotal=0 not dropped, consecutive zero-cumulative duplicates still deduped	2026-05-06 22:15:11 -07:00
Resham Joshi	afd0ee7011	Validator hardenings on the bug-hunt batch (#254 ) * Five correctness fixes from multi-agent bug hunt A multi-agent audit of the codeburn correctness surface found five real bugs each producing visibly wrong numbers or risking data loss. All five fixes were validated by parallel review agents and exercised end-to-end against real session data on this machine. - src/cli.ts: --refresh <seconds> was using bare parseInt as the commander callback. Commander invokes the callback as parseInt(value, previous), so previous becomes the radix: --refresh 30 was being parsed as parseInt('30', 30) = 90, and --refresh 60 became NaN. Replaced with parseInteger (already defined at line 48 with radix locked to 10) at all three sites. - src/providers/cursor.ts: parseAgentKv was timestamping every agentKv call as new Date().toISOString() because the Cursor SQLite schema has no per-message timestamp. Result: every Cursor agent call regardless of when it happened landed in today's date bucket. Now uses statSync(dbPath).mtimeMs as a bounded ceiling so calls land at the actual last-write time of the Cursor database, not today. Verified locally: a 1904-call Cursor history with March 22 mtime now correctly bucket into all-time only and shows 0 calls for today/week/30days. - src/providers/codex.ts: prev token counters were only updated inside the cumulative-fallback branch, so a session emitting N events with last_token_usage followed by one cumulative-only event computed the next delta against prev=0 and double-counted the entire cumulative window. Cost could be inflated 10-100x for any mixed-format Codex session. Now prev advances to the current cumulative state regardless of which branch ran. - src/providers/gemini.ts: totalOutput accumulated output+thoughts while totalThoughts was tracked separately. The result was outputTokens = output+thoughts AND reasoningTokens = thoughts; any consumer summing the two double-counted thoughts. Now totalOutput holds just output, reasoningTokens holds thoughts, and the cost calc folds thoughts into the output count to keep pricing correct (Google bills thoughts at the output rate; calculateCost has no reasoning parameter). - src/export.ts: exportJson had no safety check before writeFile, so codeburn export -f json -o ~/important.json would silently clobber the user's file. CSV path had a marker-file guard; JSON did not. Now refuses to overwrite a file unless its first 4KB contain the codeburn schema marker. Uses a streaming partial read so a large existing file does not OOM Node's ~512MB string limit. Refuses directories outright. Skipped intentionally: cursor-auto/copilot-auto/cline-auto/ qwen-auto are aliased to claude-sonnet-4-5. The audit flagged this as wrong pricing for non-Anthropic auto-routed turns, but Cursor's "auto" mode does not expose the actual model and any alternative estimate is equally arbitrary. README already documents this as a Sonnet-based estimate. vitest run: 38 files, 529 tests pass. * Five more correctness fixes from the bug-hunt round This commit closes out the remaining critical-tier findings from the multi-agent audit, with one item documented as a known limitation. - src/providers/cursor.ts: bubble dedup key included mutable inputTokens/outputTokens. Cursor mutates token counts on the row in place when streaming completes, so re-parsing the same DB produced a fresh dedup key per bubble and silently double-counted. Switched to the SQLite row key (`bubbleId:<unique>`) which is stable per bubble. Adjusted BubbleRow type and BUBBLE_QUERY_BASE to expose `key as bubble_key`. - src/providers/pi.ts: usage fields were destructured non-optionally, but real Pi/OMP session files sometimes omit individual fields. `calculateCost(model, undefined, ...)` returned NaN, and that NaN propagated into every aggregate cost total. Coerce each field to 0 with `?? 0`. - src/models.ts: getShortModelName and the getModelCosts startsWith fallback both walked the dictionary in insertion order. A model id like `gpt-5-mini` could resolve to the entry for `gpt-5` (matched by startsWith first) and silently get GPT-5's display name and pricing tier. Iterate longest keys first so more-specific prefixes win. Tightened the cost fallback's match condition from `startsWith(key) \|\| startsWith(key + '-')` to require either an exact match or a `key + '-'` continuation, removing accidental matches like `gpt-50` against `gpt-5`. - src/models.ts: calculateCost returned 0 silently for any model missing from the pricing snapshot. New Anthropic / OpenAI models shipped between snapshot refreshes look free until the user notices. Now warns once per unknown model name per process to stderr. Skips the warning for the `<synthetic>` placeholder so the noise floor stays low. - src/yield.ts: revert detection was broken on the canonical case. Two problems: (1) `subject.toLowerCase().includes('revert')` matched any commit whose subject mentioned the word ("Add revert button" was misclassified). (2) The window logic only counted reverts within the original session's 1-hour boundary, but real `git revert` commits land in later sessions, so original sessions always looked productive. Now: getRevertedShas runs once with `--grep=^This reverts commit` and parses bodies to build a Set of SHAs that were the target of a revert anywhere in history. CommitInfo.wasReverted is set when this commit's SHA appears in that set. categorizeSession then flags a session as reverted when its in-main commits were later reverted, regardless of when the revert itself happened. - src/providers/droid.ts: SKIPPED with comment. Droid records token usage only at session level. The current behavior splits evenly across emitted assistant calls and prices all of them at settings.model (the latest model). For sessions where the user switched models mid-stream, costs are approximate. Added an inline comment documenting this; a real fix requires per-message model data that isn't in the Droid JSONL schema. Verified end-to-end on this machine: - vitest run: 38 files, 529 tests pass - `codeburn report --format json` produces valid JSON - `codeburn yield -p week` runs without crashing, finds 0 reverts in the user's recent git history (plausible — fix changed the detection from "subject contains revert" to "this commit's SHA appears in a later 'This reverts commit ...' body") - Stderr now warns for unknown model ids: `openai/gpt-5.3`, `qwen3.6:35b-a3b-bf16`, `big-pickle`. These previously priced silently at $0. * Four high-severity fixes from the bug-hunt round - src/currency.ts: getExchangeRate wrapped fetchRate and cacheRate in one try/catch. If fetchRate succeeded but cacheRate threw (disk full, ENOSPC, no permissions on the cache dir), the catch block swallowed the error and returned 1. Every cost rendered after that point became USD-equivalent silently. Now the fetch and the cache write live in separate paths: a successful fetch returns the rate even if the persist fails, and the cache-write error is dropped to a fire-and-forget so transient disk problems do not corrupt the user's currency display. - src/cursor-cache.ts: writeFile was non-atomic. Two concurrent codeburn invocations writing to cursor-results.json could interleave bytes mid-write, leaving a truncated file that parsed-error on next read and forced a full SQLite re-scan every run. Switched to the temp-file + rename pattern with a randomized temp name so each writer gets its own staging file and the rename is atomic on POSIX. Crash mid-write also leaves only a leftover temp file, which gets unlinked in the catch path; the destination is never half-written. - mac/.../CodeBurnApp.swift refresh loop on sleep: the loop's Task.sleep keeps a wakeup pending across system sleep, so on wake the natural tick fires the same instant the wake observers do. Combined with didWakeNotification, screensDidWakeNotification, and the launchd com.codeburn.refresh distributed notification, that produced 2-3 concurrent CLI spawns within ms of every wake. Now: willSleepNotification cancels the loop task; didWakeNotification restarts it. The loop also reads lastRefreshTime and skips its natural tick if a wake/manual/distributed-notification refresh ran within the last 5 seconds, coalescing the two sources of refresh into one CLI spawn per wake event. - mac/.../CodeBurnApp.swift observeStore: the read closure had an implicit strong self capture (it accessed store.* without a capture annotation), pinning self for the lifetime of any unfired observation. Added [weak self] and a guard to make the capture explicit. withObservationTracking is one-shot per call, so there is at most one active subscription at a time; the earlier audit's claim of an unbounded leak overstated the issue, but tightening the capture pattern is still cleaner. Verified: - vitest run: 38 files, 529 tests pass - swift build -c release --arch arm64 --arch x86_64: clean, no diagnostics, no MainActor warnings - mac/Scripts/package-app.sh dev produces a valid universal bundle - Menubar launches and runs without crash * Eleven medium-severity fixes from the bug-hunt round - src/format.ts formatTokens: guard against Infinity, NaN, and negative input. Previously a corrupt aggregate could leak into the UI as the literal strings "NaN" or "Infinity". Negatives now render as "0" rather than "-500" with no scaling. - src/cli-date.ts parseDateRangeFlags: the missing-from default was new Date(0), which opened a 55-year scan from 1970 epoch whenever the user passed only --to. Default now anchors at 6 months back from now, matching the dashboard's all-time period. Test updated to assert the new bounded window. - src/cli-date.ts toPeriod: previously fell back silently to "week" for any unknown input, so a typo like `-p mounth` produced a quiet 7-day report while the user thought they were viewing the month. Now exits with a clear stderr error and exit code 1. Test updated to assert the loud-failure behavior. - src/optimize.ts urgencyScore: rebalanced weights so a high-impact finding with zero observed tokens cannot outrank a medium-impact finding with millions of tokens. Old 0.7/0.3 split made high+0 (0.70) beat medium+1B (0.65). New 0.5/0.5 split makes medium+1B (0.75) beat high+0 (0.50). Token normalization lifted to 5M so the ramp covers a realistic spend range. - src/models.ts calculateCost: clamp negative or non-finite token inputs to 0 before pricing. A corrupt JSONL emitting a negative count would otherwise produce a negative cost that silently subtracted from real spend in aggregates. - src/currency.ts convertCost: stop rounding during aggregation. For zero-fraction currencies (JPY, KRW, CLP) this clamped every per-session cost to a whole unit before sum, so a project of 1000 sessions averaging ¥0.4 each aggregated to ¥0 instead of ¥400. formatCost still rounds at the display boundary. - src/config.ts saveConfig: the temp file path was a fixed `${configPath}.tmp` suffix. Two simultaneous saveConfig calls (overlapping menubar and CLI runs) raced on the same staging file and could leave one writer reading partial bytes from the other. Randomized the temp suffix per call. - src/providers/antigravity.ts flushCache: the early return on `!cacheDirty` short-circuited eviction when liveCascadeIds was supplied but no cascade had been added or updated this run. As a result, deleted .pb files persisted in the cache forever once the user stopped writing to it. Eviction now runs whenever liveCascadeIds is provided, marks the cache dirty if anything was removed, and only then short-circuits if there is nothing to write. - src/daily-cache.ts addNewDays: cap retention at 2 years. The days array previously merged forever, growing the cache file by hundreds of bytes per day until JSON parse on every CLI invocation became measurable. The 6-month UI period plus the 365-day BACKFILL_DAYS bootstrap both fit comfortably inside the cap, with headroom for a future longer window. - src/dashboard.tsx useInput: period number keys (1-5) and arrow keys triggered a reload while the compare view was mounted. The parent's data state changed underneath the user with no visual affordance back to the dashboard. Now those keys are gated on view !== 'compare', and `b` / Esc inside compare returns to the dashboard. - mac/.../HeatmapSection.swift formatters: prettyDate, buildTrend Bars, computeTrendStats, computeForecast, and computeAllStats each allocated a fresh DateFormatter (and Calendar) on every call. SwiftUI re-evaluates these views many times per second during hover scrubbing on the trend chart, so the allocations were a measurable hot spot. Lifted the yyyy-MM-dd / "EEE MMM d" / "MMM d" formatters and the gregorian Calendar to fileprivate cached singletons. Two findings from the same bucket were not addressed here: - UpdateChecker SHA-256 / codesign verification is already performed by src/menubar-installer.ts (verifyChecksum at line 85). The Swift side just kicks off `codeburn menubar --force` which runs that path. The audit's claim of missing verification was a misread. - NSDistributedNotificationCenter sender validation: the `com.codeburn.refresh` listener accepts from any sender, but forceRefresh has a 5-second rate-limit gate so the abuse ceiling is one CLI spawn per 5 seconds. Mitigations (Mach IPC, per-launch shared secret) are disproportionate to the impact. vitest run: 38 files, 529 tests pass. swift build -c release: clean, no warnings. * Validator hardenings on the bug-hunt batch Hoist the per-call sort in getModelCosts and getShortModelName to module scope so model lookups on the hot path stop reallocating sorted key arrays. Sanitize the unknown-model stderr warning by stripping C0/C1 controls and capping length, so a hostile or corrupt JSONL cannot inject terminal escape sequences via the model field. Skip the daily-cache prune when newestDate fails to parse. The previous code produced a NaN cutoff and silently dropped every cached day on the next merge. Adds tests locking down the stable resolution of common model names (gpt-5-mini vs gpt-5, claude-haiku-4-5 vs claude-3-5-haiku, etc.) and the prune NaN guard.	2026-05-06 19:50:40 -07:00
Resham Joshi	be6068b244	feat(report): add per-model efficiency metrics Adds per-model efficiency metrics (edit turns, one-shot rate, retries/edit, cost/edit) to the TUI By Model panel, JSON report output, and CSV export. Closes item 4 of #12. Supersedes #226 with review fixes (units rename, min-sample guard in TUI, tighter <synthetic> filter, multi-model attribution test). Original implementation by @ozymandiashh.	2026-05-05 23:36:59 -07:00
ozymandiashh	fc4c4f0091	feat(export): support custom date ranges	2026-05-05 23:18:48 -07:00
ozymandiashh	3dc3e32715	fix(date-range): unify 'all' period semantics between CLI and dashboard `getDateRange` was duplicated across `src/cli.ts` and `src/dashboard.tsx` with conflicting semantics for `'all'`. The CLI intentionally bounded `'all'` to the last 6 months (justified inline: keeps Codex/Cursor parses responsive on sparse multi-year history). The dashboard returned `new Date(0)` instead, so the same `--period all` flag silently meant two different windows depending on which entry point you hit. `Period`, `PERIODS`, `PERIOD_LABELS`, and `toPeriod` were duplicated as well, and `cli-date.ts` already existed for date helpers (`parseDateRangeFlags`) so the consolidation lives there. Both call sites now go through a single `getDateRange(period: string)` in `cli-date.ts` that returns `{ range, label }`. The dashboard wraps it as `getPeriodRange(period: Period)` to keep the strict `Period` type at the React boundary while letting the CLI continue to accept extras like `'yesterday'`. `PERIOD_LABELS.all` becomes `'6 Months'` (short, for the dashboard tab strip; the previous `'All Time'` was misleading and the long-form `'Last 6 months'` from `getDateRange().label` already drives CLI output). Changes: - src/cli-date.ts: add `Period`, `PERIODS`, `PERIOD_LABELS`, `toPeriod`, `getDateRange`. Pull the existing 6-month rationale into a named `ALL_TIME_MONTHS` constant. - src/cli.ts: drop the local copies and import from cli-date. - src/dashboard.tsx: drop the local copies, route through `getPeriodRange`, alias the shared `getDateRange` import to `getDateRangeShared` to avoid shadowing the wrapper. - tests/cli-date.test.ts: 13 cases covering `'all'` regression guard (must never silently fall back to `Date(0)`), CLI/dashboard agreement, end-of-month clamping tolerance, `'yesterday'` support, and unknown-input fallback. - README.md, CHANGELOG.md: surface the bound and point heavy users at `--from`/`--to` for unbounded windows. The CLI flag `--period all` continues to be accepted; only the dashboard window changes to match what the CLI was already doing. No public API or schema change. Refs #93	2026-05-05 03:53:46 +03:00
iamtoruk	68c6f2c710	Fix timezone handling: menubar UTC bugs, --timezone flag, DST-safe dates Three fixes for issue #184: 1. Menubar Swift code used UTC instead of local timezone in two places: computeHistoryStats hardcoded TimeZone("UTC") and effectiveTokensInLast7Days used ISO8601DateFormatter (UTC default). Both now use .current to match CLI-produced local date keys. 2. Add --timezone flag and CODEBURN_TZ env var to override the system timezone for all date grouping. Sets process.env.TZ before any Date operations so all existing local-timezone code works unchanged. 3. Replace MS_PER_DAY arithmetic with Date constructor day-of-month math for yesterday/backfill computations. Subtracting 86400000ms from midnight skips a day on DST spring-forward (23-hour day). Fixes #184	2026-04-30 17:33:02 -07:00
AgentSeal	7b8f4df856	Fix review findings: error handling, constant dedup, migration persist - Wrap hydrateCache() in try/catch so disk errors don't crash commands that previously never touched the cache - Export MS_PER_DAY, BACKFILL_DAYS, toDateString from daily-cache.ts and remove duplicates from cli.ts - Remove double hydrateCache() call in report JSON path - Persist migrated cache to disk so old-version files aren't re-migrated on every run - Export emptyCache() for use as fallback on hydration failure	2026-04-28 22:56:47 +02:00
AgentSeal	888f592bd2	Make daily cache durable: hydrate from all commands, migrate instead of nuke - Extract ensureCacheHydrated() from menubar-json path into daily-cache.ts - Call it from every command that parses sessions (report, status, today, month, export, optimize, compare, yield) so CLI-only users also persist historical data that survives source file deletion - Replace strict version equality check with fill-defaults migration for cache versions 2-4, preserving history across schema changes - Back up old cache to .bak before discarding on unmigrateable versions - Fix Copilot auto bucket display names in menubar (Copilot (Anthropic), Copilot (OpenAI)) - Fix Roo Code / KiloCode provider key matching in menubar tab strip	2026-04-28 22:41:01 +02:00
AgentSeal	64259c929c	Fix Gemini provider for JSONL format (CLI 0.39+) Gemini CLI 0.39 switched from single JSON to JSONL with one object per line and $set metadata lines. Parser now handles both formats. Also updated --provider help text to list all providers.	2026-04-28 15:03:39 +02:00
iamtoruk	1c5ca45bd8	Add experimental yield command to track productive vs wasted spend New command: codeburn yield --period <period> Correlates AI sessions with git commits to categorize spend: - Productive: sessions with nearby commits that made it to main - Reverted: sessions with commits that were reverted - Abandoned: sessions with no commits or not in main Uses timestamp proximity heuristic (session time + 1 hour window). Works across branches, squash merges, and rebases by checking if commits are in main branch ancestry. Closes #152	2026-04-25 13:47:07 +02:00
iamtoruk	8ffbffcd7f	Fix double-counting in menubar-json period data PR #136 (`2e5e449`) changed parseAllSessions to use periodInfo.range (full period) instead of just today's range. When combined with cached historical data, this caused days to be counted twice: - Once from getDaysInRange(cache, ...) - Again from parseAllSessions(periodInfo.range, ...) Result: 7-day cost showed ~$402 instead of correct ~$209. Fix: Parse only today's sessions when using cache path. Historical data comes exclusively from cache, today's data from fresh parse.	2026-04-25 01:44:02 +02:00
AgentSeal	2e5e449ad1	fix: use consistent date range for menubar status queries The menubar showed stale prices because provider all used end:now while provider specific queries used end:endOfDay. Sessions with timestamps after now was captured were excluded from all providers but included in specific provider queries. Use periodInfo.range consistently across all parseAllSessions calls in menubar json status.	2026-04-22 21:28:12 +02:00
iamtoruk	4f1138290e	Merge main into feat/omp-support-model-aliases Second merge of main since the PR was opened. Main moved 30+ commits (0.8.5 bump, plan tracking feature, MiniMax pricing, menubar prefetchAll walk-back, aicrowd cache rewrite revert) so the branch needed another reconciliation before merging to main. Two new conflicts resolved. Took main's text in both cases per the policy of favoring main when the feature work is neutral: README.md Kept main's Node 20+ / better-sqlite3 Requirements wording and main's shorter src/ tree listing. Added OMP to the Requirements line. src/providers/pi.ts Main dropped the discovery-cache snapshot and the rich source-metadata fields as part of the aicrowd revert. Took main's simpler structure and only kept the providerName parameter so OMP sources still report the correct provider in the session source and dedup key. Earlier fixups carried forward from the prior merge commit: - Object.hasOwn guards in resolveAlias against prototype-pollution via a model literally named '__proto__'. - source.provider in the dedup key prefix so OMP rows no longer stamp 'pi:'. - Combined pi.js imports in providers/index.ts. - Trailing newline on pi.ts. - Unknown-model fallback in cursor-agent.ts from yesterday's PR #117 fixup (preserved via main). 353 tests pass (count dropped from 378 because main deleted the parse-progress / parser-cache / provider-colors / source-cache test files alongside the cache-rewrite revert). Feature work by @cgrossde.	2026-04-21 11:51:20 -07:00
Trevin Chow	2c43ec1ad0	fix(plan): resolve type errors in plan summary and isActivePlan guard Two pre-existing type errors surfaced during the rebase against main: 1. JsonPlanSummary.id was hardcoded to four plan ids, but PlanId now includes 'none' (PLAN_IDS was extended when 'codeburn plan clear' was added). toJsonPlanSummary only runs for active plans at runtime, but the static type still had to be widened. Use PlanId directly instead of the hand-rolled union. 2. isActivePlan used Boolean(plan) as the nullish guard, which doesn't narrow plan's type in TypeScript. Switch to an explicit 'plan !== undefined' so the subsequent .id and .monthlyUsd accesses type-check. npx tsc --noEmit is now clean; all 285 tests still pass.	2026-04-21 04:20:55 -07:00
Trevin Chow	3f7470d29b	feat(plan): subscription plan tracking with usage progress bar Adds `codeburn plan set <id>` to configure a subscription plan (Claude Pro, Claude Max, Cursor Pro, or custom). When set, the Overview panel renders an API-equivalent progress bar against subscription price with a projected month-end cost. Closes the loudest demand signal on the repo: issue #11 ("Subscription vs API Use") from two independent voices, plus the routing-decision use case raised in #12. - src/config.ts: extends CodeburnConfig with Plan, adds readPlan/savePlan/clearPlan - src/plans.ts: presets (claude-pro $20, claude-max $200, cursor-pro $20) - src/plan-usage.ts: getPlanUsage, resetDay-aware period math (1-28), median-of-7-day-trailing projection - src/cli.ts: `codeburn plan [show\|set\|reset]` subcommand, plan wired into JSON outputs for report/today/month/status (only when active) - src/dashboard.tsx: Plan row in Overview, color-coded (green under 80%, orange near, red over), with days-until-reset - README.md: Plans section with honest framing (API-equivalent vs subscription price, not token allowance) - tests/plan-usage.test.ts, tests/plans.test.ts, tests/cli-plan.test.ts: period math, presets, CLI round-trip Resets respect resetDay across month boundaries. Uses median daily spend (not mean) so one huge day doesn't distort the month-end projection. Fixes #11	2026-04-21 04:20:50 -07:00
iamtoruk	c2ab80d6e2	Merge main into feat/omp-support-model-aliases Brings the PR branch up to the current main so the OMP provider and the model-alias command can land cleanly. Resolves six merge conflicts and applies a handful of small fixups alongside the resolution so the feature matches the conventions set by the cursor-agent merge earlier today. Conflict resolutions: README.md Combine cursor-agent and OMP rows in provider list, Requirements, and data-location table; take main's Node 22+ and node:sqlite text. src/cli.ts Keep both new commands: model-alias and plan. src/config.ts Add modelAliases alongside plan on the config type. src/providers/index.ts Keep the cursor-agent lazy-loader from main and add omp to coreProviders. Fold the two pi-module imports into one statement. src/providers/pi.ts Keep the discovery-cache snapshot path from main and the providerName parameterization from the PR. Propagate providerName through saveDiscoveryCache, loadDiscoveryCache, the parserVersion tag, and the dedup key prefix so OMP sources no longer stamp 'pi:' inside their cache entries or dedup keys. tests/models.test.ts Keep main's pricing-and-short-name tests and add the PR's alias tests alongside, sharing a single loadPricing setup and an afterEach alias reset. Fixups in the same commit: src/models.ts Replace ?? chain in resolveAlias with Object.hasOwn checks. The previous form returned Object.prototype for a model named '__proto__' and broke downstream canonical.startsWith calls. Caught by the existing prototype-pollution test suite. src/providers/pi.ts Use source.provider in the dedup key prefix and add a trailing newline to the file. tests/providers/omp.test.ts Expect 'omp:' in the dedup key for OMP sources, matching the fix above. Feature work by @cgrossde.	2026-04-21 03:16:28 -07:00
iamtoruk	bf17d20476	fix: remove noCache override for menubar-json format Source cache fixes (empty-session guards, date range reordering) make the cache safe for menubar use. Forcing noCache on every 15s poll was re-parsing 5800+ files each time, causing the menubar to hang.	2026-04-20 18:25:43 -07:00
iamtoruk	b7ad5c5505	fix: source cache empty-session poisoning, TUI refresh, menubar stale data Source cache entries with zero sessions now treated as cache misses instead of serving stale empty data. Date range skip moved after fingerprint check so changed files are never incorrectly excluded. TUI refresh timer bypasses in-memory CachedWindow cache. Menubar-json forces noCache. Swift menubar adds explicit refreshStatusButton calls to avoid observation race.	2026-04-20 18:07:37 -07:00
Sharada Mohanty	2a9daec0ea	feat: add cache rebuild flag and progress	2026-04-21 00:03:49 +02:00
Trevin Chow	c0d24cc191	fix(plan): resolve type errors in plan summary and isActivePlan guard Two pre-existing type errors surfaced during the rebase against main: 1. JsonPlanSummary.id was hardcoded to four plan ids, but PlanId now includes 'none' (PLAN_IDS was extended when 'codeburn plan clear' was added). toJsonPlanSummary only runs for active plans at runtime, but the static type still had to be widened. Use PlanId directly instead of the hand-rolled union. 2. isActivePlan used Boolean(plan) as the nullish guard, which doesn't narrow plan's type in TypeScript. Switch to an explicit 'plan !== undefined' so the subsequent .id and .monthlyUsd accesses type-check. npx tsc --noEmit is now clean; all 285 tests still pass.	2026-04-20 14:55:07 -07:00
Trevin Chow	553cf2d706	feat(plan): subscription plan tracking with usage progress bar Adds `codeburn plan set <id>` to configure a subscription plan (Claude Pro, Claude Max, Cursor Pro, or custom). When set, the Overview panel renders an API-equivalent progress bar against subscription price with a projected month-end cost. Closes the loudest demand signal on the repo: issue #11 ("Subscription vs API Use") from two independent voices, plus the routing-decision use case raised in #12. - src/config.ts: extends CodeburnConfig with Plan, adds readPlan/savePlan/clearPlan - src/plans.ts: presets (claude-pro $20, claude-max $200, cursor-pro $20) - src/plan-usage.ts: getPlanUsage, resetDay-aware period math (1-28), median-of-7-day-trailing projection - src/cli.ts: `codeburn plan [show\|set\|reset]` subcommand, plan wired into JSON outputs for report/today/month/status (only when active) - src/dashboard.tsx: Plan row in Overview, color-coded (green under 80%, orange near, red over), with days-until-reset - README.md: Plans section with honest framing (API-equivalent vs subscription price, not token allowance) - tests/plan-usage.test.ts, tests/plans.test.ts, tests/cli-plan.test.ts: period math, presets, CLI round-trip Resets respect resetDay across month boundaries. Uses median daily spend (not mean) so one huge day doesn't distort the month-end projection. Fixes #11 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-20 14:55:07 -07:00
iamtoruk	fc576f44ba	fix: real-time refresh for menubar and TUI dashboard Menubar: reduce cache TTL from 300s to 30s, background refresh from 60s to 15s, always fetch fresh data on tab switch instead of serving stale cache. TUI: default auto-refresh to 30s (--refresh 0 to disable). Closes #107	2026-04-19 08:55:48 -07:00
iamtoruk	e89706b549	feat(compare): add codeburn compare command	2026-04-19 05:37:34 -07:00
iamtoruk	e3395d241f	Fix daily cache gap fill using UTC instead of local time The gapStart date was constructed with T00:00:00.000Z (UTC midnight), causing it to land hours before local midnight. In PDT this meant the gap fill re-parsed a partial slice of the previous day, and the upsert replaced the full day with that partial data, losing cost. Bump DAILY_CACHE_VERSION to 3 to force cache rebuild.	2026-04-19 04:23:17 -07:00
iamtoruk	72ccf34a5a	fix: use local timezone for daily date bucketing instead of UTC Timestamps in session files are UTC ISO strings. Several code paths extracted the date via .slice(0, 10) which gives the UTC date, while date range filtering uses local-time boundaries. This caused turns between UTC midnight and local midnight to be bucketed under the wrong day -- the menubar showed lower today cost than the TUI because those turns were attributed to tomorrow (UTC) but filtered as today (local). format.ts already had a localDateString fix; this applies the same pattern everywhere via dateKey() in day-aggregator.ts.	2026-04-19 03:18:38 -07:00
iamtoruk	888030fce3	fix: recompute yesterday in daily cache to prevent stale menubar data The daily cache never re-processed yesterday once cached, so a mid-day run would freeze partial cost/call data permanently. The "All" provider path in menubar-json relied on this cache, causing the menubar to show wildly incorrect numbers while per-provider views (which parse fresh) were correct. Now yesterday is evicted and recomputed on every run, and addNewDays upserts instead of skipping duplicates as defense-in-depth.	2026-04-19 03:07:54 -07:00
Chris	0c5c6fa655	Merge branch 'main' into feat/omp-support-model-aliases	2026-04-19 11:36:49 +02:00
Ninym	c634b10560	feat(report): add --from/--to date range filtering and avgCostPerSession (#80 ) * test(cli): failing tests for parseDateRangeFlags helper * feat(cli): add parseDateRangeFlags helper with local-time dates * feat(report): add --from/--to date range filtering * feat(report): add avgCostPerSession to JSON report and CSV/JSON export	2026-04-18 15:11:33 -07:00
Resham Joshi	495a254338	feat(mac): native Swift menubar app + one-command install Introduces mac/ with a native SwiftUI menubar app that replaces the previous SwiftBar plugin entirely. Install via `npx codeburn menubar`, which downloads the .app from GitHub Releases, strips Gatekeeper quarantine, and drops it into ~/Applications. Highlights - mac/ SwiftUI app: agent tabs, Today/7/30/Month/All period switcher, Trend/Forecast/Pulse/Stats/Plan insights, activity + model breakdowns, optimize findings, CSV/JSON export, Star-on-GitHub banner, live 60s refresh, instant currency switching with offline FX cache. - Security: CodeburnCLI argv-based spawn (no shell interpretation), SafeFile symlink guards + O_NOFOLLOW writes, FX rate clamping to [0.0001, 1_000_000], keychain filtered to account == "default", removed byte-window credential log, in-flight refresh guard, POSIX flock on config.json writes, TerminalLauncher validates argv before AppleScript interpolation. - Performance: shared static NumberFormatter (thousands of allocations per popover redraw eliminated), concurrent pipe drain with 20 MB cap + 60s timeout in DataClient, Observation-tracked reactive UI, 5-min payload cache keyed on (period, provider). - CLI: new `codeburn menubar` subcommand that downloads + installs + launches the .app (no clone, no build). New `status --format menubar-json` payload builder. `export` rewritten to produce a folder of one-table-per-file CSVs with a `.codeburn-export` marker so arbitrary -o paths cannot be silently deleted. - Removed: src/menubar.ts (SwiftBar plugin generator), install-menubar / uninstall-menubar subcommands, `status --format menubar` directive output, tests/menubar.test.ts, tests/security/menubar-injection.test.ts. - Release: .github/workflows/release-menubar.yml builds universal binary, assembles .app, ad-hoc signs, zips, uploads on mac-v* tag push. Runs on the free macos-latest runner. Tests - 230 TypeScript tests pass - 10 Swift CapacityEstimator tests pass - TypeScript typecheck clean - Swift release build clean	2026-04-17 16:55:56 -07:00
Ninym	8f5927153e	feat(cli): add --verbose flag for stderr warnings Sets CODEBURN_VERBOSE=1 via commander preAction, which the fs-utils helpers check before emitting stderr lines on skipped or failed reads. Closes LOW-1 from the 2026-04-16 audit.	2026-04-17 08:32:20 +02:00
AgentSeal	1cd96ea19f	Merge origin/main into feat/optimize Resolve conflicts in src/dashboard.tsx: keep optimize view plumbing (setOptimizeResult, OptimizeResult state, o/b keys) while integrating project/exclude filters on reloadData and renderDashboard entry.	2026-04-16 16:08:20 -07:00
AgentSeal	d15532af0b	fix: apply --project/--exclude to menubar per-provider today totals The menubar status output computes per-provider today costs by iterating all providers after the main period blocks. That loop bypassed the project filter, so --project/--exclude affected the main totals but not the provider breakdown shown below them.	2026-04-16 15:50:00 -07:00
Travis Haley	67c504a60a	feat: add --project and --exclude filters for project-level filtering Adds two new repeatable flags to all commands (report, today, month, status, export): - --project <name>: include only projects matching name (substring, case-insensitive) - --exclude <name>: exclude projects matching name (substring, case-insensitive) Both flags can be specified multiple times to match multiple projects. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 15:54:37 -06:00
AgentSeal	4154413e59	chore: DRY json report branches and drop unused import - extract Period type alias - hoist repeated json action body into runJsonReport helper - remove unused getCostColumnHeader import	2026-04-16 14:43:03 -07:00
AgentSeal	79e67f0bc9	Add OMP provider support and model alias mapping - Add OMP provider reading from ~/.omp/agent/sessions (same JSONL format as Pi, shared parser) - Parameterize discoverSessionsInDir with provider name so sessions carry correct provider field - Add BUILTIN_ALIASES for proxy model name variants (anthropic--claude-* double-dash format) that don't match LiteLLM keys - Add model-alias CLI command for user-defined name mappings - Wire setModelAliases into preAction after config load - Add modelAliases field to CodeburnConfig - Update README: OMP in provider table, model-alias section	2026-04-16 23:35:46 +02:00
AgentSeal	21444df2bf	merge main into feat/optimize - resolve dashboard.tsx conflicts: keep optimize view + context budget column alongside main's all-time period and TopSessions panel - ProjectBreakdown: add avg/s column from main plus overhead column from optimize, widths 30/40 - StatusBar: 1-5 periods including all-time, plus o-optimize when findings exist - DashboardContent: all-time period handling and TopSessions panel preserved Copilot provider and its 253 tests from main merged cleanly as additions.	2026-04-16 12:56:53 -07:00
Travis Haley	7a061ea679	feat: add periodKey and topSessions to JSON output	2026-04-16 13:13:09 -06:00
Travis Haley	60712785d7	style: remove section-label comments per repo convention	2026-04-16 13:10:05 -06:00
Travis Haley	fad21f3097	fix: destructure cost before spread so convertCost isn't overwritten in models JSON output	2026-04-16 13:08:46 -06:00
Travis Haley	09139b1d92	feat: add --format json to report, today, and month commands Outputs full dashboard data as structured JSON to stdout, including: overview, daily breakdown, projects, models with token counts, activities with one-shot rates, core tools, MCP servers, and shell commands. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 12:21:51 -06:00
Ninym	cfd046a95c	feat: add All Time period, avg/s per project, and Top Sessions panel - Add 'all' period (key 5) to dashboard showing data from all time - Daily Activity panel shows all available days when All Time is active - Add avg cost per session column to Project breakdown - Add Top Sessions panel highlighting the 5 most expensive sessions	2026-04-16 13:14:34 +02:00
AgentSeal	710316053e	feat: add optimize command, in-TUI optimize view, and per-project context budget Optimize engine detects 8 waste patterns from Claude Code session data: - Junk directory reads (node_modules, .git, dist, etc.) - Duplicate file reads per session - Unused MCP servers (configured but never called) - Missing .claudeignore in projects with junk dirs - Bloated CLAUDE.md files (>200 lines) - Uncapped BASH_MAX_OUTPUT_LENGTH - Low Read:Edit ratio (edit-without-reading, per #42796) - High cache_creation overhead (per #46917) Each finding includes impact rating, token/cost savings estimate, and exact fix (paste, command, or file content). Dashboard integration: - o key switches to in-TUI optimize view, b key goes back - Background scan on load, o button only when findings exist - Per-project Context Budget column in By Project panel showing estimated per-call overhead (system + MCP tools + skills + CLAUDE.md) CLI: codeburn optimize [-p period] [--provider]	2026-04-16 04:04:37 -07:00
Christopher Sant	5cdcd2c2ee	feat: add rolling 30-day window to menubar status Add a '30 Days' section to the menubar format output, positioned between '7 Days' and 'Month' to match the tab order in the interactive report. The rolling 30-day date range logic already existed (used by report and export commands) - this wires it into the status menubar renderer. Both 30 Days (rolling window) and Month (calendar month) are shown, giving useful context early in the month when the calendar month total is nearly empty.	2026-04-15 21:48:20 -07:00
AgentSeal	91edd6f14a	chore: remove 120-day period, shrink Cursor lookback to 35 days 120 days was for testing. Max dashboard period is 30 days, so Cursor SQL lookback is now 35 days (30 + margin for This Month). Smaller scan window = faster cold starts, smaller cache file.	2026-04-15 05:24:34 -07:00
AgentSeal	5da6e772e4	feat: change 90-day period to 120 days	2026-04-15 04:05:12 -07:00
AgentSeal	b7b7b2c7d6	perf: lazy-load cursor provider to eliminate startup overhead Cursor module (sqlite.ts, better-sqlite3) now only loads when cursor provider is actually requested. Claude/Codex startup is unaffected -- cursor import never happens unless needed.	2026-04-15 03:59:49 -07:00

1 2

65 commits