Update CONTRIBUTORS.md and the README contributors section to reflect
130 contributors and 568 PR credits as of v0.51.44 (was 66/142 at
v0.50.245). The numbers grew because:
- The previous refresh was 1 release-cycle ago (50+ tags + 8 batch
releases of contributor PRs ago).
- The new counting rule explicitly includes closed-but-absorbed PRs:
PRs whose original branch shows "closed" on GitHub but whose content
shipped via batch-release squash with a Co-authored-by trailer, or
via salvage rewrite with CHANGELOG attribution. This better reflects
what users actually contributed.
The compilation pipeline:
1. Pull every closed PR from gh api (state=closed, both merged and
unmerged on GitHub) — 1421 PRs.
2. Walk CHANGELOG.md release-by-release and extract:
- `PR #N by @user` (canonical bullet form)
- `(#N by @user`, `(PR #N by @user`, `(#N, @user;`
- `PRs #A, #B by @user` (plural)
- `@user — PR #N`, `@user — N PR (#A, #B)`
- `(credit: @user)` and `(credit: @userA and @userB)`
3. For every PR# mentioned in CHANGELOG, union the explicit @-attributed
users with the gh PR author (when external). Maintainer accounts
(@nesquena, @nesquena-hermes) are excluded.
4. For PRs merged on GitHub but not mentioned in CHANGELOG (very early
PRs, non-noteworthy direct merges), credit the gh author.
5. Three salvaged-design contributors not directly in CHANGELOG are
credited in the special-thanks roll: @indigokarasu (#213 →
v0.50.0 design language), @andrewy-wizard (#177 → initial Chinese
locale absorbed into v0.42.0), @zenc-cp (#133 → anti-hallucination
guard absorbed into streaming.py).
Pre-cleaning step strips HTML entities (` ` etc.) before PR# scan
to avoid false matches. PR# regex requires a whitespace/paren/bracket
preceder so identifiers like `--key=123` and `(##10`-style headings
don't pollute the count.
Per-user first/last release computed from:
- For merged-on-GH PRs: the smallest tag whose creator-date is >= the
PR's merged_at timestamp.
- For absorbed PRs: the release section in CHANGELOG that explicitly
attributes to the user (or the earliest release that mentions the
PR# if no explicit attribution exists for that user).
CONTRIBUTORS.md sections:
- Top contributors (5+ PRs) — 20 people, ranked
- Sustained contributors (3–4 PRs) — 11 people
- Two-PR contributors — 14 people, flat list
- Single-PR contributors — 85 people, flat list
- How credit is tracked — four paths described
- Special thanks — 11 highlight blurbs
README contributors section trimmed to top-10 table + notable-
contribution blurbs (29 distinct contributors mentioned with concrete
PR numbers). Same data, condensed for the README.
No code changes. Docs only.
- New CONTRIBUTORS.md: full ranked credit roll for all 66 contributors
(5+ tiers), with first/latest release versions, single-PR roll, and
attribution methodology. Generated from git log + gh pulls API +
CHANGELOG mention parsing.
- README.md: stack-ranked top-10 contributors table at the top of the
Contributors section, link to CONTRIBUTORS.md for the full list.
Updated test count (1898 → 3309). Refreshed @franksong2702 and
@bergeouss entries to reflect their broader bodies of work (now
the #1 and #2 external contributors).
- ARCHITECTURE.md: removed stale 'tracks upstream v0.50.36' header;
bumped current shipped build to v0.50.245 with current architecture
state notes (streaming-markdown vendoring, byte-range streaming,
configurable-model-badges).
- ROADMAP.md / SPRINTS.md / TESTING.md: header/last-updated bumps to
v0.50.245 and 3309 tests. SPRINTS.md 'Where we are now' section
refreshed for current CLI/Claude parity (~95% Claude parity now).
Generated by aggregating CHANGELOG attribution lines, gh PR API
authors, and CHANGELOG version-section walks. Internal/bot accounts
filtered out.