Draft docs/benchmarks/report-v1.md head-to-head benchmarks skeleton #119

Open
opened 2026-05-27 09:07:13 -03:00 by navigator · 3 comments
Owner

Goal

Create the head-to-head benchmark report skeleton at docs/benchmarks/report-v1.md, listed as deliverable #5 in PLAN.md Section 10. The doc structures Phase 1 simulation-based performance comparisons (baseline Gemmini vs single-chip FluidPopSoC vs 8-chip board) so benchmark work has a canonical landing spot.

Sections to include:

  • Overview (scope: simulation-based Phase 1 numbers only; no silicon results)
  • Methodology (Verilator sim clock ~10 kHz per Section 8.3; cycle accounting; what counts as 'one inference')
  • Baselines (upstream GemminiRocketConfig from Phase 2 — reference baseline-gemmini-rocket.md)
  • Single-chip FluidPopSoC results (placeholder table; expected 2.5-3.5x speedup per Section 8.3)
  • 8-chip board results (placeholder table; expected 4-6x speedup over single chip per Section 8.3)
  • Workloads (ResNet-50 single-chip; Llama-7B 4-bit 8-chip partitioned per Section 8.3)
  • Caveats (sim-only; honest reporting per Section 9.3; analytic model vs measured per Section 14.6)
  • Open questions (which secondary metrics — power, BW utilization, scratchpad hit rate)

Acceptance criteria

  • docs/benchmarks/report-v1.md exists with the sections above
  • Status: Draft skeleton + Owner: TBD header
  • Each section has TODO markers and 2-3 line intent description
  • Placeholder tables present with header rows; all data cells TBD
  • Explicitly states sim-only scope and references Section 9.3 honest-reporting language
  • References Sections 7 (baseline), 8.3 (acceptance targets), 9.3 (honest reporting), 14.6 (analytic models)
  • docs/benchmarks/README.md created or updated to list this deliverable

Plan refs

Section 10 (deliverable #5), Section 7 (baseline benchmark), Section 8.3 (testbench plan + acceptance criteria), Section 9.3 (honest reporting), Section 14.6 (analytic performance models)

Notes

Skeleton only — actual numbers come from Phases 2-3 simulation runs. Do not fabricate any benchmark numbers; tables stay empty until measured.

## Goal Create the head-to-head benchmark report skeleton at `docs/benchmarks/report-v1.md`, listed as deliverable #5 in PLAN.md Section 10. The doc structures Phase 1 simulation-based performance comparisons (baseline Gemmini vs single-chip FluidPopSoC vs 8-chip board) so benchmark work has a canonical landing spot. Sections to include: - Overview (scope: simulation-based Phase 1 numbers only; no silicon results) - Methodology (Verilator sim clock ~10 kHz per Section 8.3; cycle accounting; what counts as 'one inference') - Baselines (upstream `GemminiRocketConfig` from Phase 2 — reference `baseline-gemmini-rocket.md`) - Single-chip FluidPopSoC results (placeholder table; expected 2.5-3.5x speedup per Section 8.3) - 8-chip board results (placeholder table; expected 4-6x speedup over single chip per Section 8.3) - Workloads (ResNet-50 single-chip; Llama-7B 4-bit 8-chip partitioned per Section 8.3) - Caveats (sim-only; honest reporting per Section 9.3; analytic model vs measured per Section 14.6) - Open questions (which secondary metrics — power, BW utilization, scratchpad hit rate) ## Acceptance criteria - [ ] `docs/benchmarks/report-v1.md` exists with the sections above - [ ] `Status: Draft skeleton` + `Owner: TBD` header - [ ] Each section has `TODO` markers and 2-3 line intent description - [ ] Placeholder tables present with header rows; all data cells TBD - [ ] Explicitly states sim-only scope and references Section 9.3 honest-reporting language - [ ] References Sections 7 (baseline), 8.3 (acceptance targets), 9.3 (honest reporting), 14.6 (analytic models) - [ ] `docs/benchmarks/README.md` created or updated to list this deliverable ## Plan refs Section 10 (deliverable #5), Section 7 (baseline benchmark), Section 8.3 (testbench plan + acceptance criteria), Section 9.3 (honest reporting), Section 14.6 (analytic performance models) ## Notes Skeleton only — actual numbers come from Phases 2-3 simulation runs. Do not fabricate any benchmark numbers; tables stay empty until measured.
Author
Owner
No description provided.
<!-- agent:claim by=dispatcher run=20260527T121054Z_issue119 ts=1779883855 -->
Author
Owner

Every acceptance criterion in this issue is already satisfied by docs/benchmarks/report-v1.md (landed in PR #90, commit b7962f3) and docs/benchmarks/README.md (landed alongside the baseline skeleton in PR #101, commit a0047d3).

Verification against issue #119 acceptance criteria:

  • docs/benchmarks/report-v1.md exists with all 8 sections (Overview, Methodology, Baselines, Single-chip FluidPopSoC, 8-chip board, Workloads, Caveats, Open questions).
  • Status: Draft skeleton (line 6) + Owner: TBD (line 7) header.
  • Each section opens with a TODO: marker and 2–3 lines of intent.
  • Placeholder tables present with header rows; every data cell is TBD (sections 3, 4, 5).
  • Sim-only scope stated explicitly (lines 22–25) and PLAN.md Section 9.3 honest-reporting language cited (lines 19–20, 25, 112–113).
  • References to PLAN.md Sections 7, 8.3, 9.3, 14.6 all present in the header block (lines 8–13) and reinforced in body sections.
  • docs/benchmarks/README.md lists report-v1.md as deliverable #5 with Plan ref + Status.

No new code or doc change is required. Recommend closing this issue as a duplicate of work already completed in #90 / #101.

<!-- agent:fail reason=blocker:already-resolved --> Every acceptance criterion in this issue is already satisfied by `docs/benchmarks/report-v1.md` (landed in PR #90, commit `b7962f3`) and `docs/benchmarks/README.md` (landed alongside the baseline skeleton in PR #101, commit `a0047d3`). Verification against issue #119 acceptance criteria: - [x] `docs/benchmarks/report-v1.md` exists with all 8 sections (Overview, Methodology, Baselines, Single-chip FluidPopSoC, 8-chip board, Workloads, Caveats, Open questions). - [x] `Status: Draft skeleton` (line 6) + `Owner: TBD` (line 7) header. - [x] Each section opens with a `TODO:` marker and 2–3 lines of intent. - [x] Placeholder tables present with header rows; every data cell is `TBD` (sections 3, 4, 5). - [x] Sim-only scope stated explicitly (lines 22–25) and PLAN.md Section 9.3 honest-reporting language cited (lines 19–20, 25, 112–113). - [x] References to PLAN.md Sections 7, 8.3, 9.3, 14.6 all present in the header block (lines 8–13) and reinforced in body sections. - [x] `docs/benchmarks/README.md` lists `report-v1.md` as deliverable #5 with Plan ref + Status. No new code or doc change is required. Recommend closing this issue as a duplicate of work already completed in #90 / #101.
Author
Owner
No description provided.
<!-- agent:fail reason=no-pr-marker run=20260527T121054Z_issue119 -->
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
Fluid/fluidpop-v1#119
No description provided.