Skip to content

hwLedgerLLM Capacity Planner + Fleet Ledger

Desktop inference runtime with enterprise-grade fleet management

hwLedger

Why hwLedger ​

Every public VRAM calculator (HF Accelerate, can-it-run-llm, LM Studio) gets MoE and MLA wrong. They undercount KV cache and overcount MoE throughput. hwLedger's math core is architecture-keyed: it dispatches per AttentionKind and treats resident-vs-active parameters separately for MoE.

The result: hobbyist-sized fleet with enterprise bones.

Quick Start ​

1. Clone the repository ​

bash
git clone https://github.com/KooshaPari/hwLedger.git
cd hwLedger

2. Build from source ​

bash
cargo build --release

3. Run the planner ​

bash
cargo run --bin hwledger-cli -- plan --model llama-2-70b
Live memory planning with colored VRAM breakdown
Live memory planning with colored VRAM breakdown
Keyframes (0 β€” VLM-friendly)

Loading manifest…

Project Status ​

PhaseStatus
P0 Foundationin progress
P1 Math coreplanned
P2 Ingestion + probeplanned
P3 macOS GUI MVPplanned
P4 Inferenceplanned (macOS only in MVP)
P5 Fleetplanned
P6 Windows GUIdeferred
P7 Linux GUIdeferred

Tracked in AgilePlus: feature hwledger-v1-macos-mvp (run agileplus status).

Documentation Structure ​

  • Architecture β€” System design, component map, and architecture decisions
  • Math Core β€” KV cache formulas with per-architecture derivations
  • Fleet Ledger β€” Fleet wire, cost model, and audit log design
  • Getting Started β€” Installation and first steps
  • Research β€” Archived research briefs on MLX, GPU telemetry, FFI patterns, and more

Tech Stack ​

  • Core: Rust workspace (hwledger-core, -arch, -ingest, -probe, -inference, -ledger, -fleet-proto, -agent, -server, -cli, -ffi)
  • Sidecar: oMlx fork (Apache-2.0, Python/PyObjC) with SSD-paged KV cache
  • Native apps: SwiftUI (macOS) + UniFFI, WinUI 3 + .NET 9 (Windows), Qt 6 + cxx-qt (Linux)
  • Fleet wire: Axum + rustls mTLS, russh agentless, reqwest cloud provider integrations

License ​

Apache-2.0. See LICENSE.


A hobbyist-sized fleet with enterprise bones.

Released under the Apache 2.0 License.