585 B
585 B
Architecture Board
Goal
Build a local-first Openclaw agent that becomes more intelligent over time through:
- typed memory
- typed tool graph
- trajectory logging
- reward signals
- shadow meta-controller
- offline policy learning
- sacred eval gates
Hosts
- Mac Studio: hot-path inference
- openclaw: orchestration and live logging
- Unraid: offline learning, retrieval, replay, eval batch jobs
- Kimi: offline teacher only
Phase A
- Typed trajectory schema
- Reward signals
- Replay buffer + policy stats
- Tool graph
- Uncertainty model
- Shadow meta-controller