<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Technical Reports on Todd W. Bucy — Research Blog</title><link>https://toddwbucy.github.io/toddwbucy_research_blog/projects/weavertools/reports/</link><description>Recent content in Technical Reports on Todd W. Bucy — Research Blog</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Wed, 27 May 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://toddwbucy.github.io/toddwbucy_research_blog/projects/weavertools/reports/index.xml" rel="self" type="application/rss+xml"/><item><title>WeaverTools — Current-State Snapshot (2026-05-14)</title><link>https://toddwbucy.github.io/toddwbucy_research_blog/projects/weavertools/reports/current-state-2026-05-14/</link><pubDate>Thu, 14 May 2026 00:00:00 +0000</pubDate><guid>https://toddwbucy.github.io/toddwbucy_research_blog/projects/weavertools/reports/current-state-2026-05-14/</guid><description>&lt;p&gt;&lt;em&gt;Methodology:&lt;/em&gt; Static analysis only — no tests run, no services hit. Every claim cites a file path. The intent is to ground the accompanying &lt;code&gt;weavertools-PRD.md&lt;/code&gt; (delivered in the same PR as this snapshot) in code reality, not in PRD-claim reality.&lt;/p&gt;
&lt;h2 id="0-tldr"&gt;0. TL;DR&lt;a class="anchor" href="#0-tldr"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;WeaverTools is a Rust-native harness that runs local language models as security-isolated agents (one OS user per agent) backed by per-agent ArangoDB memory graphs and a shared GPU-resident inference server. At HEAD, the system can provision OS users and databases, spawn agents driven by &lt;code&gt;agents/*.yaml&lt;/code&gt;, and dispatch HeroBench benchmark tasks via &lt;code&gt;weaver agent start &amp;lt;task.yaml&amp;gt;&lt;/code&gt;. The nap (context-pressure memory offload) path fires on KV budget exhaustion and writes summaries to &lt;code&gt;belief_nodes&lt;/code&gt; with embed-on-write stamping (Pen + engine-nap both wired). The in-process Rust candle encoder (&lt;code&gt;EmbedderClient&lt;/code&gt;) is the sole embedder backend; the Python gRPC service is retired. OpenInference JSONL traces are emitted per run with &lt;code&gt;llm.sampling&lt;/code&gt;, &lt;code&gt;llm.weights_blake3&lt;/code&gt;, &lt;code&gt;encoder.weights_blake3&lt;/code&gt;, and &lt;code&gt;agent.prompt_stack&lt;/code&gt; attributes. Stubs: &lt;code&gt;kind: chat&lt;/code&gt; returns an error, while &lt;code&gt;kind: chatroom&lt;/code&gt; and &lt;code&gt;kind: training-loop&lt;/code&gt; both hit &lt;code&gt;UnimplementedHandler&lt;/code&gt;. The HNSW vector index on &lt;code&gt;belief_nodes.embedding&lt;/code&gt; is defined in code, but its creation at collection-bootstrap time has not been verified by a passing integration test; the preseed materializer (batch embed-on-write at agent-load) is wired and tested.
The decoder-side candle stack in &lt;code&gt;weaver-spu&lt;/code&gt; is present but the live inference path in production still uses the llama.cpp GGUF backend; the pure-candle decoder is Phase 2 work.&lt;/p&gt;</description></item><item><title>WeaverTools — Current-State Snapshot (2026-05-24)</title><link>https://toddwbucy.github.io/toddwbucy_research_blog/projects/weavertools/reports/current-state-2026-05-24/</link><pubDate>Sun, 24 May 2026 00:00:00 +0000</pubDate><guid>https://toddwbucy.github.io/toddwbucy_research_blog/projects/weavertools/reports/current-state-2026-05-24/</guid><description>&lt;p&gt;Predecessor snapshot: &lt;a href="https://toddwbucy.github.io/toddwbucy_research_blog/projects/weavertools/reports/current-state-2026-05-14/"&gt;&lt;code&gt;current-state-2026-05-14&lt;/code&gt;&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Methodology:&lt;/em&gt; Static analysis + targeted &lt;code&gt;grep&lt;/code&gt;/file reads; two code-grounded subagent passes (apparatus + attribution instrument). &lt;strong&gt;No tests run, no services hit, no sweep executed.&lt;/strong&gt; Every claim cites a file path or PR. This snapshot exists to refresh the &lt;code&gt;weavertools-PRD.md&lt;/code&gt; anchor, whose &amp;ldquo;verified commitments&amp;rdquo; table is now significantly out of date, and to answer one question: &lt;em&gt;how far are we from running proper experiments?&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope caveat — the memory/GNN cluster is a moving target.&lt;/strong&gt; A parallel session is actively pushing on the memory system; the inductive GNN build-out (GraphSAGE-class &lt;code&gt;graphsage-v1&lt;/code&gt;, replacing the &lt;code&gt;mean-pool-v0&lt;/code&gt; scaffold) begins ~2026-05-25. Status claims about surfacing / GNN / working-memory / engagement-edge code are bracketed as &lt;strong&gt;in-flux&lt;/strong&gt; throughout and should not be treated as stable.&lt;/p&gt;</description></item><item><title>WeaverTools — Current-State Snapshot (2026-05-27)</title><link>https://toddwbucy.github.io/toddwbucy_research_blog/projects/weavertools/reports/current-state-2026-05-27/</link><pubDate>Wed, 27 May 2026 00:00:00 +0000</pubDate><guid>https://toddwbucy.github.io/toddwbucy_research_blog/projects/weavertools/reports/current-state-2026-05-27/</guid><description>&lt;p&gt;Predecessor: &lt;a href="https://toddwbucy.github.io/toddwbucy_research_blog/projects/weavertools/reports/current-state-2026-05-24/"&gt;&lt;code&gt;current-state-2026-05-24&lt;/code&gt;&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Methodology: different from the prior two snapshots.&lt;/em&gt; 05-14 and 05-24 were &lt;strong&gt;static analysis only&lt;/strong&gt; (&amp;ldquo;no tests run, no services hit&amp;rdquo;). &lt;strong&gt;This one is static analysis + live verification.&lt;/strong&gt; During this session, the daemon was rebuilt, reinstalled, and restarted; a cohort of four fresh agents was run end-to-end; per-agent ArangoDB databases were queried directly; and the seed-deletion bug was isolated by a controlled create→load→inspect sequence. Claims tagged &lt;strong&gt;[live]&lt;/strong&gt; were observed on the running system; &lt;strong&gt;[code]&lt;/strong&gt; claims are file-grounded; &lt;strong&gt;[proposed]&lt;/strong&gt; claims are PRD/decision items not yet built. This snapshot exists to ground an imminent &lt;strong&gt;direction decision&lt;/strong&gt;; read §9 first if that&amp;rsquo;s why you&amp;rsquo;re here.&lt;/p&gt;</description></item></channel></rss>