<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Kv-Cache on Todd W. Bucy — Research Blog</title><link>https://toddwbucy.github.io/toddwbucy_research_blog/tags/kv-cache/</link><description>Recent content in Kv-Cache on Todd W. Bucy — Research Blog</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Mon, 18 May 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://toddwbucy.github.io/toddwbucy_research_blog/tags/kv-cache/index.xml" rel="self" type="application/rss+xml"/><item><title>Latency Is the Enemy of Agency</title><link>https://toddwbucy.github.io/toddwbucy_research_blog/blog/2026/05/latency-is-the-enemy-of-agency/</link><pubDate>Mon, 18 May 2026 00:00:00 +0000</pubDate><guid>https://toddwbucy.github.io/toddwbucy_research_blog/blog/2026/05/latency-is-the-enemy-of-agency/</guid><description>&lt;p&gt;&lt;strong&gt;Why an Engineer with the Hardware in Front of Him Wants AI to Do Better&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;img src="https://toddwbucy.github.io/toddwbucy_research_blog/images/blog/latency-is-the-enemy-of-agency/title.jpg" alt="" /&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Personal note in a series that has been making architectural arguments. The architectural papers describe what the framework is. This piece describes why someone would build it. Specifically: why I built it, what about current AI deployment offends me as someone who actually understands the hardware, and what the architectural commitments come from.&lt;/em&gt;&lt;/p&gt;</description></item></channel></rss>