<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
  <title>Off-by-none Serverless Newsletter</title>
  <subtitle>Stay up to date on using serverless to build modern applications in the cloud. Get insights from experts, product releases, industry happenings, tutorials and much more, every week!</subtitle>
  <link href="https://offbynone.io/feed/" rel="self"/>
  <link href="https://offbynone.io/"/>
  <updated>2026-06-16T12:59:30Z</updated>
  <id>https://offbynone.io/</id>
  <author>
    <name>Jeremy Daly</name>
    <email>contact@jeremydaly.com</email>
  </author>
  <entry>
    <title>Issue #368: Claude Fable 5 is currently unavailable 🚫</title>
    <link href="https://offbynone.io/issues/368/"/>
    <updated>2026-06-16T12:00:00Z</updated>
    <summary>In this issue, the US government orders Anthropic to pull Fable 5 and Mythos 5, AWS WAF starts charging AI bots for content, and Bedrock adds Grok, Gemma, and a pair of GPTs.</summary>
    <id>https://offbynone.io/issues/368/</id>
    <content type="html">&lt;h2&gt;Claude Fable 5 is currently unavailable 🚫&lt;/h2&gt;
&lt;p&gt;In our &lt;a href=&quot;https://offbynone.io/issues/367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;previous issue&lt;/a&gt;, Anthropic shipped two major models, DynamoDB got &amp;quot;extended&amp;quot; to run locally on Postgres, and Aurora DSQL added JSONB support. In this issue, the US government orders Anthropic to pull Fable 5 and Mythos 5, AWS WAF starts charging AI bots for content, and Bedrock adds Grok, Gemma, and a pair of GPTs. Plus, we&#39;ve got lots of great content from the cloud, serverless, and AI communities.&lt;/p&gt;
&lt;h3&gt;News &amp;amp; Announcements&lt;/h3&gt;
&lt;p&gt;The biggest story from last week is a takedown, not a product launch. &lt;a href=&quot;https://www.anthropic.com/news/fable-mythos-access?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Anthropic published a statement responding to a US government directive to suspend access to Fable 5 and Mythos 5&lt;/a&gt; over &amp;quot;national security concerns.&amp;quot; Anthropic walks through its defense-in-depth approach and argues the jailbreak vulnerabilities that triggered the order are about the same as the ones in models that are still happily serving traffic to North Korea (I may have made that last part up). Two weeks ago Fable 5 was the first generally available Mythos-class model on AWS. Now it&#39;s &lt;em&gt;gone&lt;/em&gt;. Whatever you think of the merits, watching a government switch off a frontier model overnight is a preview of a world many of us haven&#39;t planned for. The residency assumptions baked into your architecture diagrams may be softer than you think.&lt;/p&gt;
&lt;p&gt;Speaking of Bedrock, the model menu keeps growing. &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/grok-amazon-bedrock?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Grok 4.3 from xAI is now available on Amazon Bedrock&lt;/a&gt; with configurable reasoning effort levels, running on Mantle, the new inference engine AWS built for price performance. &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/gemma-4-amazon-bedrock?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Google DeepMind&#39;s Gemma 4 family landed too&lt;/a&gt;, three open-weight variants with reasoning, multimodal understanding across text, image, video, and audio, native function calling, 35+ languages, and 256K-token context windows (the &lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/introducing-gemma-4-models-on-amazon-bedrock?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS ML blog has the deeper writeup&lt;/a&gt; covering the bedrock-mantle endpoint and OpenAI-compatible APIs). And &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/openai-gpt-us-east-virginia-amazon?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;OpenAI&#39;s GPT-5.4 and GPT-5.5 are now in US East (N. Virginia)&lt;/a&gt;, both with 272K-token context and Responses API streaming, GPT-5.5 aimed at coding and research and GPT-5.4 at production reasoning. Three model families through one endpoint is great until the bill shows up, which is why the cost attribution work AWS has been doing will eventually pay off.&lt;/p&gt;
&lt;p&gt;The item I keep coming back to is the one that turns your bot traffic into a revenue line. &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/aws-waf-ai-traffic-monetization?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS WAF announced AI traffic monetization&lt;/a&gt; using the x402 protocol for machine-to-machine payments, letting publishers set differentiated pricing for AI bots and collect stablecoin payouts through Coinbase. The &lt;a href=&quot;https://aws.amazon.com/blogs/aws/aws-waf-adds-ai-traffic-monetization-capability-to-help-content-owners-charge-ai-bots-for-content-access?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS blog has the mechanics&lt;/a&gt;: WAF returns an HTTP 402 with a machine-readable JSON price manifest, works with CloudFront distributions, and settles through Coinbase&#39;s x402 Facilitator. It&#39;s not happening in a vacuum, either. &lt;a href=&quot;https://techcrunch.com/2026/05/28/visa-invests-in-replit-to-power-agentic-payments-for-developers?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Visa invested in Replit to power agentic payments for developers&lt;/a&gt;, including work on Visa&#39;s Trusted Agent Protocol, so the plumbing for agents that pay for things is getting built on multiple fronts.&lt;/p&gt;
&lt;p&gt;Agent platforms keep on maturing as well. &lt;a href=&quot;https://claude.com/blog/whats-new-in-claude-managed-agents?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude Managed Agents added scheduled deployments and environment vaults&lt;/a&gt;, with Rakuten and Notion already running recurring spreadsheet analysis and report generation, plus Browserbase and KERNEL integrations for browser work. &lt;a href=&quot;https://openai.com/index/openai-to-acquire-ona?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;OpenAI is acquiring Ona&lt;/a&gt; to give Codex persistent cloud execution, so agents can grind on a task for hours or days inside a customer-controlled environment. OpenAI also struck a deal to &lt;a href=&quot;https://openai.com/index/openai-on-oracle-cloud?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;let OCI customers reach its models and Codex through Oracle Universal Credits&lt;/a&gt;, wiring AI spend into existing enterprise purchasing. And &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/opensearch-agentic-observability-mcp-app?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon OpenSearch Service launched MCP Apps for agentic observability&lt;/a&gt;, letting agents dig into logs, traces, metrics, and alerts for root cause analysis from inside Claude Desktop or VS Code.&lt;/p&gt;
&lt;p&gt;On the data and ops side, &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/agentcore-memory-scmetadata?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Bedrock AgentCore Memory now supports strictly consistent metadata for long-term memory&lt;/a&gt;, so you can attach values from your application that pass through without LLM inference. That gives you department-scoped retrieval, compliance boundaries, and multi-tenant memory where each tenant gets processed on its own, which is the kind of thing that sounds boring until you try to build memory for more than one customer. And &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-cross-account-metrics-centralization?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudWatch added cross-account metrics centralization&lt;/a&gt; through AWS Organizations, replicating metrics from many accounts and regions into one destination account for unified monitoring and governance.&lt;/p&gt;
&lt;p&gt;A few more worth your attention. &lt;a href=&quot;https://aws.amazon.com/blogs/architecture/introducing-the-snowflake-and-aws-custom-lens-for-the-aws-well-architected-framework?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS and Snowflake released a joint Custom Lens for the Well-Architected Framework&lt;/a&gt;, folding both platforms&#39; best practices into one review across seven pillars, so you can stop juggling two separate sets of guidance. &lt;a href=&quot;https://aws.amazon.com/blogs/developer/aws-cli-v1-maintenance-mode-announcing-changes-to-dependency-updates?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS CLI v1 is entering maintenance mode in July 2026&lt;/a&gt;, with botocore and s3transfer vendored directly into the codebase, which means if you&#39;re running CLI v1 and boto3 side by side, they&#39;ll each carry their own copies from here on out. And &lt;a href=&quot;https://kiro.dev/blog/kiro-pro-max?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Kiro shipped a $100/mo Pro Max tier&lt;/a&gt; with more credits and access to all premium models. The jump from $40 to $200 was definitely a bit much for your average user, so dropping a tier right in the middle is a smart read of who actually churns.&lt;/p&gt;
&lt;p&gt;Finally, I shipped a new Prisma 7 adapter in the &lt;a href=&quot;https://www.jeremydaly.com/data-api-client-v2-4-prisma-support?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Data API Client v2.4&lt;/a&gt;, so now you can point Prisma, Knex, Drizzle, or Kysely at the RDS Data API for Provisioned or Serverless Aurora clusters without a connection pool or VPC.&lt;/p&gt;
&lt;h3&gt;Tutorials&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/ai-agent-failure-detection-and-root-cause-analysis-with-strands-evals?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AI Agent Failure Detection and Root Cause Analysis with Strands Evals&lt;/a&gt; by Po-Shin Chen&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/evaluate-ai-agents-systematically-with-agent-evalkit?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Evaluate AI agents systematically with Agent-EvalKit&lt;/a&gt; by Ishan Singh&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/architecture/how-samsung-achieved-real-time-pricing-with-aws-lambda-response-streaming?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;How Samsung achieved real-time pricing with AWS Lambda Response Streaming&lt;/a&gt; by Vijay Naik&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/aws-heroes/serverless-applications-on-aws-with-lambda-using-java-25-api-gateway-and-aurora-dsql-lambda-4hbj?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless applications on AWS with Lambda using Java 25, API Gateway and Aurora DSQL - Lambda performance optimization approaches&lt;/a&gt; by Vadym Kazulkin&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/qasim157/run-your-email-agent-on-serverless-42d2?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Run Your Email Agent on Serverless&lt;/a&gt; by Qasim Muhammad&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://medium.com/@yogigupta79/the-death-of-tmp-s3-mounting-for-lambda-is-a-game-changer-f394d456be2c?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;The Death of /tmp: S3 Mounting for Lambda is a Game-Changer&lt;/a&gt; by Yogesh Gupta&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://medium.com/@chiragmehta900/cut-your-aws-fargate-bill-by-40-10-waste-patterns-i-fixed-in-production-d657d61469d2?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Cut Your AWS Fargate Bill by 40% — 10 Waste Patterns I Fixed in Production&lt;/a&gt; by Chirag Mehta&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://sodkiewiczm.medium.com/mcp-apps-because-your-users-deserve-more-than-a-wall-of-text-96e2fda9c9d9?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;MCP Apps: Because Your Users Deserve More Than a Wall of Text&lt;/a&gt; by Maciej Sodkiewicz&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Reads&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/how-frontier-teams-are-reinventing-ai-native-development?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;How frontier teams are reinventing AI-native development&lt;/a&gt;&lt;br /&gt;
Swami details three approaches AWS used to test AI-native workflows, including pathfinder initiatives and structured sprints, and lays out five practices for teams restructuring around autonomous agents. If you&#39;re still treating AI as a fancier autocomplete, this is a nudge to think bigger about how the work itself changes.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://dev.to/jverhoeks/the-review-bottleneck-rethinking-software-and-infrastructure-design-for-the-agent-era-752?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;The Review Bottleneck: Rethinking Software and Infrastructure Design for the Agent Era&lt;/a&gt;&lt;br /&gt;
A look at how coding agents moved the delivery bottleneck from writing code to reviewing and coordinating it. The proposed fixes, bounded contexts, contract-driven development, and pushing review upstream to intent instead of output, line up with what a lot of teams are feeling right now but haven&#39;t named yet.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://charity.wtf/2026/06/15/ai-demands-more-engineering-discipline-not-less-xpost?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AI demands more engineering discipline. Not less.&lt;/a&gt;&lt;br /&gt;
Charity Majors makes the case for what she calls Phoenix Architectures, where code becomes a materialized view you can regenerate once it goes stale. She draws the line from immutable infrastructure to treating AI-generated code as disposable, with validation moving to production. Classic Charity, and definitely worth your time.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://claude.com/blog/building-with-claude-managed-agents?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;The evolution of agentic surfaces: building with Claude Managed Agents&lt;/a&gt;&lt;br /&gt;
Anthropic introduces Claude Managed Agents as a set of composable APIs for production agents, handling orchestration, session management, credential isolation, and observability so teams can spend their time on context management instead of babysitting execution harnesses. Pairs well with the scheduling and vaults news above.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://dev.to/amitkayal/takeway-from-aws-generative-ai-lens-14dj?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Takeaways from AWS Generative AI Lens&lt;/a&gt;&lt;br /&gt;
Amit Kayal breaks down the AWS Generative AI Lens with a focus on controlled AI-assisted workflows versus fully autonomous agents, walking through when AI should classify, when it should recommend, and when it should actually execute. The data governance and multi-tenant sections are the parts I&#39;d read twice.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://serverlessdna.com/strands/lambda/vpc?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Lambda in a VPC Is Fine&lt;/a&gt;&lt;br /&gt;
Michael Walmsley walks through the evolution of Lambda VPC networking, from the painful 2016 days of on-demand ENI creation to today&#39;s Hyperplane implementation. If you&#39;re still repeating the old &amp;quot;never put Lambda in a VPC&amp;quot; advice, this explains why it stopped being true years ago.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://thenewstack.io/aws-opensearch-serverless-agentic-rebuild?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Why AWS scrapped OpenSearch&#39;s architecture to chase agent workloads&lt;/a&gt;&lt;br /&gt;
Frederic Lardinois of The New Stack covers AWS&#39;s near-complete rebuild of OpenSearch Serverless, with separated storage and compute that scales to zero when idle and auto-scales 20x faster than before. It&#39;s built for the burst-and-idle usage that agent workloads generate, with log analytics arriving in June and agent memory features in H2 2026.&lt;/p&gt;
&lt;h3&gt;New from AWS&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-route-53-resolver-dns?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Route 53 Resolver DNS Firewall now supports Palo Alto Networks Advanced DNS Security (Preview)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-cloudwatch-log-analytics?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudWatch introduces Log Analytics for unified log analysis&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/aws-lambda-managed-instances-tag-propagation?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Lambda Managed Instances now supports Tag Propagation for Managed Resources&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/aws-management-console-private?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Management Console Private Access now works without internet connectivity&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-mwaa-serverless-eventbridge?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon MWAA Serverless now supports Amazon EventBridge notifications&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-sagemaker-ft-nemotron-3?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;SageMaker AI now supports serverless fine-tuning for NVIDIA Nemotron models&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/aws-cost-explorer?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS launches Cost Explorer historical data retention for accounts in billing groups&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-cloudwatch-query-studio-generally-available?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudWatch Query Studio is now generally available&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/cloudwatch-application-signals-supports%20infrastructure-logs-traces-context-for-faster%20troubleshooting?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudWatch Application Signals now supports infrastructure, logs, and traces context for faster troubleshooting&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/aws-cost-usage-report?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Cost and Usage Report 2.0 now supports table configurations update&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-sagemaker-unified-studio-emr?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon SageMaker Unified Studio Notebooks now support EMR Serverless&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/aws-cli-agent-toolkit?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;The AWS Command Line Interface (CLI) now supports the Agent Toolkit for AWS&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/aws-workload-credentials-provider?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS announces AWS Workload Credentials Provider&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Security&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://securosis.com/blog/aws-destroyed-the-value-proposition-for-bedrock?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23368&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Destroyed the Value Proposition for Bedrock&lt;/a&gt; by Chris Farris&lt;br /&gt;
Chris digs into the part of the Fable 5 and Mythos 5 launch nobody put in the headline: the only allowed retention mode for these models on Bedrock is &lt;code&gt;provider_data_share&lt;/code&gt;. Using them means your prompts and outputs leave the AWS boundary, land with Anthropic for 30 days, and become subject to human review. That breaks the neutral-broker guarantee that sent regulated and European shops to Bedrock in the first place. He walks through the compliance fallout and the SCP you should deploy today to deny anything other than &lt;code&gt;none&lt;/code&gt;. Read this before you point a workload at either model, assuming they get turned back on.&lt;/p&gt;
&lt;h3&gt;From Socials&lt;/h3&gt;
&lt;blockquote class=&quot;twitter-tweet&quot;&gt;&lt;p lang=&quot;en&quot; dir=&quot;ltr&quot;&gt;Just spent the last two weeks reworking my local Agent Hub system to use &lt;a href=&quot;https://x.com/opencode?ref_src=twsrc%5Etfw&quot;&gt;@opencode&lt;/a&gt; as the harness with qwen, gemma4, and mistral local models. Then I get this at 7:01pm. 😑 &lt;a href=&quot;https://t.co/T9as70aqdQ&quot;&gt;pic.twitter.com/T9as70aqdQ&lt;/a&gt;&lt;/p&gt;&amp;mdash; Jeremy Daly (@jeremy_daly) &lt;a href=&quot;https://x.com/jeremy_daly/status/2066673284888891725?ref_src=twsrc%5Etfw&quot;&gt;June 16, 2026&lt;/a&gt;&lt;/blockquote&gt; &lt;script async=&quot;&quot; src=&quot;https://platform.x.com/widgets.js&quot; charset=&quot;utf-8&quot;&gt;&lt;/script&gt;
I&#39;m not sure whether to be excited by this message, or if I should prepare for another rug pull. Either way, it forced me down an interesting multi-harness orchestration path.
&lt;h3&gt;Final Thoughts 🤔&lt;/h3&gt;
&lt;p&gt;HTTP 402 had been sitting in the spec since the early 90s with a note that said &amp;quot;reserved for future use.&amp;quot; For three decades it was the status code nobody got to use, a placeholder for a payment layer the web never seemed to materialize. Then about a year ago, Coinbase introduced &amp;quot;x402: An open standard for&lt;br /&gt;
internet-native payments.&amp;quot; Wait, did the crypto bros get it right? 😬 (fyi, I&#39;m still a hard no on that)&lt;/p&gt;
&lt;p&gt;AWS WAF now returns a 402 with a machine-readable price manifest when an AI bot asks for your content. The bot&#39;s agent reads the manifest, pays in stablecoin through Coinbase&#39;s x402 facilitator, and gets the content. No human in the loop and no checkout page. At the same time, Visa is putting money into Replit to build agentic payments and pushing its Trusted Agent Protocol, so the same machinery is getting assembled by the incumbents who actually move money for a living. When a 30-year-old dead status code and a Visa investment point in the same direction, that&#39;s usually a signal worth paying attention to.&lt;/p&gt;
&lt;p&gt;What&#39;s happening here is a shift in how we treat bots. For most of the web&#39;s history, automated traffic was something you blocked, rate-limited, or grudgingly tolerated. The robots.txt era assumed crawlers were either friendly enough to respect a text file or hostile enough to fight. Now there&#39;s a third option: charge them. If an agent wants your content badly enough to pay for it, you can let it, and you can put a number on exactly how much that access is worth.&lt;/p&gt;
&lt;p&gt;I&#39;m not sure this scales, and there are real reasons for skepticism. Stablecoin payouts assume a settlement story most finance teams haven&#39;t signed off on. Differentiated pricing for bots assumes agents will agree to pay instead of routing around you, and the whole thing has a chicken-and-egg problem where it only matters once enough agents speak the protocol and enough publishers demand payment. None of that is solved. But the direction is clear, and for the first time the economics of serving an AI bot aren&#39;t automatically negative.&lt;/p&gt;
&lt;p&gt;There&#39;s a question worth thinking about if you run content or an API. &amp;quot;Block all bots&amp;quot; is no longer the only defensive move available to you. The more interesting question is which agents you&#39;d actually want to charge, which ones you&#39;d serve for free because they send value back, and what your content is worth to a machine that has a budget and no patience for a paywall modal. That&#39;s a pricing exercise, not a security one, and most of us have never had to think about it. We probably should start.&lt;/p&gt;
&lt;p&gt;See you next week,&lt;br /&gt;
Jeremy&lt;/p&gt;
&lt;hr /&gt;
&lt;p&gt;I hope you enjoyed this newsletter. We&#39;re always looking for ideas and feedback to make it better and more inclusive, so please feel free to reach out to me via &lt;a href=&quot;https://bsky.app/profile/jeremydaly.com&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Bluesky&lt;/a&gt;, &lt;a href=&quot;http://www.linkedin.com/in/jeremydaly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;LinkedIn&lt;/a&gt;, &lt;a href=&quot;http://twitter.com/jeremy_daly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;X&lt;/a&gt;, or &lt;a href=&quot;mailto:contact@jeremydaly.com&quot;&gt;email&lt;/a&gt;.&lt;/p&gt;
</content>
  </entry>
  <entry>
    <title>Issue #367: What did I miss? 🎓</title>
    <link href="https://offbynone.io/issues/367/"/>
    <updated>2026-06-09T12:00:00Z</updated>
    <summary>In this issue, Anthropic ships two major models, DynamoDB gets &#39;extended&#39; to run locally on Postgres, and Aurora DSQL adds JSONB support.</summary>
    <id>https://offbynone.io/issues/367/</id>
    <content type="html">&lt;h2&gt;What did I miss? 🎓&lt;/h2&gt;
&lt;p&gt;I took a couple of weeks off, so we&#39;re playing catch-up. My youngest daughter graduated from high school last week, and between that, the after-prom party she threw at my house, and her graduation party (also at my house), there wasn&#39;t a lot of time left for keeping up with serverless, AI, and cloud. So this one covers about three weeks of news, and it&#39;s a long one. Apologies in advance.&lt;/p&gt;
&lt;p&gt;In this issue, Anthropic ships two major models, DynamoDB gets &amp;quot;extended&amp;quot; to run locally on Postgres, and Aurora DSQL adds JSONB support. Plus, we&#39;ve got plenty of awesome content from the cloud, serverless, and AI communities.&lt;/p&gt;
&lt;h3&gt;News &amp;amp; Announcements&lt;/h3&gt;
&lt;p&gt;Let&#39;s start with the money, because it&#39;s the reason for everything else. &lt;a href=&quot;https://www.anthropic.com/news/series-h?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Anthropic raised a boatload of money&lt;/a&gt;, $65 billion in a Series H at a $965 billion post-money valuation. That kind of capital buys a lot of compute, and the spending showed up almost immediately in the product line.&lt;/p&gt;
&lt;p&gt;First came &lt;a href=&quot;https://www.anthropic.com/news/claude-opus-4-8?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude Opus 4.8&lt;/a&gt;, which introduced dynamic workflows in Claude Code as a research preview, better coding and browser-automation numbers, and effort control settings, all at the same price as Opus 4.7. Then, before anyone had a chance to settle in, Anthropic announced &lt;a href=&quot;https://www.anthropic.com/news/claude-fable-5-mythos-5?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude Fable 5 and Claude Mythos 5&lt;/a&gt;, the first generation of Mythos-class models built for autonomous, professional work. Fable 5 is the one you can actually use, and Mythos 5 remains the locked-down sibling. If you want a second opinion before you commit, Claire Vo&#39;s &lt;a href=&quot;https://www.lennysnewsletter.com/p/claude-fable-5-review-what-the-new?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;review of Fable 5&lt;/a&gt; puts it through three real-world scenarios and is honest about where it falls down.&lt;/p&gt;
&lt;p&gt;AWS, predictably, did not want to be left out. &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/claude-opus-4.8-aws?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude Opus 4.8 landed on AWS&lt;/a&gt; through Bedrock and Claude Platform, and then &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/claude-fable-5-aws?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Fable 5 showed up as the first generally available Mythos-class model on AWS&lt;/a&gt; too, with a &lt;a href=&quot;https://aws.amazon.com/blogs/aws/anthropic-claude-fable-5-on-aws-mythos-class-capabilities-with-built-in-safeguards-now-available?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;longer writeup on the AWS blog&lt;/a&gt; covering the built-in safeguards for autonomous operation. Anthropic wasn&#39;t the only model vendor getting the Bedrock treatment, either. &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-bedrock-openai-models-codex-generally-available?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;OpenAI&#39;s GPT-5.5, GPT-5.4, and Codex are now generally available on Bedrock&lt;/a&gt; with &lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/openai-models-and-codex-on-amazon-bedrock-are-now-generally-available?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;pay-per-token pricing matching OpenAI&#39;s direct rates&lt;/a&gt;, inference staying inside your chosen region, and the usual KMS, VPC, and CloudTrail story for compliance.&lt;/p&gt;
&lt;p&gt;To make all of this easier to work with, Bedrock also shipped a &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-bedrock-redesigned-console-optimized-openai-anthropic-compatible-apis?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;redesigned console optimized for the OpenAI- and Anthropic-compatible APIs&lt;/a&gt; (there&#39;s a &lt;a href=&quot;https://aws.amazon.com/blogs/aws/try-the-new-console-experience-in-amazon-bedrock-optimized-for-anthropic-and-openai-compatible-apis?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;hands-on writeup on the AWS blog&lt;/a&gt;) built around the &lt;code&gt;bedrock-mantle&lt;/code&gt; endpoint, with project-based organization, side-by-side comparisons, and prefilled code snippets. They rounded it out with &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-bedrock-request-level-usage-attribution?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;request-level usage attribution&lt;/a&gt; so you can tag individual inference calls by team or environment, &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-bedrock-supports-cloudwatch-metrics-bedrock-mantle-endpoint?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;CloudWatch metrics for the mantle endpoint&lt;/a&gt;, and &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/5/amazon-bedrock-service-quotas?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;expanded Service Quotas support&lt;/a&gt;. The cost attribution piece is the one I&#39;d pay attention to. Once you&#39;ve got three model families running through one endpoint, knowing which team is spending what stops being optional.&lt;/p&gt;
&lt;p&gt;The agent side of Bedrock kept pace. &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-bedrock-agentcore-runtime?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AgentCore Runtime added interactive shells&lt;/a&gt; via a new &lt;code&gt;InvokeAgentRuntimeCommandShell&lt;/code&gt; API, giving you WebSocket terminal access into a running agent&#39;s microVM to inspect files, run commands, or debug state without losing session context. &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/agentcore-identity-secrets-manager?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AgentCore Identity now lets you bring your own secrets through AWS Secrets Manager&lt;/a&gt;, and &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/aws-step-functions-agentcore?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Step Functions added an AgentCore-powered agentic reasoning step&lt;/a&gt; so you can drop a reasoning task into a state machine without bolting on extra infrastructure. The &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/aws-mcp-server?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS MCP Server picked up cross-account and cross-role access&lt;/a&gt; too, so a coding agent can finally hop between accounts and roles in a single session instead of stopping, swapping credentials, and starting over. Anyone who&#39;s managed agents across more than one account knows exactly how annoying that loop was.&lt;/p&gt;
&lt;p&gt;The most interesting database news of the bunch didn&#39;t get a flashy launch event. &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/aws-extenddb-dynamodb?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS released ExtendDB 0.1&lt;/a&gt;, an open source adapter that implements the DynamoDB API on top of pluggable storage backends, with PostgreSQL as the first reference implementation. That means you can write code against DynamoDB programming patterns and run it locally, in CI, or on-prem against Postgres. I&#39;ve been wanting something like this for years. DynamoDB Local has always been a reasonable stand-in, but a pluggable adapter that lets you point real DynamoDB access patterns at a Postgres backend opens up a lot of testing and migration scenarios that used to be a pain. It&#39;s 0.1, so temper your expectations, but the direction is genuinely useful.&lt;/p&gt;
&lt;p&gt;Aurora DSQL stayed busy, picking up &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-aurora-dsql-supports-jsonb?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;JSONB support with compression on by default&lt;/a&gt;, so you can store semi-structured config and API parameters next to your relational data and let DSQL compress the larger payloads for you. Over in search, the &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-opensearch-serverless-next-generation-generally-available?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;next generation of Amazon OpenSearch Serverless went GA&lt;/a&gt;, and the headline feature is scale-to-zero. There&#39;s a &lt;a href=&quot;https://aws.amazon.com/blogs/aws/introducing-the-next-generation-of-amazon-opensearch-serverless-for-building-your-agentic-ai-applications?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;proper deep-dive on the AWS blog&lt;/a&gt; that leans into the agentic AI angle with instant resource creation and Vercel and Kiro integrations, and OpenSearch Serverless also &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/opensearch-agentic-search?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;added Agentic Search&lt;/a&gt; on top. Scale-to-zero is the big one for me. Vector and search backends that scale to zero change the math on a whole category of side projects and low-traffic workloads that previously couldn&#39;t justify the always-on cost.&lt;/p&gt;
&lt;p&gt;A small but welcome bit of housekeeping: AWS is &lt;a href=&quot;https://aws.amazon.com/blogs/developer/announcing-updated-retry-behavior-for-aws-sdks-and-tools?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;standardizing retry behavior across all SDKs and tools&lt;/a&gt;. The change splits backoff into two strategies, a fast 50ms for transient errors and a slower 1000ms for throttling, which is a more sensible default than treating every failure the same way. It becomes the default in November 2026, but you can opt in today with &lt;code&gt;AWS_NEW_RETRIES_2026=true&lt;/code&gt;. If you&#39;ve ever hand-tuned retry configs to stop hammering a throttled service, this is the kind of quiet fix that saves you from rediscovering the same lesson on the next project.&lt;/p&gt;
&lt;p&gt;There was plenty more from AWS over the past few weeks. &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/aws-finops-agent-preview?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;FinOps Agent went into preview&lt;/a&gt;, answering cost questions and surfacing optimization opportunities out of Cost Optimization Hub and Compute Optimizer. &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-cognito-multi-region?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Cognito added multi-Region replication&lt;/a&gt; as an add-on for Essentials and Plus tier user pools, syncing identities to a standby Region so you can redirect traffic during a regional disruption. And AWS &lt;a href=&quot;https://aws.amazon.com/blogs/aws/meet-our-newest-aws-heroes-may-2026?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;named four new Heroes for May 2026&lt;/a&gt;, with serverless and AI/ML leaders from Italy, Canada, and Argentina. Congratulations to all of them. The community is better for the work you do.&lt;/p&gt;
&lt;p&gt;One last thing from me. I pushed an update to &lt;a href=&quot;https://github.com/jeremydaly/data-api-client?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;data-api-client&lt;/a&gt;, my &lt;code&gt;DocumentClient&lt;/code&gt;-style wrapper for the Amazon Aurora Serverless Data API. If you&#39;re working with the Data API and want the familiar parameter-mapping ergonomics instead of the raw request format, give it a look.&lt;/p&gt;
&lt;h3&gt;Tutorials&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/database/building-type-safe-applications-with-drizzle-orm-in-aurora-dsql?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Building type-safe applications with Drizzle ORM in Aurora DSQL&lt;/a&gt; by Dipen Patel&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/database/pagination-patterns-in-amazon-aurora-dsql?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Pagination patterns in Amazon Aurora DSQL&lt;/a&gt; by Sandhya Khanderia&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/its-safe-to-close-your-laptop-now-hosting-coding-agents-on-amazon-bedrock-agentcore?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;It&#39;s safe to close your laptop now: Hosting coding agents on Amazon Bedrock AgentCore&lt;/a&gt; by Evandro Franco&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/break-the-context-window-barrier-with-amazon-bedrock-agentcore?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Break the context window barrier with Amazon Bedrock AgentCore&lt;/a&gt; by Yuan Tian&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/building-multi-tenant-agents-with-amazon-bedrock-agentcore?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Building multi-tenant agents with Amazon Bedrock AgentCore&lt;/a&gt; by Dhawalkumar Patel&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/database/best-practices-for-amazon-dynamodb-global-tables-part-1-operational-readiness?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Best practices for Amazon DynamoDB Global Tables – Part 1: Operational readiness&lt;/a&gt; by Lee Hannigan&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/database/best-practices-for-amazon-dynamodb-global-tables-part-2-failover-strategies?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Best practices for Amazon DynamoDB Global Tables – Part 2: Failover strategies&lt;/a&gt; by Lee Hannigan&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/database/best-practices-for-amazon-dynamodb-global-tables-part-3-validating-regional-resilience-with-aws-fault-injection-service?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Best practices for Amazon DynamoDB Global Tables – Part 3: Validating regional resilience with AWS Fault Injection Service&lt;/a&gt; by Lee Hannigan&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://gunnargrosch.com/posts/sms-delivery-receipts-on-aws-lambda?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;SMS Delivery Receipts on AWS Lambda&lt;/a&gt; by Gunnar Grosch&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://www.readysetcloud.io/blog/allen.helton/your-agent-is-repeating-itself?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Your agent is repeating itself&lt;/a&gt; by Allen Helton&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://rustysl.com/en/blog/s3-on-demand-archive?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;On-Demand Archives on S3&lt;/a&gt; by Jérémie Rodon&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://darryl-ruggles.cloud/live-canary-deployments-with-aws-sam-the-new-websocket-api-resource-and-lambda-durable-functions?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS SAM WebSocket &amp;amp; Lambda Durable Functions: Canary Deploy&lt;/a&gt; by Darryl Ruggles&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/aws-heroes/serverless-applications-on-aws-with-lambda-using-java-25-api-gateway-and-dynamodb-part-7-lambda-4po1?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless applications on AWS with Lambda using Java 25, API Gateway and DynamoDB - Part 7 Lambda performance optimization approaches&lt;/a&gt; by Vadym Kazulkin&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/aws-heroes/aws-lambda-managed-instances-with-java-25-and-aws-sam-part-7-implement-scheduled-scaling-4df9?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Lambda Managed Instances with Java 25 and AWS SAM – Part 7 Implement scheduled scaling&lt;/a&gt; by Vadym Kazulkin&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/aws-builders/s3-files-killed-my-least-favorite-lambda-pattern-25f9?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;S3 Files Killed My Least Favorite Lambda Pattern&lt;/a&gt; by Mwanza Simi&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://heeki.medium.com/closing-the-loop-from-code-generation-to-sandboxed-code-execution-b90cff6adbf7?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Closing the loop from code generation to sandboxed code execution&lt;/a&gt; by Heeki Park&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://pubudu.dev/posts/lambda-durable-functions-triggered-by-sqs?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Triggering Lambda Durable Functions from SQS&lt;/a&gt; by Pubudu Jayawardana&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Reads&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://darryl-ruggles.cloud/the-real-cost-of-vector-storage-s3-vectors-vs-opensearch-vs-pgvector-vs-pinecone?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Vector Storage Costs: S3, OpenSearch, pgvector, Pinecone&lt;/a&gt; by Darryl Ruggles&lt;br /&gt;
Darryl built a full cost model and benchmark harness comparing S3 Vectors, OpenSearch Serverless NextGen, Aurora pgvector, and Pinecone, including how the May 2026 scale-to-zero launch shifts the comparison. There&#39;s a calculator to find the crossover point for your own workload shape, which is exactly the kind of thing you want before you pick a vector store and regret it later.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://ranthebuilder.cloud/blog/ai-changed-how-we-build-our-tools-didn-t?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AI Changed How We Build. Our Tools Didn&#39;t.&lt;/a&gt; by Ran Isenberg&lt;br /&gt;
Ran walks through the gap between AI-driven development and the tooling we still use to manage it. IDEs, GitHub, Jira, and sprint planning were all built for a world where humans wrote the code, and they haven&#39;t caught up to one where agents write and engineers mostly review. He&#39;s got a &lt;a href=&quot;https://ranthebuilder.cloud/blog/ai-changed-the-engineer-s-job-here-s-how-to-adapt?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;companion piece on adapting the engineer&#39;s job&lt;/a&gt; that gets into burnout risk and rising token costs too.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://charity.wtf/2026/06/02/ai-enthusiasts-are-in-a-race-against-time-ai-skeptics-are-in-a-race-against-entropy-xpost?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AI enthusiasts are in a race against time, AI skeptics are in a race against entropy&lt;/a&gt; by Charity Majors&lt;br /&gt;
Charity uses Fin&#39;s productivity gains as a case study and lands on a point that&#39;s easy to lose in the hype: the wins came from engineering discipline and fast feedback loops, not from AI being magic. If you&#39;re trying to bridge the gap between the true believers and the people rolling their eyes on your team, this is a good framework.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://claude.com/blog/running-an-ai-native-engineering-org?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Running an AI-native engineering org&lt;/a&gt;&lt;br /&gt;
Anthropic&#39;s engineering team shares how their process changed once every commit became Claude-assisted, including the move from six-month roadmaps to far more fluid planning. The bit about going past &amp;quot;who changed this&amp;quot; to &amp;quot;what information do I actually need&amp;quot; is the part worth sitting with.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://claude.com/blog/lessons-from-building-claude-code-how-we-use-skills?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Lessons from building Claude Code: How we use skills&lt;/a&gt;&lt;br /&gt;
The Claude Code team breaks down nine skill types they use internally, from library reference to verification to scaffolding, plus the practices that make a skill actually work. If you&#39;re building anything with skills, this is required reading.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://claude.com/blog/using-claude-code-the-unreasonable-effectiveness-of-html?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Using Claude Code: The unreasonable effectiveness of HTML&lt;/a&gt; by Thariq Shihipar&lt;br /&gt;
Anthropic makes the case that HTML beats Markdown for AI output because of its density and interactivity, with examples spanning richer docs, code reviews, and throwaway custom editors. I said it last issue and I&#39;ll say it again: I&#39;m sold on the HTML move.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://serverlessdna.com/strands/ai-agents/agent-loops-are-hungrier-than-you-think?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Your Agent Loops are Hungrier Than You Think&lt;/a&gt; by Michael Walmsley&lt;br /&gt;
Michael lays out why agentic loops burn tokens quadratically: every turn replays the full conversation history, so turn 20 is paying for turns 1 through 19. He backs it with real token counts from actual scenarios, and if you&#39;ve been surprised by an agent bill, this explains where it went.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://claude.com/blog/the-claude-cowork-product-guide?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;The Claude Cowork product guide&lt;/a&gt;&lt;br /&gt;
Anthropic&#39;s guide to Claude Cowork, their desktop knowledge-work agent, covers local file access, Slack and Google Drive integration, when to reach for it over other Claude tools, and seven worked examples. A useful orientation if you&#39;re trying to figure out where Cowork fits.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://openai.com/index/codex-for-knowledge-work?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Codex is becoming a productivity tool for everyone&lt;/a&gt;&lt;br /&gt;
OpenAI shared usage data putting Codex at 5 million weekly active users, with knowledge workers growing three times faster than developers. The use cases have spread well past code into reports, spreadsheets, presentations, and analysis. The line between &amp;quot;coding tool&amp;quot; and &amp;quot;work tool&amp;quot; keeps getting blurrier.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://blogs.oracle.com/developers/from-rag-to-memory-systems-building-stateful-ai-architecture?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AI Memory Systems Explained: From Retrieval to Durable, Context-Aware Agents&lt;/a&gt; by Jeremy Daly&lt;br /&gt;
This is mine. It&#39;s a deep architectural walkthrough of how to move from basic RAG to a production-grade memory system, covering five memory types (policy, preference, fact, episodic, trace), how their storage patterns differ, hybrid retrieval, and why you need a memory manager controlling what gets stored and retrieved while keeping governance and privacy intact. If memory has been the fuzzy part of your agent design, this should sharpen it up.&lt;/p&gt;
&lt;h3&gt;Podcasts, Videos, and more&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=5KWIf0mFzy8?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Building TypeScript agents with Strands | Serverless Office Hours&lt;/a&gt;&lt;br /&gt;
Erik walks through the Strands Agents TypeScript SDK for building agents on AWS, including agents that run in Node.js and the browser, connecting multiple model providers, and orchestrating multi-agent workflows.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=RzwYFL6wIOU?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Building with Claude: Lessons from real projects | Serverless Office Hours&lt;/a&gt;&lt;br /&gt;
Ran Isenberg joins Julian Wood to talk through practical Claude Code workflows in serverless development: custom skills, configuration strategies, and context management. Worth watching if you&#39;re still figuring out how these tools fit your process.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=R5cA74Qv4hs?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AI-assisted development in practice | Serverless Office Hours&lt;/a&gt;&lt;br /&gt;
Darryl Ruggles builds a full serverless blogging platform with AI coding tools and is honest about what works (MCP servers for Terraform and AWS docs), what breaks, and how to keep security and best practices intact when you let AI write your infrastructure.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=dWIng6-7wQw?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless Craic Ep86 AI and Software Development - the Real Problem&lt;/a&gt;&lt;br /&gt;
The Serverless Edge crew makes the case that AI amplifies both good and bad engineering practices, with a discussion that wanders through platform engineering, cognitive load, and socio-technical systems.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=kquxaSSj3BE?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless CrAIc Ep85 Why Team Topologies Matters More Than Ever in the AI Era&lt;/a&gt;&lt;br /&gt;
The crew asks whether AI agents count as team members and what that does to cognitive load, working through how organizational frameworks bend when code generation speeds up but human collaboration stays the bottleneck.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=lqw92QYqM88?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Bites #154: S3 Files&lt;/a&gt;&lt;br /&gt;
Eoin and Luciano dig into S3 Files, explaining why S3 was never really a file system (no atomic renames, expensive listings, immutable objects) and how this service bridges the gap, with benchmark data and a frank look at the 60-second write-back delay and eventual consistency.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.lennysnewsletter.com/p/claude-fable-5-review-what-the-new?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude Fable 5 review: what the new Mythos model gets right (and very wrong)&lt;/a&gt;&lt;br /&gt;
Claire Vo reviews Anthropic&#39;s first generally available Mythos-class model and the launches around it, including Managed Agents and safety classifiers, testing it on product specs and multi-agent orchestration. A grounded look at a model that&#39;s getting a lot of breathless coverage everywhere else.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.lennysnewsletter.com/p/a-rational-conversation-on-where?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;A rational conversation on where AI is actually going | Benedict Evans&lt;/a&gt;&lt;br /&gt;
Benedict Evans argues foundation models won&#39;t hold lasting pricing power and that value moves up the stack, with distribution becoming the real moat now that software is cheap to build. A nice counterweight to the model-vendor news in this issue.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.lennysnewsletter.com/p/the-ai-paradox-dan-shipper?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;The AI paradox: More automation, more humans, more work | Dan Shipper&lt;/a&gt;&lt;br /&gt;
Dan Shipper draws on running Every to argue that work is moving inside AI agents, that SaaS is thriving rather than dying because agents drive more usage, and that roles like PM are getting more leverage from AI tooling.&lt;/p&gt;
&lt;h3&gt;New from AWS&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/secrets-manager-managed-external-secrets-datadog-snowflake?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Secrets Manager adds managed external secrets support for Datadog vended keys and Snowflake Programmatic Access Tokens&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/aws-security-agent?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Security Agent adds verification scripts for pentest findings&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-ecs-pause-continue-deployments?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon ECS introduces pause and continue controls for service deployments&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/5/docdb8-serverless?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon DocumentDB (with MongoDB compatibility) Serverless is now available on DocumentDB 8.0&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-cloudwatch-logs-insights-new?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudWatch Logs Insights adds 23 new query commands and functions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/aws-lambda-managed-instances-region-expansion?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Lambda Managed Instances expands to additional AWS Regions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-emr-serverless-spark-connect?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Run Interactive Workloads on Amazon EMR Serverless with Spark Connect&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/aws-msk-express-topic-support-kstreams?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon MSK Express Brokers now support automatic topic creation with Kafka Streams&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/aws-cost-explorer-intelligent-cost-explanations?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Cost Explorer launches intelligent cost explanations powered by Amazon Q&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/aws-ai-powered-cost-investigations?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS now provides AI-powered cost investigations for cost anomalies&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-redshift-incremental-manual-snapshots?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Redshift reduces manual snapshot cost for Serverless and RG instances&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/aws-savings-plans-coverage?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Savings Plans Purchase Analyzer now supports target coverage analysis&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-cloudwatch-mi-extended-retention-region-expansion?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudWatch now supports querying metrics data up to two weeks old&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/durability-amazon-elasticache?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon ElastiCache for Valkey now supports durability&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/monitor-aws-budgets-using-dashboards?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Monitor AWS Budgets directly in Billing and Cost Management Dashboards with new Budgets widget&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/oracle-database-aws-available-twenty-regions?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Oracle Database@AWS is now available in twenty AWS Regions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-ses-global-deliverability?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon SES now offers inbox placement metrics and blocklist monitoring&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-ses-tenant-level-suppression-lists?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon SES now supports tenant-level suppression lists&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/aws-shield-ddos?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Shield Advanced introduces DDoS attack flow logs&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/06/keyspaces-cdc-iterator-position?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Keyspaces (for Apache Cassandra) now provides CDC iterator position&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Developer Tools&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://dynamosql.com/?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;DynamoSQL™ — ANSI SQL for Amazon DynamoDB&lt;/a&gt;&lt;br /&gt;
DynamoSQL is a SQL query engine for DynamoDB with JOINs, CTEs, aggregations, and subqueries, no pipelines or ETL required. It&#39;s in beta with early access through AWS Marketplace and offers MCP integration for AI applications.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://dev.to/himaan4149/i-built-pretext-pdf-serverless-pdfs-without-chromium-1pg5?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;I Built pretext-pdf: Serverless PDFs Without Chromium&lt;/a&gt; by Himanshu Jain&lt;br /&gt;
Himanshu built pretext-pdf, a Node.js library that generates PDFs from JSON without Chromium, aimed at structured documents like invoices and reports with 40-100ms generation times. If you&#39;ve ever wrestled a headless Chromium into a Lambda just to make a PDF, this is a lighter path.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/developer/introducing-open-source-skills-for-aws-sdk-best-practices?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23367&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Introducing Open-Source Skills for AWS SDK Best Practices&lt;/a&gt; by David Yaffe&lt;br /&gt;
AWS released open-source skills for their Agent Toolkit to improve how AI coding agents generate SDK code, currently for Swift, JavaScript v3, and Python (Boto3), targeting the common mistakes like wrong API names, bad parameter types, and missed paginators.&lt;/p&gt;
&lt;h3&gt;Final Thoughts 🤔&lt;/h3&gt;
&lt;p&gt;That $65 billion raise is the smoke rising from the Anthropic and OpenAI IPO talk, with their valuations looking shakier than the headlines suggest once you do the math on token economics. Burning compute to win benchmarks is one thing. Making the unit economics work when customers actually use the product is another, and that&#39;s where the recent billing changes come in. Anthropic pulling &lt;code&gt;claude -p&lt;/code&gt; out of what your Max subscription covers, plus the GitHub Copilot billing changes are already having a real effect on how people use these tools. The tokenmaxing that let everyone ship slop faster is getting expensive, and maybe that&#39;s (kind of) a good thing.&lt;/p&gt;
&lt;p&gt;It forces discipline, which is the thread running through several pieces in this issue. Charity Majors makes the case that the AI productivity wins came from engineering discipline and tight feedback loops, not magic. Ran Isenberg points out that our tools were built for humans writing code and are straining under agents doing it. Both are circling the same idea: the teams that come out ahead won&#39;t be the ones with the most tokens, they&#39;ll be the ones with the most discipline. If that discipline doesn&#39;t show up, we&#39;re all in trouble.&lt;/p&gt;
&lt;p&gt;The model you use this year will be obsolete by next. The patterns you build around storage, cost, and testing will outlast all of them.&lt;/p&gt;
&lt;p&gt;See you next week,&lt;br /&gt;
Jeremy&lt;/p&gt;
&lt;hr /&gt;
&lt;p&gt;I hope you enjoyed this newsletter. We&#39;re always looking for ideas and feedback to make it better and more inclusive, so please feel free to reach out to me via &lt;a href=&quot;https://bsky.app/profile/jeremydaly.com&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Bluesky&lt;/a&gt;, &lt;a href=&quot;http://www.linkedin.com/in/jeremydaly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;LinkedIn&lt;/a&gt;, &lt;a href=&quot;http://twitter.com/jeremy_daly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;X&lt;/a&gt;, or &lt;a href=&quot;mailto:contact@jeremydaly.com&quot;&gt;email&lt;/a&gt;.&lt;/p&gt;
</content>
  </entry>
  <entry>
    <title>Issue #366: The Flat-Rate Honeymoon is Over 📈</title>
    <link href="https://offbynone.io/issues/366/"/>
    <updated>2026-05-19T12:00:00Z</updated>
    <summary>In this issue, Anthropic brings subscription clarity to Claude, Codex goes mobile, and Amazon DSQL gets CDC on DPUs.</summary>
    <id>https://offbynone.io/issues/366/</id>
    <content type="html">&lt;h2&gt;The Flat-Rate Honeymoon is Over 📈&lt;/h2&gt;
&lt;p&gt;In our &lt;a href=&quot;https://offbynone.io/issues/365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;previous issue&lt;/a&gt;, Claude Platform set up shop on AWS, ElastiCache learned to do full-text and hybrid search, and Ampt rolled out Node.js 24 as the default runtime. This week, Anthropic brings subscription clarity to Claude, Codex goes mobile, and Amazon DSQL gets CDC on DPUs. Plus, we&#39;ve got plenty of awesome content from cloud, serverless, and AI communities.&lt;/p&gt;
&lt;h3&gt;News &amp;amp; Announcements&lt;/h3&gt;
&lt;p&gt;Anthropic announced this past week that &lt;a href=&quot;https://x.com/ClaudeDevs/status/2054610152817619388?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;paid Claude plans will get a dedicated monthly credit for programmatic usage starting June 15&lt;/a&gt;. Pro gets $20 in monthly credits, Max 20x gets $200, and anything past the allocation rolls onto API rates. If you&#39;ve been running SDK loops, &lt;code&gt;claude -p&lt;/code&gt; jobs, or GitHub Actions agents on your subscription, the honeymoon is over. Theo Browne took it about as well as you&#39;d expect. His &lt;a href=&quot;https://www.youtube.com/watch?v=131yAOjxHHQ?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;reaction video&lt;/a&gt; is appropriately titled &amp;quot;I&#39;m done,&amp;quot; and his X post promised to &lt;a href=&quot;https://x.com/theo/status/2054734057368621176?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;donate $10 to open source for every screenshot of a cancelled Claude Code plan&lt;/a&gt; that was shared. He&#39;s not wrong to be frustrated. The &lt;a href=&quot;https://x.com/mattpocockuk/status/2040536403289764275&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;rules have been ambiguous for quite some time&lt;/a&gt;, and this announcement does provide some clarity, just not what most of us were hoping for. The bigger lesson here is about AI platform risk. Cancelling Claude Code doesn&#39;t fix the real problem, and if your business depends on one vendor&#39;s pricing staying frozen forever, your subscription isn&#39;t the thing that needs changing.&lt;/p&gt;
&lt;p&gt;Even with all the developer backlash, Anthropic seems undeterred. They continue to ship more and more enterprise plumbing. &lt;a href=&quot;https://claude.com/blog/claude-managed-agents-updates?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude Managed Agents now support self-hosted sandboxes and MCP tunnels&lt;/a&gt;, so your agents&#39; tools can run inside your own infrastructure while orchestration stays on Anthropic&#39;s platform. Cloudflare, Daytona, Modal, and Vercel are all in the launch lineup, with &lt;a href=&quot;https://blog.cloudflare.com/claude-managed-agents?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Cloudflare getting its own first-class slot&lt;/a&gt; including Browser Run for automation and quick-start templates. Anthropic also pushed two vertical packages: &lt;a href=&quot;https://claude.com/blog/claude-for-the-legal-industry?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude for the legal industry&lt;/a&gt; and &lt;a href=&quot;https://www.anthropic.com/news/claude-for-small-business?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude for Small Business&lt;/a&gt;. Legal feels like the obvious play. High-value industries with messy document workflows are where managed agents make a lot of sense, so long as it doesn&#39;t keep hallucinating case law.&lt;/p&gt;
&lt;p&gt;Elsewhere in the agent space, OpenAI brought &lt;a href=&quot;https://openai.com/index/work-with-codex-from-anywhere?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Codex to the ChatGPT mobile app&lt;/a&gt; with Remote SSH now GA, programmatic access tokens, and HIPAA compliance for healthcare. Coding from your phone still sounds like a stretch, but the strategy is right: meet developers wherever they happen to be. Also, &lt;a href=&quot;https://thenewstack.io/temporal-replay-2026-news?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Temporal added Workflow Streams and Standalone Activities to its durable execution platform&lt;/a&gt;, both aimed squarely at the AI-in-production folks. Durability and debuggability are the two things most agentic systems are starving for, so the direction makes sense.&lt;/p&gt;
&lt;p&gt;AWS had a busy week too. &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-aurora-dsql-change-data-capture-preview?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Aurora DSQL now supports change data capture in preview&lt;/a&gt;, streaming database changes to Kinesis Data Streams for event-driven apps and real-time analytics. I think the Distributed Processing Units plus Kinesis pricing is going to trip a few people up, but I&#39;ve been wrong before. &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-bedrock-advanced-prompt-optimization-migration-tool?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Bedrock launched Advanced Prompt Optimization&lt;/a&gt; (with a &lt;a href=&quot;https://aws.amazon.com/blogs/aws/amazon-bedrock-introduces-new-advanced-prompt-optimization-and-migration-tool?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;deeper writeup on the AWS blog&lt;/a&gt;), automating prompt comparison across up to 5 models with custom evaluation metrics, Lambda-based scoring, LLM-as-a-Judge rubrics, and multimodal inputs including images and PDFs.&lt;/p&gt;
&lt;p&gt;On the Lambda side, &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/aws-lambda-managed-instances?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS added scheduled scaling for functions on Lambda Managed Instances&lt;/a&gt; via EventBridge Scheduler, useful for adjusting capacity ahead of expected traffic, and &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/region-switch-lambda-esm-execution-block?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;ARC Region switch now automates Lambda event source mapping execution during failovers&lt;/a&gt; across Kinesis, DynamoDB Streams, MSK, and SQS, with cross-account support. That last one is very cool. CloudFront got two updates: &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-cloudfront-mtls-passthrough?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Passthrough Mode for mTLS&lt;/a&gt; that forwards certificates to origins without edge validation, and &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/cloudfront-configurable-premium-flat-rate-plans?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;configurable usage allowances on the Premium flat-rate plan&lt;/a&gt; from 500 million to 6 billion requests and 50 TB to 600 TB per month. And &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-eventbridge-sdk-integrations?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;EventBridge Scheduler added 619 new SDK API actions across 13 services&lt;/a&gt;, bringing the total coverage to over 270 AWS services.&lt;/p&gt;
&lt;p&gt;Finally, on the security side, &lt;a href=&quot;https://www.wiz.io/blog/introducing-runtime-threat-detection-for-google-cloud-run?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Wiz&#39;s Runtime Sensor for Google Cloud Run is now GA&lt;/a&gt;, with 2000+ detection rules and AI-driven investigation through their Blue Agent. Serverless container monitoring built for serverless containers. Who would have thought?&lt;/p&gt;
&lt;h3&gt;Tutorials&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/database/zero-downtime-dynamodb-construct-migration-from-table-to-tablev2-with-cdk-orphan?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Zero-downtime DynamoDB construct migration: from Table to TableV2 with cdk orphan&lt;/a&gt; by Lee Hannigan&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/database/getting-started-with-change-data-capture-in-amazon-aurora-dsql?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Getting started with Change Data Capture in Amazon Aurora DSQL&lt;/a&gt; by Vijay Karumajji&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://builder.aws.com/content/3DiJq5vQ2hnGq1ddBlCApT8uy6u/dynamic-looping-comes-to-aws-sam?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Dynamic Looping Comes to AWS SAM&lt;/a&gt; by Eric Johnson&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://claude.com/blog/best-practices-for-computer-and-browser-use-with-claude?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Best practices for computer and browser use with Claude&lt;/a&gt; by Lucas Gonzalez and Luca Weihs&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://tricksumo.com/at-most-once-vs-at-least-once-semantics-lambda-durable-function?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AtMostOncePerRetry vs AtLeastOncePerRetry Semantics in Lambda Durable Function Step&lt;/a&gt; by Rishi&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/build-custom-code-based-evaluators-in-amazon-bedrock-agentcore?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Build custom code-based evaluators in Amazon Bedrock AgentCore&lt;/a&gt; by Bharathi Srinivasan&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://serverlessdna.com/strands/ai-assisted-development/layered-configuration-claude-code?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Layered Configuration in Claude Code&lt;/a&gt; by Michael Walmsley&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://monicacolangelo.com/multi-tenant-dynamodb-token-vending-machine?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Per-tenant DynamoDB isolation with the Token Vending Machine pattern&lt;/a&gt; by Monica Colangelo&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/aws-builders/lambda-durable-functions-when-you-dont-need-step-functions-20bn?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Lambda Durable Functions, When You Don&#39;t Need Step Functions&lt;/a&gt; by Lewis Sawe&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://medium.com/@RDarrylR/live-canary-deployments-with-aws-sam-the-new-websocket-api-resource-and-lambda-durable-functions-4b029533b34f?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Live Canary Deployments with AWS SAM, the New WebSocket API Resource, and Lambda Durable Functions&lt;/a&gt; by Darryl Ruggles&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Reads&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://claude.com/blog/the-founders-playbook?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;The founder&#39;s playbook: Building an AI-native startup&lt;/a&gt;&lt;br /&gt;
Anthropic walks through Idea, MVP, Launch, and Scale for AI-native startups, with real founder stories woven in throughout. Playbooks aren&#39;t the whole answer, but if you&#39;re staring at a blank canvas, this is a pretty good starting point.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://claude.com/blog/how-claude-code-works-in-large-codebases-best-practices-and-where-to-start?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;How Claude Code works in large codebases: Best practices and where to start&lt;/a&gt; by&lt;br /&gt;
A complete walkthrough of Claude Code&#39;s extension points (&lt;a href=&quot;http://claude.md/&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;CLAUDE.md&lt;/a&gt; files, LSP integrations, MCP servers, subagents) and how they shape behavior in enterprise codebases. If you&#39;re getting mediocre results from Claude Code once you push past your toy project, this will probably help you fill some gaps.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://blog.cloudflare.com/cyber-frontier-models?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Project Glasswing: what Mythos showed us&lt;/a&gt; by Grant Bourzikas&lt;br /&gt;
Cloudflare tested Anthropic&#39;s Mythos Preview model on a number of their repos and got firsthand knowledge of why off-the-shelf coding agents fall short. The multi-stage architecture they built is a useful reference for anyone doing serious agentic work outside of the basic coding use case.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.nytimes.com/2026/05/18/opinion/ai-boo-commencement-speeches.html?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Opinion | The Generation That Grew Up With A.I. Hates It&lt;/a&gt; by Michelle Goldberg&lt;br /&gt;
Only 18% of Gen Z is hopeful about AI, and 47% of voters under 30 rate it as mostly bad. As someone with two daughters in that demographic, I can&#39;t say I&#39;m surprised. They&#39;ve watched the technology arrive with lots of promises but not a lot of upside for them.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.readysetcloud.io/blog/allen.helton/local-agents-scare-me?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Local agents scare me&lt;/a&gt; by Allen Helton&lt;br /&gt;
Allen walks through four attack vectors for local AI agents (shared userland, network adjacency, poisoned context, persistent state) and makes the case that traditional IAM controls don&#39;t fit. Definitely worth reading before you give any agent unrestricted shell access on your machine.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://ranthebuilder.cloud/blog/is-aws-lambda-tenant-isolation-mode-enough-for-saas?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Is AWS Lambda Tenant Isolation Mode Enough for SaaS?&lt;/a&gt; by Ran Isenberg&lt;br /&gt;
Ran breaks down what Lambda&#39;s tenant isolation actually solves and what it doesn&#39;t. The compute side is handled, but data access control is still on you, which has always been the hard part of multi-tenancy.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://medium.com/@siddarthpatil/10-practical-serverless-architecture-lessons-from-aws-summit-london-2026-5ccb29621a27?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;10 Practical Serverless Architecture Lessons from AWS Summit London 2026&lt;/a&gt; by Siddarth Patil&lt;br /&gt;
A grab-bag of serverless patterns from AWS Summit London: Lambda boundaries, async with EventBridge and SQS, cold starts, cost management, and applying the same patterns to GenAI workloads. Most of it is table stakes if you&#39;ve been doing this a while, but the GenAI section is worth a skim.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://architectingautonomy.substack.com/p/cross-domain-governance?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Cross-Domain Governance&lt;/a&gt; by Aaron Sempf&lt;br /&gt;
Aaron explains how autonomous systems should behave when they cross organizational boundaries, proposing monotonic reduction: authority can only be restricted, never amplified, as you move outward. It&#39;s a tidy way to think about a problem most agent platforms haven&#39;t even acknowledged yet. I always feel smarter after reading his stuff.&lt;/p&gt;
&lt;h3&gt;Podcasts, Videos, and more&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=mUhYA_Obh-4?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Building Apps with AI + MCP Servers | Serverless Office Hours&lt;/a&gt;&lt;br /&gt;
Brian Zambrano joins Darko Mesaroš to build a serverless application from prompts using Kiro and MCP servers. A solid walkthrough from natural language to deployed AWS infrastructure.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.lennysnewsletter.com/p/how-i-ai-html-is-the-new-markdown?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;How I AI: HTML is the new Markdown: How Anthropic engineers are building with Claude Code&lt;/a&gt;&lt;br /&gt;
Claire Vo interviews Thariq Shihipar from Anthropic&#39;s Claude Code team on the shift from Markdown to HTML for AI output, plus patterns like living design systems and micro-apps. I&#39;m all for the HTML move. Markdown files have gotten easier and easier to gloss over, and giving the model a proper display layer makes the output feel a lot less disposable.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=43tYW89iikU?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless CrAIc Ep84 AI-Generated Code Is a Liability: Technical Debt &amp;amp; Engineering Excellence&lt;/a&gt;&lt;br /&gt;
The Serverless Craic crew digs into the velocity versus debt tradeoff in AI-generated code, including the awkward truth that more tests and more code don’t always mean better quality. The discussion around engineering excellence as a counterweight to AI-driven throughput is where this one gets especially heady. They argue that production code still has to be maintained by humans eventually. Let&#39;s hope.&lt;/p&gt;
&lt;h3&gt;New from AWS&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/aws-cloudformation-cdk-stack?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Reference stack outputs across accounts and Regions with AWS CloudFormation and CDK&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/aws-transform-developer-tools?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Transform agents now available in Kiro, Claude, Cursor, and Codex&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/aws-transform-ai-assistant?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Transform adds agentic AI assistant to the AWS Toolkit for Visual Studio&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-ec2-m3-ultra-mac-instances-generally-available?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Announcing general availability of Amazon EC2 M3 Ultra Mac instances&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-redshift-alter-table-iceberg?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Redshift adds ALTER TABLE for Iceberg tables and writes via the AWS Glue Data Catalog mount&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-emr-serverless-aws-regions?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon EMR Serverless is now available in additional AWS Regions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/aws-announces-AWS-interconnect-multicloud-oci-preview?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS announces AWS Interconnect - multicloud connectivity with Oracle Cloud Infrastructure in preview&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/cloudwatch-logs-query-results?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudWatch Logs announces increased query result limits&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/aws-organizations-increased-scp-quotas?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Organizations now supports higher quotas for service control policies (SCPs)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/aws-security-agent-full-repository-code-review?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Security Agent now supports full repository code reviews&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-route-53-domains?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Route 53 Domains adds support for 34 new Top Level Domains including .app, .dev, and .health.&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/aws-sam-cli-cloudformation?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23366&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS SAM CLI adds AWS CloudFormation Language Extensions support to accelerate local serverless development&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Final Thoughts 🤔&lt;/h3&gt;
&lt;p&gt;There&#39;s a technical nuance to this Claude Code change that&#39;s worth pulling apart. Running &lt;code&gt;claude -p&lt;/code&gt; isn&#39;t the same as hitting the API directly. Claude Code ships with caching, tool-use optimizations, context management, and prompt structuring that make the interactive product feel as useful as it does. When you wire &lt;code&gt;claude -p&lt;/code&gt; into a script, you get those same optimizations applied to your automated workflows. Hit the raw API yourself and you&#39;re rebuilding all of that from scratch, usually badly, burning extra tokens on every loop, and probably wrecking the economics in the process.&lt;/p&gt;
&lt;p&gt;That&#39;s why this change stings more than a normal pricing adjustment. I understand metering for large-scale programmatic use, but the bigger shift is that now the cheap, optimized path is effectively reserved for interactive use. If you want automation, you either pay API rates or stay inside one of Claude&#39;s tightly controlled (typically not great) interfaces.&lt;/p&gt;
&lt;p&gt;That&#39;s the part that bothers me. You&#39;re paying for more than just access to a raw model. You&#39;re paying for the orchestration layer around it: the caching, context handling, tool execution, prompt shaping, and all the little optimizations that make Claude Code actually useful day to day. Whether those requests originate from a human typing into a terminal or a script running in the background seems mostly immaterial.&lt;/p&gt;
&lt;p&gt;And that&#39;s the bigger question this raises for the industry. Are these systems ultimately meant to become programmable infrastructure, or are they meant to remain interactive products with a human sitting in front of them? Because the economics matter. Automation only works when the cost structure makes sense. If the optimized path is reserved for interactive use while automated use is pushed onto significantly more expensive APIs, then we&#39;re implicitly putting limits on how far these tools can evolve beyond &amp;quot;copilot&amp;quot; workflows.&lt;/p&gt;
&lt;p&gt;That&#39;s worth thinking about before we build entire engineering organizations around them.&lt;/p&gt;
&lt;p&gt;See you next week,&lt;br /&gt;
Jeremy&lt;/p&gt;
&lt;hr /&gt;
&lt;p&gt;I hope you enjoyed this newsletter. We&#39;re always looking for ideas and feedback to make it better and more inclusive, so please feel free to reach out to me via &lt;a href=&quot;https://bsky.app/profile/jeremydaly.com&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Bluesky&lt;/a&gt;, &lt;a href=&quot;http://www.linkedin.com/in/jeremydaly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;LinkedIn&lt;/a&gt;, &lt;a href=&quot;http://twitter.com/jeremy_daly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;X&lt;/a&gt;, or &lt;a href=&quot;mailto:contact@jeremydaly.com&quot;&gt;email&lt;/a&gt;.&lt;/p&gt;
</content>
  </entry>
  <entry>
    <title>Issue #365: Valkey 9 Unlocks Hybrid Search on ElastiCache 🔍</title>
    <link href="https://offbynone.io/issues/365/"/>
    <updated>2026-05-12T12:00:00Z</updated>
    <summary>In this issue, Claude Platform sets up shop on AWS, ElastiCache learns to do full-text and hybrid search, and Ampt rolls out Node.js 24 as the default runtime.</summary>
    <id>https://offbynone.io/issues/365/</id>
    <content type="html">&lt;h2&gt;Valkey 9 Unlocks Hybrid Search on ElastiCache 🔍&lt;/h2&gt;
&lt;p&gt;In our &lt;a href=&quot;https://offbynone.io/issues/364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;previous issue&lt;/a&gt;, Amazon Bedrock crossed the final frontier of hosted frontier models, AI agents started buying domain names, and Amazon Q Developer got a one-way ticket to the AWS graveyard. This week, Claude Platform sets up shop on AWS, ElastiCache learns to do full-text and hybrid search, and Ampt rolls out Node.js 24 as the default runtime. Plus, we&#39;ve got plenty of awesome cloud, serverless, and AI content from the community.&lt;/p&gt;
&lt;h3&gt;News &amp;amp; Announcements&lt;/h3&gt;
&lt;p&gt;Anthropic and AWS got even closer this week. &lt;a href=&quot;https://claude.com/blog/claude-platform-on-aws?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Anthropic introduced the Claude Platform on AWS&lt;/a&gt;, which sits alongside Claude on Bedrock as a second, distinct way to use Claude inside your AWS account. The split is worth understanding: Claude Platform is Anthropic-operated with data processed &lt;em&gt;outside&lt;/em&gt; AWS, while Claude on Bedrock keeps data &lt;em&gt;inside&lt;/em&gt; the AWS boundary. &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/claude-platform-aws?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude Platform on AWS is now generally available&lt;/a&gt; across 18 regions with direct access to Anthropic&#39;s APIs, console, Managed Agents, web search, and prompt caching, all billed through AWS Marketplace. AWS has &lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/introducing-claude-platform-on-aws-anthropics-native-platform-through-your-aws-account?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;its own post on the launch&lt;/a&gt; explaining the IAM and Marketplace plumbing. The short version: enterprises that want full Anthropic-native features without leaving their AWS account just got a much cleaner deployment path.&lt;/p&gt;
&lt;p&gt;AgentCore also had a heck of a week. &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-bedrock-agentcore-runtime?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AgentCore Runtime now supports bring-your-own file system from S3 and EFS&lt;/a&gt;, letting you mount durable storage directly at agent runtime paths instead of bolting on file access through tools. &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/agentcore-longterm-memory-metadata?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AgentCore Memory now supports metadata for long-term memory&lt;/a&gt; with up to ten indexed keys that can be set manually or inferred by an LLM, making retrieval over long-term memory actually targetable instead of a vector similarity guessing game. And in the &amp;quot;what could possibly go wrong&amp;quot; category, &lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/agents-that-transact-introducing-amazon-bedrock-agentcore-payments-built-with-coinbase-and-stripe?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Bedrock AgentCore Payments launched in preview&lt;/a&gt; (read the &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-bedrock-agentcore-payments-preview?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;official announcement blog&lt;/a&gt;), built with Coinbase and Stripe and using the x402 protocol to let agents pay for APIs, MCP servers, and web content in stablecoins. So agents now have file systems, memory with metadata, and a wallet. 🔥&lt;/p&gt;
&lt;p&gt;On the agent tooling side, &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/agent-toolkit?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS announced the Agent Toolkit for AWS&lt;/a&gt;, a managed suite of pre-validated skills for AI coding agents covering application development, data analytics, and AgentCore, with IAM guardrails baked in. Also, &lt;a href=&quot;https://aws.amazon.com/blogs/aws/the-aws-mcp-server-is-now-generally-available?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;the AWS MCP Server is generally available&lt;/a&gt;, now with IAM context keys, a sandboxed Python execution tool, and better token efficiency. AWS is trying really hard to be the default platform for AI coding agents. Giving devs an opinionated, authenticated entry point seems like the smart play, but AWS doesn&#39;t have the same head start they did with serverless.&lt;/p&gt;
&lt;p&gt;It was a big week for ElastiCache as well. &lt;a href=&quot;https://aws.amazon.com/blogs/database/valkey-turns-two?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Valkey turned two&lt;/a&gt;, with Docker pulls up 17x year over year and adoption across the major clouds, which is a pretty good trajectory given that it started as a Redis fork barely 24 months ago. They also announced the release of &lt;a href=&quot;https://aws.amazon.com/blogs/database/announcing-valkey-9-0-for-amazon-elasticache?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Valkey 9.0 for Amazon ElastiCache&lt;/a&gt;, which brings built-in search, hash field expiration, and multi-database support in cluster mode. The headline features got their own announcements: ElastiCache now supports &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-elasticache-enchanced-search?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;real-time full-text, exact-match, and numeric range search&lt;/a&gt;, &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-elasticache-hybrid-search?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;hybrid search combining vector similarity and full-text&lt;/a&gt;, and &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-elasticache-aggregations?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;real-time aggregations&lt;/a&gt;, all at microsecond latency and across all regions at no extra cost. Chaitanya Nuthalapati has a &lt;a href=&quot;https://aws.amazon.com/blogs/database/enhanced-search-for-amazon-elasticache?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;walkthrough of building search and recommendation engines on top of it&lt;/a&gt; with full code, and there&#39;s a &lt;a href=&quot;https://aws.amazon.com/blogs/database/announcing-aggregations-on-amazon-elasticache?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;separate post on the aggregations specifically&lt;/a&gt;. ElastiCache is turning into a serious AI workload backend, but it might also be the serverless full-text search service we&#39;ve been waiting for.&lt;/p&gt;
&lt;p&gt;For AWS SAM users, two nice quality-of-life updates: &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/aws-sam-websocket-apis-api-gateway?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;SAM now natively supports WebSocket APIs for API Gateway&lt;/a&gt;, auto-generating routes, integrations, and IAM permissions from your template, and &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/aws-sam-cli-buildkit-aws-lambda?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;SAM CLI 1.159.0 added BuildKit support for Lambda container images&lt;/a&gt;, bringing multi-stage builds, better caching, cross-architecture builds, and Docker secrets to the workflow. It seems like these updates should have shipped years ago, but I&#39;m glad to see them land.&lt;/p&gt;
&lt;p&gt;In other Anthropic news, &lt;a href=&quot;https://claude.com/blog/agent-view-in-claude-code?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude Code got agent view&lt;/a&gt;, a centralized UI for managing multiple coding sessions in parallel without juggling terminal tabs. If you&#39;ve been doing this manually with tmux and worktrees, this is going to save you some major pain. Anthropic also rolled out &lt;a href=&quot;https://claude.com/blog/collaborate-with-claude-across-excel-powerpoint-word-and-outlook?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude integrations across Excel, PowerPoint, Word, and Outlook&lt;/a&gt;, with Excel, PowerPoint, and Word now GA and Outlook in public beta. Context follows you across apps, and enterprises get OpenTelemetry logging and Analytics API access for governance. And &lt;a href=&quot;https://claude.com/blog/new-in-claude-managed-agents?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude Managed Agents picked up &amp;quot;dreaming,&amp;quot; outcomes, and multiagent orchestration&lt;/a&gt;, with outcomes being a rubric-based eval system showing up to 10-point improvements on hard tasks. Netflix and Wisedocs are already shipping with it.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://getampt.com/blog/nodejs24-support?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Ampt now supports Node.js 24&lt;/a&gt; as the default runtime, bringing Web Streams, URLPattern, iterator helpers, and a pile of features that used to require third-party npm packages.&lt;/p&gt;
&lt;p&gt;Finally, &lt;a href=&quot;https://blog.cloudflare.com/building-for-the-future?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Cloudflare is laying off over 1,100 employees&lt;/a&gt;, which they&#39;re framing as a reorganization for the AI era rather than cost-cutting. The severance package is genuinely good (full base pay through end of 2026 and accelerated equity vesting), but the framing is doing a lot of work. &amp;quot;Reorganization for the AI era&amp;quot; is becoming the corporate euphemism of the decade.&lt;/p&gt;
&lt;h3&gt;Tutorials&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://loige.co/writing-middlewares-for-rust-lambda-functions?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Writing middlewares for Rust Lambda functions&lt;/a&gt; by Luciano Mammino&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/architecture/choosing-between-single-or-multiple-organizations-in-aws-organizations?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Choosing between single or multiple organizations in AWS Organizations&lt;/a&gt; by John White&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/database/amazon-aurora-dsql-connections-drivers-strings-and-best-practices?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Aurora DSQL connections: Drivers, strings, and best practices&lt;/a&gt; by Rob Petersen&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/database/query-billion-scale-vectors-with-sql-integrating-amazon-s3-vectors-and-aurora-postgresql?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Query billion-scale vectors with SQL: Integrating Amazon S3 Vectors and Aurora PostgreSQL&lt;/a&gt; by Shayon Sanyal&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/robertobelotti/how-i-locked-down-a-static-site-with-lambdaedge-and-cognito-no-backend-required-40el?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;How I Locked Down a Static Site with Lambda@Edge and Cognito (No Backend Required)&lt;/a&gt; by Roberto Belotti&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/database/migrating-data-from-an-amazon-aurora-snapshot-into-amazon-aurora-dsql?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Migrating data from an Amazon Aurora snapshot into Amazon Aurora DSQL&lt;/a&gt; by Dan Blaner&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Reads&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://chrisebert.net/notes-from-code-with-claude-2026?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Notes from Code with Claude 2026&lt;/a&gt; by Chris Ebert&lt;br /&gt;
Chris pulls together the announcements that mattered from Code with Claude 2026: the SpaceX compute deal, Multiagent Orchestration, and Dreaming inside Managed Agents. The context window observations are the most useful part for anyone actually shipping agents right now.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://dev.to/thegdsks/aws-lambda-is-dead-the-020-was-never-the-price-2k4j?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Lambda Is Dead. The $0.20 Was Never the Price&lt;/a&gt;&lt;br /&gt;
The author migrated 47 Lambda functions to Cloudflare Workers and dropped their monthly bill from $8,362 to $1,790, with most of the savings coming from the orchestration tax (API Gateway, CloudWatch, NAT, egress) rather than Lambda itself. He&#39;s right that the bundle is where the real money goes, and the August 2025 INIT billing change is worth knowing about. But the workloads he&#39;s describing (HTTP APIs, webhooks, auth, edge functions waiting on a database) were never the shape Lambda was built for. Lambda&#39;s actual sweet spot is async event-driven work that needs to fan out to thousands of concurrent executions for seconds at a time, not synchronous request/response paths burning wall clock waiting on Postgres. High-volume systems need to be designed for the runtime you&#39;re putting them on. Putting a sync API behind API Gateway and a NAT&#39;d Lambda and then complaining about the bundle is a design problem dressed up as a pricing problem. Workers is a better fit for that workload, and he should use it. Just don&#39;t declare the tool dead because it was the wrong one for the job.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.databricks.com/blog/rethinking-distributed-systems-serverless-performance-and-reliability?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Rethinking Distributed Systems for Serverless Performance and Reliability&lt;/a&gt; by Aaron Davidson, Roland Fäustlin, and Zach Williams&lt;br /&gt;
Databricks walks through how their serverless Spark platform works, including Spark Connect that decouples apps from clusters, a Serverless Gateway that does the routing, and an autoscaler that earns its name. Using serverless to take 4-5 hour jobs down to 20 minutes is the kind of number that makes the architectural decisions worth reading about.&lt;/p&gt;
&lt;h3&gt;Podcasts, Videos, and more&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=irtcYhgQ-vA?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;How serverless experts build with AI today | Serverless Office Hours&lt;/a&gt;&lt;br /&gt;
Mark Sailes joins Julian Wood to share how serverless experts built Study from Experts, a focused video learning platform for AWS professionals.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=tKO29SA7CAU?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Beyond the Basics: Production Serverless Patterns for Extreme Scale • Janak Agarwal • GOTO 2025&lt;/a&gt;&lt;br /&gt;
Janak digs into Lambda patterns that actually hold up under load, with two grounded examples: rapid scale-out for spiky traffic and real-time financial analytics built on Step Functions Distributed Map. This is the kind of content that should be louder than the &amp;quot;Lambda is dead&amp;quot; takes, because it shows what the architecture is genuinely good at.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.lennysnewsletter.com/p/spec-driven-development-the-ai-engineering?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Spec-driven development: The AI engineering workflow at Notion | Ryan Nystrom&lt;/a&gt;&lt;br /&gt;
Claire Vo interviews Ryan Nystrom about how Notion engineers use their internal Boxy system to @mention Codex from comments and get full PRs with screenshots in 20 minutes. The conversation covers practical workflows including configuring subagents, MCP integrations, and the shift toward spec-first development where AI handles implementation.&lt;/p&gt;
&lt;h3&gt;New from AWS&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/aws-waf-dynamic-label-interpolation?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS WAF introduces dynamic label interpolation for custom request and response handling&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-aurora-dsql-five-additional-aws-regions?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Aurora DSQL is now available in five additional AWS Regions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/aws-transform-containerization?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Transform adds containerization capability during migrations&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-route-53-resolver-ipv6?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Route 53 Resolver endpoints now support additional capabilities for IPv6 query traffic&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/aws-regional-planning-tool-notification?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Capabilities by Region now supports availability notifications&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/aws-advanced-jdbc-wrapper-encryption?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Advanced JDBC Wrapper now provides client-side encryption&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/concurrencyscaling-support-for-copy?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Redshift now scales data ingestion automatically with concurrency scaling for batch workloads&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-elasticache-cloudwatch-metrics-network-engine-diagnostics?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23365&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon ElastiCache adds thirteen new Amazon CloudWatch metrics for network capacity planning and engine diagnostics&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Final Thoughts 🤔&lt;/h3&gt;
&lt;p&gt;Look at what AWS shipped this week and squint a little. Claude Platform on AWS, Agent Toolkit, and AWS MCP Server GA, plus AgentCore gets durable file systems, metadata for long-term memory, and payments with stablecoin rails. AWS is staking out the substrate layer for the agentic era, and the feature list isn&#39;t random.&lt;/p&gt;
&lt;p&gt;The bet is straightforward. If agents need compute, identity, storage, memory, payment, and an authenticated way to call services, AWS already has four of those and is shipping the other two as fast as they can write their press releases. The pitch to enterprises is: your agents already run on AWS, your data already lives on AWS, your IAM already governs everything, so why would you run the agent loop anywhere else?&lt;/p&gt;
&lt;p&gt;It&#39;s a credible play. But the serverless comparison I mentioned earlier is the one worth thinking about. AWS had a multi-year head start with Lambda, and the platform shape was so unfamiliar that competitors took years to even define the category. Agents don&#39;t have that property. Cloudflare, Vercel, Modal, Fly, and a dozen smaller platforms are already shipping agent primitives. The Anthropic-AWS deal is notable, but Anthropic will sell its service to anyone willing to buy. Model providers are commodity inputs now. The differentiation has to come from somewhere else.&lt;/p&gt;
&lt;p&gt;The substrate fight will be won on governance, observability, and cost controls, not raw capability. Every platform is going to give agents file systems and wallets and OS-level actions. The platform that wins is the one where, when an agent does something dumb or expensive at 3 a.m., you can see exactly what happened, who authorized it, what it cost, and how to stop it from happening again. AWS has decades of muscle memory on that exact problem, which is their edge.&lt;/p&gt;
&lt;p&gt;If you&#39;re building on any of these primitives, the planning question is no longer &amp;quot;can the agent do this.&amp;quot; It&#39;s &amp;quot;when this agent does something I didn&#39;t expect, what&#39;s my blast radius and how fast can I close it.&amp;quot; Build for that and the rest takes care of itself.&lt;/p&gt;
&lt;p&gt;See you next week,&lt;br /&gt;
Jeremy&lt;/p&gt;
&lt;hr /&gt;
&lt;p&gt;I hope you enjoyed this newsletter. We&#39;re always looking for ideas and feedback to make it better and more inclusive, so please feel free to reach out to me via &lt;a href=&quot;https://bsky.app/profile/jeremydaly.com&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Bluesky&lt;/a&gt;, &lt;a href=&quot;http://www.linkedin.com/in/jeremydaly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;LinkedIn&lt;/a&gt;, &lt;a href=&quot;http://twitter.com/jeremy_daly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;X&lt;/a&gt;, or &lt;a href=&quot;mailto:contact@jeremydaly.com&quot;&gt;email&lt;/a&gt;.&lt;/p&gt;
</content>
  </entry>
  <entry>
    <title>Issue #364: Agents With Credit Cards 🛒</title>
    <link href="https://offbynone.io/issues/364/"/>
    <updated>2026-05-05T12:00:00Z</updated>
    <summary>In this issue, Amazon Bedrock crosses the final frontier of hosted frontier models, AI agents can now buy domain names for side projects they&#39;ll never finish, and Amazon Q Developer gets a one-way ticket to the AWS graveyard.</summary>
    <id>https://offbynone.io/issues/364/</id>
    <content type="html">&lt;h2&gt;Agents With Credit Cards 🛒&lt;/h2&gt;
&lt;p&gt;In our &lt;a href=&quot;https://offbynone.io/issues/363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;previous issue&lt;/a&gt;, serverless became less stateless, OpenAI dropped two major model upgrades, and Claude went after creatives. This week, Amazon Bedrock crosses the final frontier of hosted frontier models, AI agents can now buy domain names for side projects they&#39;ll never finish, and Amazon Q Developer gets a one-way ticket to the AWS graveyard. Plus, we&#39;ve got lots of amazing cloud, serverless, and AI content from the community.&lt;/p&gt;
&lt;h3&gt;News &amp;amp; Announcements&lt;/h3&gt;
&lt;p&gt;In AWS news, &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/aurora-dsql-json-support?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Aurora DSQL now supports the JSON data type with compression&lt;/a&gt;, which is a great addition that pushes DSQL closer to Postgres-style storage semantics. Over on the edge, &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-cloudfront-websockets-vpc-origins?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudFront announced WebSocket support for VPC origins&lt;/a&gt;, letting you keep origins secured inside the VPC while still allowing WebSocket traffic through. CloudFront also &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/cloudfront-invalidation-cache-tag?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;now supports invalidation by cache tag&lt;/a&gt;, which is a really big win. If you wanted to invalidate groups of files before, you had to specify all the URL patterns yourself and keep track of them. Tag-based invalidation lets you flush a logical batch of files without nuking the entire cache, which is way cheaper and more efficient.&lt;/p&gt;
&lt;p&gt;The agent autonomy story keeps getting bigger (and scarier). AWS announced that &lt;a href=&quot;https://aws.amazon.com/blogs/aws/modernize-your-workflows-amazon-workspaces-now-gives-ai-agents-their-own-desktop-preview?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon WorkSpaces now gives AI agents their own desktop in preview&lt;/a&gt;. If you still have your inventory managed with Microsoft Access on Windows 95, then this might be for you. We&#39;re slowly starting to treat AI agents as independent, autonomous things with increasingly more permissive sandboxes. That has real upside, but also real downside risk. Pair that with &lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/introducing-os-level-actions-in-amazon-bedrock-agentcore-browser?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;OS Level Actions in Amazon Bedrock AgentCore Browser&lt;/a&gt;, which lets agents interact with native popups and dialogs that previously blocked browser automation, and the sandbox metaphor gets thinner every minute. Cloudflare is on the same trajectory: &lt;a href=&quot;https://blog.cloudflare.com/agents-stripe-projects?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;agents can now create Cloudflare accounts, buy domains, and deploy&lt;/a&gt;, which is impressive, but means an agent that can stand up infrastructure is also an agent that can run up your cloud bills.&lt;/p&gt;
&lt;p&gt;Inside Bedrock AgentCore itself there was a steady stream of updates. &lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/introducing-the-agent-quality-loop-agentcore-optimization-now-in-preview?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AgentCore Optimization is now in preview&lt;/a&gt;, allowing agents to improve production performance by analyzing their own traces. &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-bedrock-agentcore?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AgentCore Identity now supports On-Behalf-Of token exchange&lt;/a&gt;, letting an agent log in as a delegated human user, which is again powerful and a little terrifying. And &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-bedrock-agentcore-runtime?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AgentCore Runtime now supports Node.js for direct code deployment&lt;/a&gt;, so you can ship Node agents as ZIP uploads with bundled &lt;code&gt;node_modules&lt;/code&gt; instead of needing a container. Also, &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/bedrock-openai-models-codex-managed-agents?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Bedrock now offers OpenAI models, Codex, and Managed Agents in limited preview&lt;/a&gt;, which means Bedrock now hosts effectively every major frontier model.&lt;/p&gt;
&lt;p&gt;On the compute and tooling side, &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/aws-lambda-adds-ruby?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Lambda added support for Ruby 4.0&lt;/a&gt;. AWS is also leaning hard into Amazon Quick. You can now &lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/generate-dashboards-from-natural-language-prompts-in-amazon-quick?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;generate dashboards from natural language prompts&lt;/a&gt; and &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-quick-macos-windows-preview?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;it&#39;s now available as a desktop application for macOS and Windows in preview&lt;/a&gt;. Meanwhile, &lt;a href=&quot;https://aws.amazon.com/blogs/devops/amazon-q-developer-end-of-support-announcement?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Q Developer got an end-of-support announcement&lt;/a&gt;, which we all knew was coming. Q Developer was a waypoint along AWS&#39;s agentic coding journey, not the destination. And the &lt;a href=&quot;https://aws.amazon.com/blogs/compute/serverless-icymi-q1-2026?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless ICYMI Q1 2026 roundup&lt;/a&gt; is worth a look. Lots of interesting stuff including durable function updates, larger Lambda, SQS, and EventBridge payloads, DynamoDB cross-account replication, and a bunch of AgentCore infrastructure work.&lt;/p&gt;
&lt;p&gt;In Anthropic news, &lt;a href=&quot;https://claude.com/blog/claude-security-public-beta?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude Security is now in public beta&lt;/a&gt;, which scans codebases for vulnerabilities by inspecting how components interact rather than pattern-matching against a CVE list. They&#39;ve already tested it with hundreds of organizations over the past two months, and the approach is impressive. Also, the &lt;a href=&quot;https://claude.com/blog/claude-api-skill?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude API skill is now available in CodeRabbit, JetBrains, Resolve AI, and Warp&lt;/a&gt;, bundling production-ready knowledge of API patterns, prompt caching rules, and per-model configuration directly into those tools and staying current as you work.&lt;/p&gt;
&lt;p&gt;Finally, on the Cloudflare side, they &lt;a href=&quot;https://blog.cloudflare.com/dynamic-workflows?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;introduced Dynamic Workflows&lt;/a&gt;, which combines durable execution with dynamic Workers so the platform can route workflow instances to different tenant code without pre-deployed targets. It&#39;s another interesting AI-agent primitive, especially for things like per-tenant CI/CD pipelines.&lt;/p&gt;
&lt;h3&gt;Tutorials&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://theburningmonk.com/2026/05/inbox-outbox-patterns-for-reliable-event-processing?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Inbox &amp;amp; Outbox patterns for reliable event processing&lt;/a&gt; by Yan Cui&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/organizing-agents-memory-at-scale-namespace-design-patterns-in-agentcore-memory?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Organizing Agents’ memory at scale: Namespace design patterns in AgentCore Memory&lt;/a&gt; by Noor Randhawa&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/run-custom-mcp-proxies-serverless-on-amazon-bedrock-agentcore-runtime?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Run custom MCP proxies serverless on Amazon Bedrock AgentCore Runtime&lt;/a&gt; by Nizar Kheir&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://www.sls.guru/blog/before-you-rebuild-your-rag-stack-7-reasons-your-answers-are-weak-its-not-the-model?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Before You Rebuild Your RAG Stack: Why Your Answers are Weak | Serverless Guru&lt;/a&gt; by Cyril Bandolo&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://darryl-ruggles.cloud/s3-files-the-end-of-download-process-upload-with-terraform?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;S3 Files: Simplified AWS Lambda Processing with Terraform&lt;/a&gt; by Darryl Ruggles&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/accreditly/replacing-puppeteer-on-aws-lambda-for-screenshots-3622?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Replacing Puppeteer on AWS Lambda for Screenshots&lt;/a&gt; by Mike Griffiths&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/aws-builders/how-i-used-amazon-quick-to-run-a-full-security-audit-on-my-saas-and-fixed-11-vulnerabilities-in-4n8o?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;How I Used Amazon Quick to Run a Full Security Audit on My SaaS — and Fixed 11 Vulnerabilities in One Session&lt;/a&gt; by Asad Marcus&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/aws-builders/i-injected-three-faults-the-agent-found-all-of-them-5pi?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;I Injected Three Faults. The Agent Found All of Them.&lt;/a&gt; by Romar Cablao&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/database/building-agentic-ai-for-amazon-rds-for-sql-server-with-strands-and-agentcore?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Building agentic AI for Amazon RDS for SQL Server with Strands and AgentCore&lt;/a&gt; by Sudhir Amin&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/rdarrylr/its-all-about-that-memory-using-long-and-short-term-memory-with-agents-2m21?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;It&#39;s All About That Memory - Using Long and Short Term Memory with Agents&lt;/a&gt; by Darryl Ruggles&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Reads&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://claude.com/blog/lessons-from-building-claude-code-prompt-caching-is-everything?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Lessons from building Claude Code: Prompt caching is everything&lt;/a&gt;&lt;br /&gt;
The Claude Code team treats prompt cache hit rate as an SRE metric with SEV alerts, because caching&#39;s prefix-match rule makes obvious optimizations backfire: switching to Haiku mid-session for an easy question costs more than letting Opus answer it. The post covers the patterns that follow, including modeling Plan Mode as tools, deferring MCP schemas via stubs, and cache-safe forking for compaction.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://architectingautonomy.substack.com/p/the-reinvention-problem?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;The Reinvention Problem&lt;/a&gt;&lt;br /&gt;
Hans Schabert and Aaron Sempf ran the same prescribed agent procedure hundreds of times and watched it splinter into dozens of execution paths, with the most common one accounting for barely a quarter of runs. Their argument: stuffing a workflow into a system prompt hands the model a reference manual when what governance actually requires is an order, and no amount of better prompting or larger context will close that gap.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://heeki.medium.com/interrupting-agents-with-human-in-the-loop-feedback-c46e806d36fe?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Interrupting agents with human-in-the-loop feedback&lt;/a&gt;&lt;br /&gt;
Heeki Park catalogs four ways to wedge human approval into an agent before it issues a refund or revokes access: model-moderated inline functions in AgentCore harness, Strands BeforeToolCallEvent hooks, in-tool ctx.interrupt() calls, and MCP server elicitations. Each comes with code samples and a clear &amp;quot;when to use&amp;quot; rubric depending on whether tool names are known upfront and who owns the tool code.&lt;/p&gt;
&lt;h3&gt;Podcasts, Videos, and more&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=FD1iDCj2r5Q?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Automating AWS Lambda runtime upgrades | Serverless Office Hours&lt;/a&gt;&lt;br /&gt;
Dan Fox and Brian Krygsman join Julian Wood to explore how AWS Transform custom can take the pain out of Lambda runtime migrations. They cover AWS Transform custom, a tool for automating Lambda runtime upgrades, and walk through how the AI agent manages code changes, dependency updates, and validation when migrating from deprecated to modern runtimes.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=y4517SxUH_s?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless &amp;amp; OpenTelemetry ❤️ Better Together&lt;/a&gt;&lt;br /&gt;
James Eastham shows you how to escape the pain of clicking through endless CloudWatch log groups and trying to piece together X-Ray by learning how to instrument your .NET serverless apps with OpenTelemetry.&lt;/p&gt;
&lt;h3&gt;New from AWS&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/workspaces-ai-agents?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon WorkSpaces now lets AI agents operate desktop applications (Preview)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/aws-iam-increased-quotas?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS IAM now provides higher maximum quotas for roles, role trust policies, instance profiles, managed policies, and identity providers&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-eventbridge-data-aws-cloudtrail?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon EventBridge supports data plane logging to AWS CloudTrail&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-cloudwatch-logs-query-by-tags?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudWatch Logs Insights supports querying by log group tags&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-opensearch-service-supports-index-level-encryption?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon OpenSearch Service now supports index-level encryption&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-cloudwatch-agent-ec2?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudWatch adds visual agent configuration to the EC2 console&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-ecs-mi-gpu-metrics?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon ECS Managed Instances now supports NVIDIA GPU metrics&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/paraphrase-multilingual-table-transformer-bielik-on-sagemaker-jumpstart?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Paraphrase-multilingual-MiniLM-L12-v2, Table Transformer Detection, and Bielik-11B-v3.0-Instruct are now available in Amazon SageMaker JumpStart&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/gemma-4-models-on-sagemaker-jumpstart?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Gemma 4 models are now available in Amazon SageMaker JumpStart&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/quick-sharepoint-access-control?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Quick now supports document-level access controls for SharePoint knowledge bases&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/custom-applications?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Build custom applications using natural language in Amazon Quick (Preview)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-quick-google-workspace-zoom?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Quick expands integrations to include Google Workspace, Zoom, Airtable, and more&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-quick-free-plus?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Start using Amazon Quick for free in minutes with Free and Plus pricing plans&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-quick?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23364&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Quick now supports document and visual creation in chat&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Final Thoughts 🤔&lt;/h3&gt;
&lt;p&gt;Agents now get their own Windows desktops. They can buy domains, spin up Cloudflare accounts, deploy infrastructure, dismiss native OS dialogs, and impersonate users via delegated tokens. A year ago we were arguing about whether agents should be allowed to run shell commands. Now AWS is handing them WorkSpaces and Cloudflare is handing them credit cards. The sandbox keeps getting roomier, and the blast radius keeps growing with it.&lt;/p&gt;
&lt;p&gt;I&#39;m not against any of this. The capability story is genuinely exciting, and most of these primitives are things real production systems need. But we&#39;re shipping the autonomy faster than the controls. On-Behalf-Of token exchange in AgentCore Identity is a great example: powerful for legitimate delegation, also a fantastic way to lose the audit trail if you&#39;re not careful about how you scope it. Same story with agents that can stand up cloud accounts. Great until one of them runs a runaway loop on your billing.&lt;/p&gt;
&lt;p&gt;The Bedrock news is the other shoe dropping. Adding OpenAI models, Codex, and Managed Agents in preview means Bedrock is now the universal hosting layer for frontier models. That&#39;s a real shift. Model choice is becoming an AWS configuration setting rather than a vendor commitment, which is good for builders and very interesting for the rest of the market.&lt;/p&gt;
&lt;p&gt;The pattern across all of this is clear: the platforms are racing to give agents more rope, and the governance, observability, and cost-control story is still catching up. If you&#39;re building on these primitives, that gap is where you live now. Plan for it.&lt;/p&gt;
&lt;p&gt;See you next week,&lt;br /&gt;
Jeremy&lt;/p&gt;
&lt;hr /&gt;
&lt;p&gt;I hope you enjoyed this newsletter. We&#39;re always looking for ideas and feedback to make it better and more inclusive, so please feel free to reach out to me via &lt;a href=&quot;https://bsky.app/profile/jeremydaly.com&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Bluesky&lt;/a&gt;, &lt;a href=&quot;http://www.linkedin.com/in/jeremydaly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;LinkedIn&lt;/a&gt;, &lt;a href=&quot;http://twitter.com/jeremy_daly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;X&lt;/a&gt;, or &lt;a href=&quot;mailto:contact@jeremydaly.com&quot;&gt;email&lt;/a&gt;.&lt;/p&gt;
</content>
  </entry>
  <entry>
    <title>Issue #363: Serverless Isn&#39;t Stateless Anymore 💾</title>
    <link href="https://offbynone.io/issues/363/"/>
    <updated>2026-04-28T12:00:00Z</updated>
    <summary>In this issue, serverless becomes less stateless, OpenAI drops two major model upgrades, and Claude goes after creatives.</summary>
    <id>https://offbynone.io/issues/363/</id>
    <content type="html">&lt;h2&gt;Serverless Isn&#39;t Stateless Anymore 💾&lt;/h2&gt;
&lt;p&gt;In our &lt;a href=&quot;https://offbynone.io/issues/362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;previous issue&lt;/a&gt;, Claude got a major upgrade, AWS made AI costs more visible, and Cloudflare went all-in on agents. This week, serverless becomes less stateless, OpenAI drops two major model upgrades, and Claude goes after creatives. Plus, we&#39;ve got plenty of content from the cloud, serverless, and AI communities.&lt;/p&gt;
&lt;h3&gt;News &amp;amp; Announcements&lt;/h3&gt;
&lt;p&gt;Maybe you noticed that AWS is turning serverless into something a lot more… stateful. &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/aws-lambda-amazon-s3?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Lambda can now mount S3 as a file system with S3 Files&lt;/a&gt;, which is a pretty big shift in how you think about data access in functions. Pair that with the &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/lambda-durable-execution-java-ga?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Lambda Durable Execution SDK for Java going GA&lt;/a&gt; and &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/lambda-durable-functions-16-new-regions?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;durable functions expanding to 16 more regions&lt;/a&gt;, and it’s clear AWS is moving Lambda toward long-running, stateful workflows without giving up the &amp;quot;serverless&amp;quot; model.&lt;/p&gt;
&lt;p&gt;On the agent side, AWS continues its work to remove developer friction. The &lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/get-to-your-first-working-agent-in-minutes-announcing-new-features-in-amazon-bedrock-agentcore?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;latest Amazon Bedrock AgentCore updates&lt;/a&gt; promise you can get a working agent running in minutes, with new capabilities around orchestration, tooling, and faster setup. That’s backed by additional &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/agentcore-new-features-to-build-agents-faster?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AgentCore feature releases&lt;/a&gt; and infrastructure improvements like &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2024/04/agentcore-gateway-identity-vpc?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Gateway + Identity support for VPC egress&lt;/a&gt;, which handles one of the more annoying real-world constraints when connecting agents to private systems.&lt;/p&gt;
&lt;p&gt;AWS and Anthropic also continue to get closer. There’s an &lt;a href=&quot;https://www.anthropic.com/news/anthropic-amazon-compute?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;expanded partnership for massive new compute capacity&lt;/a&gt;, and you can now run &lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/from-developer-desks-to-the-whole-organization-running-claude-cowork-in-amazon-bedrock?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude Cowork directly in Amazon Bedrock&lt;/a&gt;. I still think this is a great bet by AWS to own the integration point for the AI model ecosystem.&lt;/p&gt;
&lt;p&gt;After last week&#39;s Opus 4.7 announcement, you knew it wouldn&#39;t be long before OpenAI responded. &lt;a href=&quot;https://openai.com/index/introducing-gpt-5-5?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;GPT-5.5&lt;/a&gt; is here with all the expected benchmark wins and a 1M token context window, which is starting to feel less like a flex and more like table stakes. They also dropped &lt;a href=&quot;https://openai.com/index/introducing-chatgpt-images-2-0?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;ChatGPT Images 2.0&lt;/a&gt;, which is scary good. Alongside that, we got &lt;a href=&quot;https://openai.com/index/introducing-workspace-agents-in-chatgpt?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;workspace agents in ChatGPT&lt;/a&gt;, more signs of the &lt;a href=&quot;https://openai.com/index/next-phase-of-microsoft-partnership?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;next phase of the Microsoft partnership&lt;/a&gt;, and a fresh set of &lt;a href=&quot;https://openai.com/index/our-principles?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;“principles”&lt;/a&gt; to remind us everything is under control. 😳&lt;/p&gt;
&lt;p&gt;Anthropic isn&#39;t slowing down either. They just announced &lt;a href=&quot;https://www.anthropic.com/news/claude-for-creative-work?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude for Creative Work&lt;/a&gt;, which includes new plugins and integrations with partners like Blender, Autodesk, Adobe, Ableton, and Splice. These are tools that let Claude work directly alongside the software creative professionals are using every day. Their strategy is absolutely 🔥. They’re also rolling out &lt;a href=&quot;https://claude.com/blog/claude-managed-agents-memory?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;built-in memory for Claude managed agents&lt;/a&gt;, now in public beta. Memory is quickly becoming the differentiator, and everyone is racing to make it feel less like a hack and more like infrastructure.&lt;/p&gt;
&lt;h3&gt;Tutorials&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/database/dsql-sql-dialect-how-amazon-aurora-dsql-differs-from-single-instance-postgresql?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;DSQL SQL Dialect: How Amazon Aurora DSQL differs from single-instance PostgreSQL&lt;/a&gt; by Rob Petersen&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/tanseer/your-aws-cognito-emails-are-going-to-spam-here-is-how-to-fix-it-step-by-step-4989?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Your AWS Cognito Emails Are Going to Spam — Here Is How to Fix It Step by Step&lt;/a&gt; by Tanseer&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/abhishek_gupta_pinpo/dynamodb-vs-rds-at-10k-100k-and-1m-rps-a-pre-deployment-simulation-comparison-3eco?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;DynamoDB vs RDS at 10K, 100K, and 1M RPS: a pre-deployment simulation comparison&lt;/a&gt; by Abhishek Gupta&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/aws-heroes/serverless-applications-on-aws-with-lambda-using-java-25-api-gateway-and-aurora-dsql-part-6-34ni?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless applications on AWS with Lambda using Java 25, API Gateway and Aurora DSQL - Part 6 Using GraalVM Native Image&lt;/a&gt; by Vadym Kazulkin&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/database/best-practices-and-architecture-patterns-for-cross-account-sharing-in-oracle-databaseaws?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Best practices and architecture patterns for cross-account sharing in Oracle Database@AWS&lt;/a&gt; by Yamuna Palasamudram&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/cost-effective-multilingual-audio-transcription-at-scale-with-parakeet-tdt-and-aws-batch?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Cost-effective multilingual audio transcription at scale with Parakeet-TDT and AWS Batch&lt;/a&gt; by Gleb Geinke&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/build-strands-agents-with-sagemaker-ai-models-and-mlflow?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Build Strands Agents with SageMaker AI models and MLflow&lt;/a&gt; by Dheeraj Hegde&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://studyfromexperts.com/blogs/securing-private-video-content-with-cloudfront-signed-urls-and-serverless-on-aws?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Securing Private Video Content with CloudFront Signed URLs and Serverless on AWS&lt;/a&gt; by Lee Gilmore&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://heeki.medium.com/building-an-agent-harness-31942331d605?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Building an agent harness&lt;/a&gt; by Heeki Park&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://developers.openai.com/api/docs/guides/deployment-checklist?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;OpenAI API deployment checklist&lt;/a&gt; by OpenAI&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Reads&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://www.linkedin.com/pulse/anthropic-opus-46-vs-47-which-better-code-quality-experiment-goh-uroxc?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Anthropic Opus 4.6 vs 4.7 - Which is better? A code quality experiment.&lt;/a&gt;&lt;br /&gt;
An AWS AI Hero tests Claude Opus 4.6 against 4.7 using the same Tetris implementation requirements across 13 code quality dimensions. Some folks are calling 4.6 a step back, but 4.7 seems to be finding its footing. I’ve been pretty happy with it so far. Feels like a reminder that model progress isn’t always a straight line, but the trajectory still points up.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://dev.to/riya_mittal_cdd264250ad45/serverless-finops-why-lambda-cost-models-break-every-assumption-you-learned-from-vms-42c5?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless FinOps: Why Lambda Cost Models Break Every Assumption You Learned from VMs&lt;/a&gt;&lt;br /&gt;
Riya Mittal explains how Lambda&#39;s three-dimensional pricing (invocations, duration, memory) creates a fundamentally different cost model than VMs. Keeping cost top of mind is table stakes now. But optimizing for cost alone misses the bigger picture. Scale, performance, and operational overhead all show up eventually. The real game is balancing all three without painting yourself into a corner.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://claude.com/blog/building-agents-that-reach-production-systems-with-mcp?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Building agents that reach production systems with MCP&lt;/a&gt;&lt;br /&gt;
Nice breakdown of three different ways to wire systems into MCP servers. More importantly, it’s another example of patterns starting to solidify. Still early, still messy, but the industry is slowly converging on what “good” looks like.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://edjgeek.com/blog/lambda-cold-starts-dead?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Cold Starts Are Dead&lt;/a&gt;&lt;br /&gt;
Cold starts aren’t what they used to be. There are still edge cases, but for most workloads, they’re manageable or negligible. Eric Johnson covers how platform improvements and better patterns minimize them, resulting in them rarely showing up where it actually matters.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://openai.com/index/speeding-up-agentic-workflows-with-websockets?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Speeding up agentic workflows with WebSockets in the Responses API&lt;/a&gt;&lt;br /&gt;
OpenAI explains how they reduced agentic workflow latency by 40% using WebSockets instead of repeated HTTP requests. The technical approach maintains persistent connections and caches conversation state, eliminating redundant processing of conversation history while exposing the full speed of their faster GPT-5.3-Codex-Spark model.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://teriradichel.substack.com/p/reducing-token-burn-rate-with-a-well?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Reducing Token Burn Rate With A Well-Designed Architecture&lt;/a&gt;&lt;br /&gt;
Teri Radichel walks through building a Lambda troubleshooting system that separates deterministic data gathering from AI analysis. The approach avoids burning tokens on repetitive queries by using traditional code to collect logs and configuration, only invoking AI for interpretation. Stop wasting tokens on repetitive work and only pay for actual insight.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://tylerfolkman.substack.com/p/i-run-qwen-36-on-two-gpus-because?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;I Run Qwen 3.6 on Two GPUs Because Renting AI Is Boring&lt;/a&gt;&lt;br /&gt;
Tyler Folkman explains that locally hosted models might not match the top-tier APIs, but they have one big advantage. They don’t go down. While Anthropic and others keep having “moments” (like as I&#39;m writing this), running your own stack looks a lot less boring.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://architectingautonomy.substack.com/p/the-escalation-trap?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;The Escalation Trap&lt;/a&gt;&lt;br /&gt;
Aaron Sempf walks through three failure modes of human escalation in AI systems: over-escalation creating bottlenecks, selective escalation missing new edge cases, and avoiding escalation entirely. The piece argues for moving escalation decisions to a separate governance layer that evaluates authority boundaries before execution. This is a hard problem.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://ranthebuilder.cloud/blog/how-i-use-claude-cowork-to-write-with-ai-in-my-voice?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;How I Use Claude Cowork To Write With AI In My Voice&lt;/a&gt;&lt;br /&gt;
Ran Isenberg walks through his Claude Cowork configuration for generating content that sounds human. But trying too hard to “not sound like AI” can backfire. A lot of the things people avoid, like short sentences and clarity, are just good writing. The goal shouldn&#39;t be to hide AI; it should be to help you articulate &lt;em&gt;your thoughts and ideas&lt;/em&gt; clearly.&lt;/p&gt;
&lt;h3&gt;Podcasts, Videos, and more&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=8rjKqb79Qyg?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless CrAIc Ep 83 Psychological Safety in the AI Era (No One Talks About This)&lt;/a&gt;&lt;br /&gt;
Serverless CrAIc explores how rapid AI adoption challenges team dynamics, mentorship capacity, and organizational culture. Keeping up used to be hard, but now it’s relentless. Fast doesn’t guarantee success, but it does help with learning. And that’s the frustrating part. Watching others move quickly and wondering what they’ve figured out that you haven’t. Also, no, we’re probably not all losing our jobs tomorrow. But it’s not crazy to wonder what the people in charge think.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=KRT0Z7k01GE?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Lambda durable functions: Best Practices, AI patterns, and Futures | Serverless Office Hours&lt;/a&gt;&lt;br /&gt;
Michael Gasch and Eric Johnson join Julian Wood to explore the latest in AWS Lambda durable functions, from Java SDK GA, S3 File support, to what&#39;s coming next.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.lennysnewsletter.com/p/how-anthropics-product-team-moves?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;How Anthropic’s product team moves faster than anyone else | Cat Wu (Head of Product, Claude Code)&lt;/a&gt;&lt;br /&gt;
Lenny&#39;s interview with Cat Wu explores how Anthropic builds products in days rather than months, and why their employees build custom internal tools instead of buying SaaS. Lots of great insights in here, including emerging PM skills in AI and the shift toward managing AI agent fleets rather than doing tasks yourself.&lt;/p&gt;
&lt;h3&gt;New from AWS&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-msk-serverless-13-regions?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon MSK Serverless expands to 13 new AWS regions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/qwen-models-on-sagemaker-jumpstart?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Five new Qwen models for coding agents and efficient reasoning are now available in Amazon SageMaker JumpStart&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/aws-data-exports-cross-account-delivery-cost?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Data Exports now supports cross-account delivery&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/backup-policies-aurora-dsql-redshift-serverless?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Backup adds Amazon Redshift Serverless and Aurora DSQL support for AWS Organizations backup policies&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/cloudwatch-logs-insights-join-sub-query?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudWatch Logs Insights introduces JOIN and sub-query commands&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-location-service-bulk-address-validation?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Location Service now offers bulk address validation for the United States, Canada, Australia, and the United Kingdom&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/s3-five-additional-checksum-algorithms?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon S3 now supports five additional checksum algorithms&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-redshift-serverless-ai-driven-scaling-default?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Redshift Serverless AI-driven scaling is now the default for new workgroups&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/redshift-update-delete-merge-iceberg-tables?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Redshift supports UPDATE, DELETE, MERGE for Apache Iceberg tables&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-sagemaker-ft-qwen3-5?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon SageMaker AI now supports serverless model customization for Qwen3.5 models&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-smus-ci-cd-cli?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon SageMaker Unified Studio now offers CI/CD CLI for data and AI applications&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/sagemaker-ai-inference-rec?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon SageMaker AI launches optimized generative AI inference recommendations&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-athena?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Athena simplifies federated queries with managed connectors&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/aws-compute-optimizer-ec2-rds?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Compute Optimizer supports 162 new EC2 instance types and 32 new RDS DB instance classes&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/amazon-sagemaker-ai-now-supports-optimized-generative-ai-inference-recommendations?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon SageMaker AI now supports optimized generative AI inference recommendations&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Developer Tools&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/developer/create-minimal-reproductions-for-aws-sdk-javascript-v3-with-create-aws-sdk-repro?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23363&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Create minimal reproductions for AWS SDK JavaScript v3 with create-aws-sdk-repro&lt;/a&gt; by John Lwin&lt;br /&gt;
AWS released create-aws-sdk-repro, a CLI tool that generates boilerplate for AWS SDK for JavaScript v3 projects. It handles service selection, environment setup (Node.js, Browser, or React Native), and creates projects with proper imports and credentials configuration already in place.&lt;/p&gt;
&lt;h3&gt;Final Thoughts 🤔&lt;/h3&gt;
&lt;p&gt;It&#39;s getting a lot easier to build systems that don&#39;t forget.&lt;/p&gt;
&lt;p&gt;Serverless isn&#39;t stateless anymore. Agents are getting access to real tools, real workflows, and persistent memory. And the infrastructure is finally starting to reflect that shift, with better primitives for state, orchestration, and long-running execution.&lt;/p&gt;
&lt;p&gt;But the tradeoffs are changing. We&#39;re layering memory into systems that were designed to be ephemeral. Giving agents persistence across sessions. Letting them interact with tools and data in ways that blur the line between request and workflow. The hard part isn&#39;t adding memory. It&#39;s deciding what gets promoted from a single session into something durable, what stays scoped to one agent versus shared across many, and what should be forgotten on purpose.&lt;/p&gt;
&lt;p&gt;That&#39;s where things get complicated. State introduces responsibility. Memory introduces risk. Every piece of context an agent carries forward is something you now have to govern. Who can read it, when it expires, how it&#39;s surfaced back into a prompt, and what happens when it&#39;s wrong. The more capable these systems become, the more those decisions start to look like product decisions, not implementation details.&lt;/p&gt;
&lt;p&gt;At the same time, the direction is forming. Serverless platforms are adding stateful primitives. Agent frameworks are focusing on orchestration instead of just prompts. Memory is becoming a first-class concept instead of a bolted-on feature. Even model providers are starting to expose more control over how context is stored, retrieved, and applied.&lt;/p&gt;
&lt;p&gt;It&#39;s not just about generating better responses anymore. It&#39;s about building systems that can carry context forward. Systems that can act, adapt, and remember without breaking the guarantees we still rely on. The shift is real, and it&#39;s accelerating.&lt;/p&gt;
&lt;p&gt;Because once systems start remembering, everything else has to change with them.&lt;/p&gt;
&lt;p&gt;See you next week,&lt;br /&gt;
Jeremy&lt;/p&gt;
&lt;hr /&gt;
&lt;p&gt;I hope you enjoyed this newsletter. We&#39;re always looking for ideas and feedback to make it better and more inclusive, so please feel free to reach out to me via &lt;a href=&quot;https://bsky.app/profile/jeremydaly.com&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Bluesky&lt;/a&gt;, &lt;a href=&quot;http://www.linkedin.com/in/jeremydaly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;LinkedIn&lt;/a&gt;, &lt;a href=&quot;http://twitter.com/jeremy_daly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;X&lt;/a&gt;, or &lt;a href=&quot;mailto:contact@jeremydaly.com&quot;&gt;email&lt;/a&gt;.&lt;/p&gt;
</content>
  </entry>
  <entry>
    <title>Issue #362: Mo’ Models, Mo’ Problems ⚠️</title>
    <link href="https://offbynone.io/issues/362/"/>
    <updated>2026-04-21T12:00:00Z</updated>
    <summary>In this issue, Claude gets a major upgrade, AWS makes AI costs more visible, and Cloudflare goes all-in on agents.</summary>
    <id>https://offbynone.io/issues/362/</id>
    <content type="html">&lt;h2&gt;Mo’ Models, Mo’ Problems ⚠️&lt;/h2&gt;
&lt;p&gt;In our &lt;a href=&quot;https://offbynone.io/issues/361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;previous issue&lt;/a&gt;, AI started breaking things faster than we can defend them, AWS launched an agent registry, and S3 kinda became a filesystem. This week, Claude gets a major upgrade, AWS makes AI costs more visible, and Cloudflare goes all-in on agents. Plus, we&#39;ve got some amazing cloud, serverless, and AI content from the community.&lt;/p&gt;
&lt;h3&gt;News &amp;amp; Announcements&lt;/h3&gt;
&lt;p&gt;Anthropic announced &lt;a href=&quot;https://www.anthropic.com/news/claude-opus-4-7?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude Opus 4.7&lt;/a&gt; this past week as their latest push towards world domination. Early signals point to serious gains in software engineering, especially for long-running tasks, plus stronger vision support. AWS wasted no time &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/claude-opus-4.7-amazon-bedrock?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;rolling it out in Bedrock&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;AWS also introduced &lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/introducing-granular-cost-attribution-for-amazon-bedrock?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;granular cost attribution for Amazon Bedrock&lt;/a&gt;, which is a big step toward actually understanding AI spend. Cost control and observability for LLMs is still pretty messy, and being able to map usage down to IAM users and roles starts to make that problem a lot more tractable.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/aurora-serverless-smarter-scaling?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Aurora Serverless&lt;/a&gt; is getting up to 30% better performance with smarter scaling, while still keeping the scale-to-zero promise. There’s a deeper dive from the team &lt;a href=&quot;https://aws.amazon.com/blogs/database/aurora-serverless-faster-performance-enhanced-scaling-and-still-scales-down-to-zero?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;here&lt;/a&gt; if you want the details. I like this direction.&lt;/p&gt;
&lt;p&gt;AWS also announced general availability of &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/aws-announces-ga-AWS-interconnect-multicloud?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Interconnect&lt;/a&gt;, kicking things off with Google Cloud. Dedicated bandwidth between clouds is becoming a thing, with Azure and Oracle Cloud Infrastructure expected to follow later this year. Let the homogeneity begin.&lt;/p&gt;
&lt;p&gt;Anthropic introduced &lt;a href=&quot;https://claude.com/blog/introducing-routines-in-claude-code?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;routines in Claude Code&lt;/a&gt;, which basically turns repeatable development workflows into something you can automate. Feels like another positive step toward making agents more useful in day-to-day dev work. They also highlighted what people are building in their ecosystem with &lt;a href=&quot;https://claude.com/blog/meet-the-winners-of-our-built-with-opus-4-6-claude-code-hackathon?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;their latest hackathon winners&lt;/a&gt;. No fluff, they&#39;re all practical AI solutions that address real pain points. 🤷&lt;/p&gt;
&lt;p&gt;It was Agents Week over at Cloudflare last week, and they shipped &lt;em&gt;a lot&lt;/em&gt;. The full rundown of launches is &lt;a href=&quot;https://blog.cloudflare.com/agents-week-in-review?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;here&lt;/a&gt;, but there were a few standouts: &lt;a href=&quot;https://blog.cloudflare.com/ai-search-agent-primitive?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AI Search&lt;/a&gt; as a core primitive for agents, &lt;a href=&quot;https://blog.cloudflare.com/flagship?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Flagship&lt;/a&gt; to bring feature flags into the agent era, &lt;a href=&quot;https://blog.cloudflare.com/introducing-agent-memory?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Agent Memory&lt;/a&gt;, and a new &lt;a href=&quot;https://blog.cloudflare.com/email-for-agents?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;email service for agents&lt;/a&gt; in public beta.&lt;/p&gt;
&lt;p&gt;Not on my 2026 Bingo card, but Apple announced that &lt;a href=&quot;https://www.apple.com/newsroom/2026/04/tim-cook-to-become-apple-executive-chairman-john-ternus-to-become-apple-ceo?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Tim Cook is stepping into the Executive Chairman role at Apple, with John Ternus taking over as CEO&lt;/a&gt;. Big shift for one of the most stable leadership runs in tech. I&#39;m sure it has nothing to do with Apple Intelligence. 😬&lt;/p&gt;
&lt;p&gt;And in case you missed it, the recent &lt;a href=&quot;https://vercel.com/kb/bulletin/vercel-april-2026-security-incident?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Vercel hack&lt;/a&gt; highlights a growing pattern in cybersecurity. Third-party AI tooling accessing internal systems is introducing a whole new threat model. One that most teams aren’t even aware of, never mind prepared for.&lt;/p&gt;
&lt;p&gt;If your incident response still involves five tabs, three tools, and someone asking “who’s on point?”, it might be time to rethink things. &lt;a href=&quot;https://fandf.co/3OcoQib&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;incident.io&lt;/a&gt; is an all-in-one platform that runs Slack and Teams native, so you can declare, manage, and resolve incidents without leaving the conversation. It handles the busywork too, auto-assigning roles, kicking off workflows, and even surfacing insights from past incidents so you don’t keep fixing the same problem twice. Definitely worth a deeper look if you want faster response times without adding more process: &lt;a href=&quot;https://fandf.co/3OcoQib&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;incident.io&lt;/a&gt;. &lt;code&gt;Sponsored&lt;/code&gt;&lt;/p&gt;
&lt;h3&gt;Tutorials&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://edjgeek.com/blog/lambda-managed-instances?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Lambda Managed Instances: A Working Demo and the Math Behind It&lt;/a&gt; by Eric Johnson&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/transform-retail-with-aws-generative-ai-services?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Transform retail with AWS generative AI services&lt;/a&gt; by Bhavya Chugh&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/jayaganeshk/the-hidden-cost-of-aws-lambda-snapstart-for-python-and-how-i-fixed-it-with-durable-functions-2ba4?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;The Hidden Cost of AWS Lambda SnapStart for Python, and How I Fixed It with Durable Functions&lt;/a&gt; by Jaya Ganesh&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://claude.com/blog/best-practices-for-using-claude-opus-4-7-with-claude-code?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Best practices for using Claude Opus 4.7 with Claude Code&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/power-video-semantic-search-with-amazon-nova-multimodal-embeddings?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Power video semantic search with Amazon Nova Multimodal Embeddings&lt;/a&gt; by Amit Kalawat&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/aws-heroes/serverless-applications-on-aws-with-lambda-using-java-25-api-gateway-and-dynamodb-part-6-using-1ji?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless applications on AWS with Lambda using Java 25, API Gateway and DynamoDB - Part 6 Using GraalVM Native Image&lt;/a&gt; by Vadym Kazulkin&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/database/accelerate-database-migration-to-amazon-aurora-dsql-with-kiro-and-amazon-bedrock-agentcore?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Accelerate database migration to Amazon Aurora DSQL with Kiro and Amazon Bedrock AgentCore&lt;/a&gt; by Noorul Mahajabeen Mustafa&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://www.infoq.com/articles/lambda-extension-deferred-flush?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Using AWS Lambda Extensions to Run Post-Response Telemetry Flush&lt;/a&gt; by Melvin Philips&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/aws-heroes/serverless-applications-on-aws-with-lambda-using-java-25-api-gateway-and-aurora-dsql-part-5-3dlj?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless applications on AWS with Lambda using Java 25, API Gateway and Aurora DSQL - Part 5 SnapStart + full priming&lt;/a&gt; by Vadym Kazulkin&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Reads&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/navigating-the-generative-ai-journey-the-path-to-value-framework-from-aws?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Navigating the generative AI journey: The Path-to-Value framework from AWS&lt;/a&gt;&lt;br /&gt;
AWS tries to put some structure around the chaos with a “Path-to-Value” framework. It’s less of a step-by-step guide and more of a reminder that AI adoption is messy, multidimensional, and mostly about tradeoffs between value, risk, and organizational reality.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://blog.cloudflare.com/past-bots-and-humans?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Moving past bots vs. humans&lt;/a&gt;&lt;br /&gt;
The bot vs human model is breaking down fast. Cloudflare is leaning into intent over identity, which feels like the right direction as agents start acting more like users and users start looking more like bots.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://blog.cloudflare.com/internal-ai-engineering-stack?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;The AI engineering stack we built internally — on the platform we ship&lt;/a&gt;&lt;br /&gt;
Always interesting when a company dogfoods its own stack at scale. Cloudflare’s setup is a good look at what a modern AI platform actually needs when you’re pushing billions of tokens and not just running demos.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.taskade.com/blog/multi-agent-production?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Multi-Agent AI in Production | Taskade Engineering (2026)&lt;/a&gt;&lt;br /&gt;
Three years into multi-agent systems and the same problems keep showing up. Memory, coordination, and agents getting stuck in loops. Good practical patterns here, especially if you’ve already hit these walls.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://dev.to/awshuss/why-aws-certified-genai-developer-stands-apart-from-other-aws-certs-14n?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Why AWS Certified GenAI Developer stands apart from other AWS certs&lt;/a&gt;&lt;br /&gt;
Anwaar Hussain points out that this cert is less about knowing AI and more about wiring it into real systems. Which is probably the right shift, because building with AI is quickly becoming more of an architecture problem than a modeling one.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://dev.to/amitkayal/lessons-i-learned-building-a-memory-aware-agent-with-amazon-bedrock-agentcore-runtime-4lc9?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Lessons I learned building a memory-aware agent with Amazon Bedrock AgentCore Runtime&lt;/a&gt;&lt;br /&gt;
Memory is still the hardest part of agent design. Amit Kayal gives us a solid walkthrough of scoping, lifecycle, and not blowing up your prompts while trying to make agents feel stateful.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://newsletter.pragmaticengineer.com/p/learnings-from-conducting-1000-interviews?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Learnings from conducting ~1,000 interviews at Amazon&lt;/a&gt;&lt;br /&gt;
Steve Huynh shares a good reminder that hiring is its own system with its own signals. If you don’t understand what a company actually optimizes for, you’re probably optimizing for the wrong thing.&lt;/p&gt;
&lt;h3&gt;Podcasts, Videos, and more&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=SvKXhFVVbGY?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless Apache Airflow | Serverless Office Hours&lt;/a&gt;&lt;br /&gt;
Airflow, but make it serverless. John Jackson and Kamen Sharlandjiev breakdown when MWAA actually makes sense versus just reaching for Step Functions, especially once you factor in cost, scaling, and how much orchestration complexity you really need.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?t=3s&amp;v=xqRUnoaQiUM?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Building, Managing &amp;amp; Governing APIs on AWS&lt;/a&gt;&lt;br /&gt;
APIs aren’t just for humans anymore. Giedrius Praspaliauskas covers the full lifecycle on AWS, but the interesting part is how API strategies are evolving to support agents, not just apps. Same primitives, very different consumers.&lt;/p&gt;
&lt;h3&gt;New from AWS&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-msk-replicator-external-kafka-cluster-support?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;MSK Replicator now supports replication from external Apache Kafka clusters to MSK Express Brokers&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-msk-replicator-enhanced-consumer-offset-synchronization?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon MSK Replicator now supports enhanced consumer offset synchronization for bidirectional replication&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-msk-replicator-logs?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon MSK Replicator now supports log forwarding for replication visibility&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-documentdb-mongodb-in-place-version-upgrade-5-0-to-8-0?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon DocumentDB (with MongoDB compatibility) now supports in-place upgrade from version 5.0 to 8.0&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-ecr-pull-through-cache-referrers?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon ECR Pull Through Cache Now Supports Referrer Discovery and Sync&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-cloudwatch-cross-region-enablement-rules?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudWatch now supports cross-region telemetry auditing and enablement rules&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Developer Tools&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://github.com/brognilucas/sls-testing?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;brognilucas/sls-testing&lt;/a&gt; by Lucas Brogni&lt;br /&gt;
Typed, composable testing utilities for AWS Lambda from Lucas Brogni that provides event builders and Jest matchers for Lambda functions.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://dev.to/pujaaan/i-got-tired-of-writing-the-same-cdk-wiring-so-i-built-simple-cdk-obg?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;pujaaan/simple-cdk&lt;/a&gt; by pujaaan&lt;br /&gt;
A thin runtime over CDK that scans your folders, runs adapters in a deterministic three-phase pipeline (discover → register → wire), and emits real CDK constructs.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/toolsimulator-scalable-tool-testing-for-ai-agents?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23362&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;ToolSimulator: scalable tool testing for AI agents&lt;/a&gt; by Darren Wang&lt;br /&gt;
An LLM-powered tool simulation framework within Strands Evals to thoroughly and safely test AI agents that rely on external tools, at scale.&lt;/p&gt;
&lt;h3&gt;Final Thoughts 🤔&lt;/h3&gt;
&lt;p&gt;It’s getting a lot easier to build powerful systems.&lt;/p&gt;
&lt;p&gt;Models are getting better at real work. Agents are starting to handle meaningful workflows. And the infrastructure around all of this is finally catching up, from cost visibility to orchestration to deployment patterns.&lt;/p&gt;
&lt;p&gt;But the gaps are still there.&lt;/p&gt;
&lt;p&gt;We’re wiring these capabilities into systems that were never designed for autonomous behavior. Giving tools access to internal systems. Letting agents make decisions across boundaries that used to be tightly controlled. And in some cases, we’re doing it faster than we understand the implications.&lt;/p&gt;
&lt;p&gt;That’s where things start to break.&lt;/p&gt;
&lt;p&gt;The Vercel incident isn’t an outlier. It’s a preview. A glimpse into what happens when powerful models meet loosely defined boundaries and third-party integrations. The tooling is evolving quickly, but the assumptions behind our systems haven’t fully caught up yet.&lt;/p&gt;
&lt;p&gt;At the same time, you can see the industry starting to respond.&lt;/p&gt;
&lt;p&gt;Better cost attribution. More structured agent workflows. Dedicated primitives for memory, search, and control. Even multicloud connectivity is starting to blur the lines between platforms. It’s not just about building faster anymore, it’s about building systems that can actually support what we’re asking them to do.&lt;/p&gt;
&lt;p&gt;Still early. Still messy. But the pattern is emerging. More power, more abstraction, and more responsibility to get the boundaries right.&lt;/p&gt;
&lt;p&gt;Because “Mo’ Models, Mo’ Problems” isn’t really a joke. It’s just the beginning.&lt;/p&gt;
&lt;p&gt;See you next week,&lt;br /&gt;
Jeremy&lt;/p&gt;
&lt;hr /&gt;
&lt;p&gt;I hope you enjoyed this newsletter. We&#39;re always looking for ideas and feedback to make it better and more inclusive, so please feel free to reach out to me via &lt;a href=&quot;https://bsky.app/profile/jeremydaly.com&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Bluesky&lt;/a&gt;, &lt;a href=&quot;http://www.linkedin.com/in/jeremydaly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;LinkedIn&lt;/a&gt;, &lt;a href=&quot;http://twitter.com/jeremy_daly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;X&lt;/a&gt;, or &lt;a href=&quot;mailto:contact@jeremydaly.com&quot;&gt;email&lt;/a&gt;.&lt;/p&gt;
</content>
  </entry>
  <entry>
    <title>Issue #361: S3 is Still Not A File System 🤷</title>
    <link href="https://offbynone.io/issues/361/"/>
    <updated>2026-04-14T12:00:00Z</updated>
    <summary>In this issue, AI starts breaking things faster than we can defend them, AWS launches an agent registry, and S3 becomes a filesystem (sort of).</summary>
    <id>https://offbynone.io/issues/361/</id>
    <content type="html">&lt;h2&gt;S3 is Still Not A File System 🤷&lt;/h2&gt;
&lt;p&gt;In our &lt;a href=&quot;https://offbynone.io/issues/360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;previous issue&lt;/a&gt;, Claude leaked its secrets, AWS locked down S3, and Cloudflare made a run at WordPress. This week, AI starts breaking things faster than we can defend them, AWS launches an agent registry, and S3 becomes a filesystem (sort of). Plus, we&#39;ve got plenty of awesome content from the cloud, serverless, and AI communities.&lt;/p&gt;
&lt;h3&gt;News &amp;amp; Announcements&lt;/h3&gt;
&lt;p&gt;Anthropic just dropped &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-bedrock-claude-mythos?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude Mythos Preview&lt;/a&gt;, a gated research release (available on Amazon Bedrock) that’s already raising eyebrows. Early reports suggest it’s extremely good at finding vulnerabilities in long-standing systems, which is both impressive and a little terrifying. They’re rolling it out slowly so people can &lt;a href=&quot;https://claude.com/blog/preparing-your-security-program-for-ai-accelerated-offense?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;prepare their security programs for AI-accelerated offense&lt;/a&gt;. The reality is AI is getting very, very good at breaking things, and we’re going to need to rethink how we defend systems that were never designed for this level of pressure.&lt;/p&gt;
&lt;p&gt;AWS made a bunch of moves in Bedrock this week. They added &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/bedrock-iam-cost-allocation?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;cost allocation by IAM user and role&lt;/a&gt;, which is a big win for visibility into AI spend. They also introduced &lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/introducing-stateful-mcp-client-capabilities-on-amazon-bedrock-agentcore-runtime?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;stateful MCP client capabilities in AgentCore Runtime&lt;/a&gt; and added OS-level actions to the &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/agentcore-browser-os-actions?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AgentCore Browser&lt;/a&gt;. Good stuff.&lt;/p&gt;
&lt;p&gt;AWS also launched the &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/aws-agent-registry-in-agentcore-preview?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Agent Registry in preview&lt;/a&gt;. Centralized discovery and governance is going to be critical as agent sprawl becomes a thing (think shadow APIs on steroids). There’s a deeper dive on the vision in &lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/the-future-of-managing-agents-at-scale-aws-agent-registry-now-in-preview?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;this post on managing agents at scale&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Big S3 news this week with the launch of &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-s3-files?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;S3 Files&lt;/a&gt;, which essentially puts a filesystem interface in front of your buckets. If you want more detail, the full breakdown is in &lt;a href=&quot;https://aws.amazon.com/blogs/aws/launching-s3-files-making-s3-buckets-accessible-as-file-systems?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;this launch post&lt;/a&gt;. Still not a filesystem… but closer than ever before.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/aws-lambda-response-streaming?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Lambda response streaming&lt;/a&gt; is now available in all commercial regions, which is great to see. This feature has unlocked much better UX for real-time apps. I&#39;m glad there&#39;s no longer a regional bottleneck.&lt;/p&gt;
&lt;p&gt;If you still love PHP, you can now plug into Aurora DSQL with the new &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/aurora-dsql-connector-for-php?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;PHP connector&lt;/a&gt;. Not flashy, but extremely useful.&lt;/p&gt;
&lt;p&gt;Anthropic also introduced &lt;a href=&quot;https://claude.com/blog/claude-managed-agents?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude Managed Agents&lt;/a&gt;, which is basically a faster path to getting agents into production without wiring everything up yourself. Less plumbing, more doing.&lt;/p&gt;
&lt;p&gt;Cloudflare announced &lt;a href=&quot;https://blog.cloudflare.com/sandbox-ga?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Sandboxes GA&lt;/a&gt;. Giving agents their own isolated environments feels like table stakes at this point.&lt;/p&gt;
&lt;p&gt;And finally, Vercel might be getting ready for Wall Street. Their CEO is signaling IPO readiness as AI agents drive growth. You can read more about it &lt;a href=&quot;https://techcrunch.com/2026/04/13/vercel-ceo-guillermo-rauch-signals-ipo-readiness-as-ai-agents-fuel-revenue-surge?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;here&lt;/a&gt;.&lt;/p&gt;
&lt;h3&gt;Tutorials&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/embed-a-live-ai-browser-agent-in-your-react-app-with-amazon-bedrock-agentcore?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Embed a live AI browser agent in your React app with Amazon Bedrock AgentCore&lt;/a&gt; by Sundar Raghavan&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/manage-ai-costs-with-amazon-bedrock-projects?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Manage AI costs with Amazon Bedrock Projects&lt;/a&gt; by Ba&#39;Carri Johnson&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://claude.com/blog/seeing-like-an-agent?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Seeing like an agent: how we design tools in Claude Code&lt;/a&gt; by Thariq Shihipar&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/aws-heroes/serverless-applications-on-aws-with-lambda-using-java-25-api-gateway-and-aurora-dsql-part-4-1kg5?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless applications on AWS with Lambda using Java 25, API Gateway and Aurora DSQL - Part 4 SnapStart + DSQL request priming&lt;/a&gt; by Vadym Kazulkin&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/building-intelligent-audio-search-with-amazon-nova-embeddings-a-deep-dive-into-semantic-audio-understanding?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Building intelligent audio search with Amazon Nova Embeddings: A deep dive into semantic audio understanding&lt;/a&gt; by Madhavi Evana&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://edjgeek.com/blog/s3-files-lambda-agents?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Lambda Just Got a File System. I Put AI Agents on It.&lt;/a&gt; by Eric Johnson&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/aws-heroes/serverless-applications-on-aws-with-lambda-using-java-25-api-gateway-and-dynamodb-part-5-5gpe?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless applications on AWS with Lambda using Java 25, API Gateway and DynamoDB - Part 5 SnapStart + full priming&lt;/a&gt; by Vadym Kazulkin&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Reads&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://martinmueller.dev/openclaw-eng?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;How I Use OpenClaw as My AI-Powered Personal Operating System&lt;/a&gt;&lt;br /&gt;
Running an “AI OS” stops sounding crazy once you see it working. I like Martin Mueller&#39;s isolation model here. The Agent Hub project I&#39;ve been working on takes a similar approach, running agents in Docker sandboxes (lightweight VMs built for exactly this use case).&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://blog.cloudflare.com/enterprise-mcp?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Scaling MCP adoption: Our reference architecture for simpler, safer and cheaper enterprise deployments of MCP&lt;/a&gt;&lt;br /&gt;
Cloudflare lays out a very real architecture for scaling MCP across an enterprise. This is exactly the kind of thing you have to think about. Context, auth, routing, and security all become first-class concerns once your agents start doing real work.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://claude.com/blog/multi-agent-coordination-patterns?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Multi-agent coordination patterns: Five approaches and when to use them&lt;/a&gt;&lt;br /&gt;
I’m a huge fan of patterns, and seeing these emerge in the agent ecosystem is exciting. We’re finally getting reusable, battle-tested approaches that unlock real capabilities instead of everyone rebuilding one-off hacks.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://ranthebuilder.cloud/blog/i-tested-three-spec-driven-ai-tools-here-s-my-honest-take?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;I Tested Three Spec-Driven AI Tools. Here’s My Honest Take.&lt;/a&gt;&lt;br /&gt;
I’m still not sold on spec-driven being the end state. It feels a bit too waterfall for how fast things move. That said, it’s a solid pattern for documentation and grounding the model upfront.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.lastweekinaws.com/blog/s3-is-not-a-filesystem-but-now-theres-one-in-front-of-it?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;S3 Is Not a Filesystem (But Now There&#39;s One In Front of It)&lt;/a&gt;&lt;br /&gt;
Corey Quinn still says S3 isn&#39;t a filesystem, but he believes layering a real one in front of it actually makes sense. He also says the pricing is pretty reasonable for what you get, which means a lot coming from him.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://claude.com/blog/the-advisor-strategy?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;The advisor strategy: Give Sonnet an intelligence boost with Opus&lt;/a&gt;&lt;br /&gt;
This is a really smart cost pattern. Let the expensive model do the hard thinking once, then have the cheaper models following instructions, which they&#39;re surprisingly good at.&lt;/p&gt;
&lt;h3&gt;Podcasts, Videos, and more&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=u3GjIXP9N0s?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Distinguished Eng: Learning From 3000 Incidents And How Engineering Is Changing | Marc Brooker&lt;/a&gt;&lt;br /&gt;
Marc Brooker shares some really interesting insights here on finding impactful engineering problems and system design patterns that actually hold up under real-world pressure. His advice for both junior and senior engineers navigating the AI shift is worth your time for a listen.&lt;/p&gt;
&lt;h3&gt;New from AWS&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/cloudwatch-logs-insights-query-params?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudWatch Logs Insights now supports saved queries with parameters&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/spring-ai-sdk-for-amazon-bedrock-agentcore-is-now-generally-available?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Spring AI SDK for Amazon Bedrock AgentCore is now Generally Available&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-opensearch-serverless-supports-derived-source?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon OpenSearch Serverless now supports Derived Source for storage optimization&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/bedrock-data-automation-vocabulary?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Bedrock Data Automation now supports custom vocabulary&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-redshift-topk-optimization?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Redshift introduces key performance optimization for Top-K queries&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/opensearch-managed-prometheus-agent-tracing?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon OpenSearch Service supports Managed Prometheus and agent tracing&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/s3-lifecycle-pauses-actions-on-objects?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon S3 Lifecycle pauses actions on objects that are unable to replicate&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-cloudwatch-pipelines-conditional?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudWatch pipelines now supports drop and conditional processing&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/cloudwatch-pipelines-compliance-governance?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudWatch pipelines introduces new compliance and governance capabilities&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/aws-private-ca-customer-managed-permissions?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Private CA now supports customer managed permissions for cross-account sharing&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/aws-billing-and-cost-management-dashboards-scheduled-email-delivery?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Billing and Cost Management Dashboards Now Supports Scheduled Email Delivery&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-opensearch-serverless-supports-zstandard-index-compression?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon OpenSearch Serverless now supports Zstandard (zstd) codec for index compression&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/rds-blue-green-proxy?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon RDS Blue/Green Deployments now supports Amazon RDS Proxy&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/oracle-database-aws-available-twelve-regions?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23361&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Oracle Database@AWS is now available in twelve AWS Regions&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Final Thoughts 🤔&lt;/h3&gt;
&lt;p&gt;It’s getting a lot easier to break things.&lt;/p&gt;
&lt;p&gt;Between models that can uncover vulnerabilities faster than ever, agents operating with increasing autonomy, and systems that were never designed for this kind of pressure, the attack surface is expanding in real time. The tools are getting smarter, but so are the ways they can be used against us.&lt;/p&gt;
&lt;p&gt;And we’re still figuring out how to keep up with it.&lt;/p&gt;
&lt;p&gt;But there are signs the industry is adjusting.&lt;/p&gt;
&lt;p&gt;We’re seeing better visibility into AI spend, more structured ways to manage and govern agents, and even isolated environments becoming the default for running them. There’s a growing recognition that if agents are going to act on our behalf, we need systems that can track them, constrain them, and clean up after them when things go sideways.&lt;/p&gt;
&lt;p&gt;Even S3 Files is part of that story. Not because it changes what S3 is, but because it makes powerful primitives easier to use, observe, and secure correctly.&lt;/p&gt;
&lt;p&gt;It’s not clean yet, and it’s definitely not solved.&lt;/p&gt;
&lt;p&gt;But the direction is clear. Smarter systems, stronger boundaries, and a shift toward building with the assumption that things will break.&lt;/p&gt;
&lt;p&gt;Because they most definitely will.&lt;/p&gt;
&lt;p&gt;See you next week,&lt;br /&gt;
Jeremy&lt;/p&gt;
&lt;hr /&gt;
&lt;p&gt;I hope you enjoyed this newsletter. We&#39;re always looking for ideas and feedback to make it better and more inclusive, so please feel free to reach out to me via &lt;a href=&quot;https://bsky.app/profile/jeremydaly.com&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Bluesky&lt;/a&gt;, &lt;a href=&quot;http://www.linkedin.com/in/jeremydaly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;LinkedIn&lt;/a&gt;, &lt;a href=&quot;http://twitter.com/jeremy_daly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;X&lt;/a&gt;, or &lt;a href=&quot;mailto:contact@jeremydaly.com&quot;&gt;email&lt;/a&gt;.&lt;/p&gt;
</content>
  </entry>
  <entry>
    <title>Issue #360: Anthropic’s Wardrobe Malfunction 🫣</title>
    <link href="https://offbynone.io/issues/360/"/>
    <updated>2026-04-07T12:00:00Z</updated>
    <summary>In this issue, Claude leaks its secrets, AWS locks down S3, and Cloudflare makes a run at WordPress.</summary>
    <id>https://offbynone.io/issues/360/</id>
    <content type="html">&lt;h2&gt;Anthropic’s Wardrobe Malfunction 🫣&lt;/h2&gt;
&lt;p&gt;In our &lt;a href=&quot;https://offbynone.io/issues/359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;previous issue&lt;/a&gt;, App Runner got sent to maintenance mode, agents started looking like real software, and efficiency may have become the new AI race. This week, Claude leaks its secrets, AWS locks down S3, and Cloudflare makes a run at WordPress. Plus, we&#39;ve got a whole bunch of amazing content from the cloud, serverless, and AI communities.&lt;/p&gt;
&lt;h3&gt;News &amp;amp; Announcements&lt;/h3&gt;
&lt;p&gt;Anthropic is having &lt;em&gt;a month&lt;/em&gt;. Their &lt;em&gt;very&lt;/em&gt; public &lt;a href=&quot;https://techcrunch.com/2026/03/31/anthropic-is-having-a-month?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude Code leak&lt;/a&gt; gave us an unexpected peek under the hood. Lots to learn from it, including their concept of “skeptical memory” and how it maps to patterns emerging in agent design. Not exactly how you want transparency to happen, but still… fascinating.&lt;/p&gt;
&lt;p&gt;AWS is continuing to tighten the screws on security with &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/s3-default-bucket-security-setting?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;S3 rolling out new default bucket protections&lt;/a&gt; they announced last year. I&#39;m sure this will break someone&#39;s CI/CD pipeline, but if this prevents another “oops, we exposed millions of records to the internet” incident, it’s worth it.&lt;/p&gt;
&lt;p&gt;On the AI safety front, &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/bedrock-guardrails-cross-account-safeguards?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Bedrock Guardrails now support cross-account safeguards (GA)&lt;/a&gt;, which is a big step toward centralized governance of AI systems. The deeper dive on &lt;a href=&quot;https://aws.amazon.com/blogs/aws/amazon-bedrock-guardrails-supports-cross-account-safeguards-with-centralized-control-and-management?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;how cross-account guardrails work&lt;/a&gt; is worth a read. This is the kind of thing enterprises actually need: consistent policies, enforced everywhere, without duct-taping controls into every individual service.&lt;/p&gt;
&lt;p&gt;AWS also introduced &lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/aws-launches-frontier-agents-for-security-testing-and-cloud-operations?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;frontier agents for security testing and cloud operations&lt;/a&gt;, which is both exciting and mildly terrifying. Letting agents poke at your infrastructure to find weaknesses sounds great… assuming they don’t introduce a few outages along the way. 😬 Given AWS’s recent adventures with agentic coding tools taking down services, let&#39;s hope they spent a few extra human cycles on this one.&lt;/p&gt;
&lt;p&gt;Observability keeps evolving too. &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-cloudwatch-opentelemetry-metrics?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;CloudWatch adding OpenTelemetry metrics support&lt;/a&gt; is actually a big deal. Standardizing metrics across systems has been messy for years, and leaning into OpenTelemetry makes it a lot easier to build portable, vendor-agnostic observability pipelines.&lt;/p&gt;
&lt;p&gt;And speaking of observability, &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/opensearch-agentic-ai-log-analytics-observability?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;OpenSearch introduced agentic AI for log analytics&lt;/a&gt;, another signal that logs are no longer just something you query, they’re something you collaborate with. Expect more systems where you ask questions and let agents do the digging.&lt;/p&gt;
&lt;p&gt;On the security side, this one’s a bit of a nightmare: &lt;a href=&quot;https://venturebeat.com/security/axios-npm-supply-chain-attack-rat-maintainer-token-2026?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;a trojan slipped into axios via an npm supply chain attack&lt;/a&gt;. Axios is everywhere, which makes this especially concerning. Friendly reminder to pin your dependencies, use proper scanning tools, and maybe don’t blindly trust package libraries.&lt;/p&gt;
&lt;p&gt;Cloudflare introduced &lt;a href=&quot;https://blog.cloudflare.com/emdash-wordpress?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;EmDash, as a spiritual successor to WordPress&lt;/a&gt;. It’s positioning itself as a more secure, modern alternative to the plugin-heavy ecosystem we’ve all wrestled with. WordPress killer? Maybe. Or at least another sign that the PHP-era assumptions about how we build content systems are starting to fade.&lt;/p&gt;
&lt;p&gt;And finally, Luc van Donkersgoed announced &lt;a href=&quot;https://lucvandonkersgoed.com/2026/03/31/introducing-semantic-content-search-to-aws-news?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;semantic content search for AWS News&lt;/a&gt;. Good stuff.&lt;/p&gt;
&lt;h3&gt;Tutorials&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://darryl-ruggles.cloud/amazon-aurora-dsql-a-practical-guide-to-awss-distributed-sql-database?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Aurora DSQL: A Guide to AWS&#39;s Distributed SQL DB&lt;/a&gt; by Darryl Ruggles&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/building-real-time-conversational-podcasts-with-amazon-nova-2-sonic?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Building real-time conversational podcasts with Amazon Nova 2 Sonic&lt;/a&gt; by Madhavi Evana&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/aws-heroes/serverless-applications-on-aws-with-lambda-using-java-25-api-gateway-and-dynamodb-part-4-45i8?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless applications on AWS with Lambda using Java 25, API Gateway and DynamoDB - Part 4 SnapStart + DynamoDB request priming&lt;/a&gt; by Vadym Kazulkin&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/database/stream-live-data-from-amazon-keyspaces-to-s3-vector-for-real-time-ai-applications?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Stream live data from Amazon Keyspaces to S3 vector for real time AI applications&lt;/a&gt; by Siva Palli&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/database/working-with-identity-columns-and-sequences-in-aurora-dsql?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Working with identity columns and sequences in Aurora DSQL&lt;/a&gt; by Arnab Chowdhury&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/franckpachot/the-single-duality-view-pattern-can-sqljson-preserve-aggregate-locality-4ifd?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Single-Cluster Duality View&lt;/a&gt; by Franck Pachot&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/aws-builders/add-chat-ai-summary-using-amazon-bedrock-and-http-response-streaming-522h?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Add Chat AI Summary Using Amazon Bedrock and HTTP Response Streaming&lt;/a&gt; by Marko Djakovic&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/simulate-realistic-users-to-evaluate-multi-turn-ai-agents-in-strands-evals?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Simulate realistic users to evaluate multi-turn AI agents in Strands Evals&lt;/a&gt; by Ishan Singh&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/control-which-domains-your-ai-agents-can-access?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Control which domains your AI agents can access&lt;/a&gt; by Evandro Franco&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/build-a-finops-agent-using-amazon-bedrock-agentcore?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Build a FinOps agent using Amazon Bedrock AgentCore&lt;/a&gt; by Salman Ahmed&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/build-reliable-ai-agents-with-amazon-bedrock-agentcore-evaluations?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Build reliable AI agents with Amazon Bedrock AgentCore Evaluations&lt;/a&gt; by Akarsha Sehwag&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://blog.stackademic.com/stop-losing-your-screenshots-build-a-personal-line-summarization-and-search-bot-with-aws-and-b9c56c5f0869?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Stop Losing Your Screenshots: Build a Personal LINE Summarization and Search Bot with AWS and Claude&lt;/a&gt; by Hiro O.&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Reads&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://www.theserverlessmindset.com/p/how-i-use-ai-every-day-without-losing?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;How I Use AI Every Day Without Losing My Mind&lt;/a&gt;&lt;br /&gt;
Marco Troisi shares three principles for using AI coding tools without experiencing developer burnout: stay actively engaged in the code, maintain focus on customer needs, and reject multitasking for focused single-task work. His approach rightly emphasizes treating AI as a collaborator rather than letting it automate everything.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://claude.com/blog/harnessing-claudes-intelligence?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Build Smarter AI Apps with Claude | 3 Key Patterns&lt;/a&gt;&lt;br /&gt;
Love this take on agent harness design from Anthropic. Their point about assumptions growing stale as Claude improves is spot on. I&#39;ve definitely found myself over-engineering orchestration layers for things Claude can now handle directly, and it&#39;s a good reminder to periodically question what actually needs to live in the harness.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://ranthebuilder.cloud/blog/ai-made-everyone-a-builder-and-thats-a-problem?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AI Made Everyone a Builder and That&#39;s a Problem&lt;/a&gt;&lt;br /&gt;
Ran Isenberg shares his thoughts on the unintended consequences of AI-powered development. The point about AI making it easy to ship demos but hard to maintain production systems is something I see all the time. And I&#39;ve definitely seen the explosion of half-baked open source projects inadvertently name-squatting. 🤨&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://sethorell.substack.com/p/proudly-found-elsewhere?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Proudly Found Elsewhere&lt;/a&gt;&lt;br /&gt;
Great post by Seth Orell on embracing managed services over building everything yourself. I completely agree with his litmus test: only build what&#39;s a distinguishing, separately marketable feature of your business, otherwise find a provider. This is exactly the kind of thinking that makes serverless architectures so compelling (even if we&#39;re still figuring out where the boundaries are).&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.lindamohamed.com/blog/how-I-built-an-AI-film-crew-on-aws?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;How I Built an AI Film Crew on AWS&lt;/a&gt;&lt;br /&gt;
Linda Mohamed, one of my favorite AWS Heroes, built an AI-powered video editing system using Step Functions to orchestrate MediaConvert, Rekognition, Transcribe, and Bedrock agents. This post is loaded with deep technical expertise and demonstrates a production-grade implementation, not just concepts.&lt;/p&gt;
&lt;h3&gt;Podcasts, Videos, and more&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=j5URBon7YiU?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Lambda Performance Tuning | Serverless Office Hours&lt;/a&gt;&lt;br /&gt;
Julian Wood hosts Matt Diamond and Paras Jain for a deep dive into Lambda performance optimization. The session covers configuration tuning, memory settings, and initialization best practices with practical examples for improving function speed and efficiency.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=BEYav6kl0vM?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Building Real-Time Applications at Scale | Serverless Office Hours&lt;/a&gt;&lt;br /&gt;
Brian Zambrano and Kim Wendt join Eric Johnson to show essential strategies for managing high-volume connection requests, implementing effective filtering patterns, and handling event distribution at scale.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=gsLkVmkkOxc?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Stop Your Bad Lambda Deployments Before They Hit Production&lt;/a&gt;&lt;br /&gt;
Excellent hands-on walkthrough by James Eastham showing how to implement automated rollbacks with Lambda aliases and versions. The practical CodeDeploy setup he demonstrates is exactly the kind of safety net your serverless applications need.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.lennysnewsletter.com/p/an-ai-state-of-the-union?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;An AI state of the union: We’ve passed the inflection point, dark factories are coming, and automation timelines | Simon Willison&lt;/a&gt;&lt;br /&gt;
Lenny Rachitsky chats with Simon Willison about the practical patterns for what he calls &amp;quot;agentic engineering&amp;quot; and explains why mid-career engineers might face more risk than juniors or seniors. The conversation covers everything from security challenges with prompt injection to how pelicans on bicycles became an unofficial AI quality benchmark. I also agree that deciding what to build is the new bottleneck.&lt;/p&gt;
&lt;h3&gt;New from AWS&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-sagemaker-unified-studio?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon SageMaker Unified Studio adds notebook import/export and developer acceleration features&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/AWS-Cost-Explorer-Natural-Language-Query?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Cost Explorer launches Natural Language Query capabilities powered by Amazon Q&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/smithy-java-client-framework?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS announces general availability of Smithy-Java client framework&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-ecs-managed-daemons?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon ECS announces Managed Daemons for ECS Managed Instances&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-verified-permissions-policy-store?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Verified Permissions now supports policy store aliases and named policies and policy templates&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-cloudwatch-cloudfront-enablement?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudWatch expands auto-enablement to Amazon CloudFront logs and 3 additional resource types&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-cloudfront-sha-256-signed-urls?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudFront now supports SHA-256 for signed URLs and signed cookies&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/aws-secrets-manager-console-custom-kms-key-input?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Secrets Manager console now supports custom input for AWS KMS keys&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/s3-vectors-expands-17-regions?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon S3 Vectors expands to 17 additional AWS Regions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-emr-spark-troubleshooting-upgrade-kiro-power?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Apache Spark troubleshooting and upgrade agents now available as Kiro powers&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-elasticache-serverless-ipv6-dual-stack?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon ElastiCache Serverless now supports IPv6 and dual stack connectivity&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/aws-organizations-paths-api-responses?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Organizations now provides organization paths in API responses&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/04/ses-mail-manager-introduces-new-features?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon SES Mail Manager adds new features for enhanced security and email processing&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Developer Tools&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://www.reddit.com/r/serverless/comments/1sbeuw4/i_profiled_every_require_in_our_lambda_handler?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;yetanotheraryan/coldstart&lt;/a&gt; by Aryan Tiwari&lt;br /&gt;
&lt;code&gt;coldstart&lt;/code&gt; is a zero-dependency startup profiler for Node.js that instruments CommonJS and ESM startup loading, reconstructs the dependency tree, and points at the modules that actually slow boot time down.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://x.com/kevingu/status/2039843234760073341?s=12&amp;t=OQY7mkl3UskUDm8ylE0fVw?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AutoAgent: first open source library for self-optimizing agents&lt;/a&gt; by Kevin Gu&lt;br /&gt;
AutoAgent is an open source library where a meta-agent autonomously optimizes a task agent by tweaking prompts, adding tools, and refining orchestration. Kevin Gu reports it achieved top leaderboard scores on spreadsheet and terminal benchmarks after 24+ hours of iterative self-improvement.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://dev.to/martinhicks/introducing-dynoxide-a-fast-embeddable-dynamodb-engine-15ni?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23360&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Introducing Dynoxide: a fast, embeddable DynamoDB engine&lt;/a&gt; by Martin Hicks&lt;br /&gt;
Dynoxide is a new DynamoDB-compatible engine written in Rust and backed by SQLite, designed as a fast, embeddable alternative to DynamoDB Local.&lt;/p&gt;
&lt;h3&gt;Final Thoughts 🤔&lt;/h3&gt;
&lt;p&gt;It’s been a rough few weeks for “secure by default.”&lt;/p&gt;
&lt;p&gt;Between accidental leaks, supply chain attacks, and the constant stream of “oops” moments, it’s clear that the pace of building is still outpacing the guardrails meant to keep things in check. Even the companies building the tools are learning these lessons in real time.&lt;/p&gt;
&lt;p&gt;But there’s a flip side to all of this.&lt;/p&gt;
&lt;p&gt;You can see the industry responding. S3 locking things down by default. Centralized guardrails for AI systems. Better observability standards. Even agents being tasked with finding vulnerabilities before humans do. It’s messy, but it’s progress.&lt;/p&gt;
&lt;p&gt;We’re not slowing down, so the only real option is to get better at building systems that can keep up with us.&lt;/p&gt;
&lt;p&gt;And maybe that’s where things are heading. Not just faster development, but safer defaults, stronger guardrails, and tooling that assumes mistakes will happen and is ready for them when they do.&lt;/p&gt;
&lt;p&gt;See you next week,&lt;br /&gt;
Jeremy&lt;/p&gt;
&lt;p&gt;I hope you enjoyed this newsletter. We&#39;re always looking for ideas and feedback to make it better and more inclusive, so please feel free to reach out to me via &lt;a href=&quot;https://bsky.app/profile/jeremydaly.com&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Bluesky&lt;/a&gt;, &lt;a href=&quot;http://www.linkedin.com/in/jeremydaly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;LinkedIn&lt;/a&gt;, &lt;a href=&quot;http://twitter.com/jeremy_daly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;X&lt;/a&gt;, or &lt;a href=&quot;mailto:contact@jeremydaly.com&quot;&gt;email&lt;/a&gt;.&lt;/p&gt;
</content>
  </entry>
  <entry>
    <title>Issue #359: So Long App Runner 🪦</title>
    <link href="https://offbynone.io/issues/359/"/>
    <updated>2026-03-31T12:00:00Z</updated>
    <summary>In this issue, App Runner heads to maintenance mode, agents start looking like real software, and efficiency might become the new AI race.</summary>
    <id>https://offbynone.io/issues/359/</id>
    <content type="html">&lt;h2&gt;So Long App Runner 🪦&lt;/h2&gt;
&lt;p&gt;In our &lt;a href=&quot;https://offbynone.io/issues/358&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;previous issue&lt;/a&gt;, AWS gave agents real-time streaming and shell access, MCP was thought to be dead, and the distance between idea and deployment continued to shrink while quality became optional. This week, App Runner heads to maintenance mode, agents start looking like real software, and efficiency might become the new AI race. Plus, there&#39;s lots of awesome cloud, serverless, and AI content from the community.&lt;/p&gt;
&lt;h3&gt;News &amp;amp; Announcements&lt;/h3&gt;
&lt;p&gt;AWS finally did it. 😢 &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/aws-service-availability?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;App Runner is heading into maintenance mode&lt;/a&gt; on April 30, 2026. It had its problems, but the free load balancer was hard to beat. Now it looks like the future belongs to &lt;a href=&quot;https://docs.aws.amazon.com/apprunner/latest/dg/apprunner-availability-change.html&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon ECS Express Mode&lt;/a&gt; (which requires a separate load balancer). If you’ve been using it as a trusted platform layer, and chances are you weren&#39;t, now might be a good time to reassess.&lt;/p&gt;
&lt;p&gt;Despite the latest deprecations, AWS continues to compress the distance between idea and execution. You can now &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/amazon-aurora-postgresql-database?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;spin up and connect to an Amazon Aurora PostgreSQL database in seconds&lt;/a&gt;, which feels less like infrastructure provisioning and more like an on-demand service. Combine that with &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/lambda-32-gb-memory-16-vcpus?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Lambda Managed Instances scaling up to 32 GB of memory and 16 vCPUs&lt;/a&gt; plus a new &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/aws-Lambda-file-descriptors-increase-4096?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;file descriptor limit of 4,096&lt;/a&gt;, and LMIs start looking like more than just a graduation path.&lt;/p&gt;
&lt;p&gt;Step Functions also picked up &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/aws-step-functions-sdk-integrations?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;28 new service integrations, including Amazon Bedrock AgentCore&lt;/a&gt;, a clear signal that agent orchestration is becoming a first-class citizen. That pairs nicely with AgentCore’s other evolutions: &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/bedrock-agentcore-runtime-session-storage?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;managed session storage for persistent agent filesystem state&lt;/a&gt; and &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/agentcore-evaluations-generally-available?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;general availability of AgentCore Evaluations&lt;/a&gt;. Agents aren’t just stateless prompt loops anymore. They’re getting memory, persistence, and scoring systems that start to look like actual software systems.&lt;/p&gt;
&lt;p&gt;On the model side, &lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/palmyra-vision-7b-writer-bedrock?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Palmyra Vision 7B is now available on Amazon Bedrock&lt;/a&gt;, adding more multimodal options to the growing Bedrock ecosystem. And if you’re building voice interfaces (like I am right now), &lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/introducing-amazon-polly-bidirectional-streaming-real-time-speech-synthesis-for-conversational-ai?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Polly’s new bidirectional streaming&lt;/a&gt; brings real-time conversational synthesis into the mix. Adding voice interactions will change your life.&lt;/p&gt;
&lt;p&gt;Outside of AWS, efficiency is the new AI battleground. Google’s &lt;a href=&quot;https://research.google/blog/turboquant-redefining-ai-efficiency-with-extreme-compression?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;TurboQuant research&lt;/a&gt; pushes model compression to extremes, hinting that smaller, cheaper models running locally might end up being more important than the cloud-based ones. Meanwhile, developer workflows keep shifting as tools like &lt;a href=&quot;https://claude.com/blog/auto-mode?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Auto mode for Claude Code&lt;/a&gt; lean further into autonomous execution, and enterprises get more guardrails with the &lt;a href=&quot;https://claude.com/blog/claude-platform-compliance-api?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Claude Compliance API&lt;/a&gt;. These seem like small steps, but they are filling lots of gaps.&lt;/p&gt;
&lt;p&gt;And I can&#39;t believe I missed this a few weeks ago! &lt;a href=&quot;https://www.dash0.com/blog/dash0-acquires-lumigo-to-expand-agentic-observability-across-aws-and-serverless?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Dash0 is acquiring Lumigo&lt;/a&gt;, making a bet that “agentic observability” is going to be its own category. You can’t run fleets of semi-autonomous systems without knowing what they’re doing, and traditional logs and traces aren’t going to cut it.&lt;/p&gt;
&lt;p&gt;In case you missed it, and given the download numbers, you probably did, &lt;a href=&quot;https://www.fastcompany.com/91515523/openai-sora-never-understood-social-media?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Sora is shutting down&lt;/a&gt;. It&#39;s a good reminder that as fake and curated as social media may be, there is a limit to the amount of AI slop users are willing to take.&lt;/p&gt;
&lt;p&gt;Finally, a bit of the end of an era as the &lt;a href=&quot;https://loige.co/farewell-fullstack-bulletin?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;FullStack Bulletin bids farewell&lt;/a&gt;. Newsletters come and go, but the good ones reshape how we think about this space.&lt;/p&gt;
&lt;h3&gt;Tutorials&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/developer/upgrading-aws-cli-from-v1-to-v2-using-the-migration-tool?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Upgrading AWS CLI From v1 to v2 Using the Migration Tool&lt;/a&gt; by Ahmed Moustafa&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/building-age-responsive-context-aware-ai-with-amazon-bedrock-guardrails?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Building age-responsive, context-aware AI with Amazon Bedrock Guardrails&lt;/a&gt; by Pradip Kumar Pandey&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/machine-learning/accelerating-custom-entity-recognition-with-claude-tool-use-in-amazon-bedrock?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Accelerating custom entity recognition with Claude tool use in Amazon Bedrock&lt;/a&gt; by Kimo El Mehri&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://darryl-ruggles.cloud/aws-lambda-now-knows-where-it-lives-exploring-the-new-az-metadata?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Lambda Now Knows Where It Lives: Explore New AZ Metadata&lt;/a&gt; by Darryl Ruggles&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/aws-heroes/serverless-applications-on-aws-with-lambda-using-java-25-api-gateway-and-dynamodb-part-3-2h31?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless applications on AWS with Lambda using Java 25, API Gateway and DynamoDB - Part 3 Introducing Lambda SnapStart&lt;/a&gt; by Vadym Kazulkin&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/aws-heroes/serverless-applications-on-aws-with-lambda-using-java-25-api-gateway-and-aurora-dsql-part-3-4ep2?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless applications on AWS with Lambda using Java 25, API Gateway and Aurora DSQL - Part 3 Introducing Lambda SnapStart&lt;/a&gt; by Vadym Kazulkin&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/dhananjay_lakkawar/the-15-millisecond-ai-building-pre-cognitive-edge-caching-on-aws-ad7?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;The 15-Millisecond AI: Building &amp;quot;Pre-Cognitive&amp;quot; Edge Caching on AWS&lt;/a&gt; by Dhananjay Lakkawar&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/kota_sasaki/real-time-streaming-from-claude-code-on-agentcore-to-the-browser-using-appsync-event-api-1l65?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Real-Time Streaming from Claude Code on AgentCore to the Browser Using AppSync Event API&lt;/a&gt; by Kota&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/aws-heroes/serverless-applications-on-aws-with-lambda-using-java-25-api-gateway-and-aurora-dsql-part-2-4bbb?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Serverless applications on AWS with Lambda using Java 25, API Gateway and Aurora DSQL - Part 2 Initial performance measurements&lt;/a&gt; by Vadym Kazulkin&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://dev.to/gunnargrosch/multi-agent-systems-on-aws-lambda-with-durable-functions-2gg3?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Multi-Agent Systems on AWS Lambda with Durable Functions&lt;/a&gt; by Gunnar Grosch&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://ranthebuilder.cloud/blog/prevent-sensitive-data-leaks-in-amazon-cloudwatch-logs?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Prevent Sensitive Data Leaks in Amazon CloudWatch Logs&lt;/a&gt; by Yaron Ben Ezra&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Reads&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://aws.amazon.com/blogs/architecture/architecting-for-agentic-ai-development-on-aws?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Architecting for agentic AI development on AWS&lt;/a&gt;&lt;br /&gt;
Alan Oberto Jimenez covers architectural patterns for AI agents in development workflows. They walk through system architecture approaches (local emulation, hybrid testing, preview environments) and codebase patterns (domain-driven design, layered testing) that enable agents to autonomously write, test, and deploy code with rapid validation cycles.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.linkedin.com/pulse/agents-dont-know-what-good-looks-like-thats-exactly-luca-mezzalira-sgwte?trackingId=0FCRL+3WRLucg6g1dgh0BQ==?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Agents don&#39;t know what good looks like. And that&#39;s exactly the problem.&lt;/a&gt;&lt;br /&gt;
Luca Mezzalira reflects on a recent Neal Ford and Sam Newman fireside chat about AI agents. This is an extremely well-written and enlightening post that deserves your attention.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.readysetcloud.io/blog/allen.helton/your-ai-agents-are-a-security-nightmare?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Your AI agents are a security nightmare&lt;/a&gt;&lt;br /&gt;
Great post by Allen Helton on treating AI agents as first-class principals with proper identity management. His approach using Teleport for session-bound AWS credentials makes a lot of sense, especially the parallel he draws to how we botched security with early Lambda deployments.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://caylent.com/blog/the-heirloom-syntax-why-ai-monocultures-threaten-the-future-of-innovation?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;The Heirloom Syntax: Why AI Monocultures Threaten the Future of Innovation&lt;/a&gt;&lt;br /&gt;
Great post from Brian Tarbox on preserving &amp;quot;systemic diversity&amp;quot; in an age of AI commoditization. I totally agree that our value as technologists isn&#39;t in content volume anymore (machines won that game), but in maintaining the unique voice and insights that come from actual experience in the trenches.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://blog.cloudflare.com/workflow-diagrams?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;How we use Abstract Syntax Trees (ASTs) to turn Workflows code into visual diagrams&lt;/a&gt;&lt;br /&gt;
André Venceslau explains how Cloudflare built a diagram system that parses workflow code using ASTs. Great deep dive if you want to understand the technical challenge of statically analyzing minified JavaScript to track Promise relationships and parallel execution, then rendering those into visual diagrams that show execution order and branching. Cool stuff.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://medium.com/@harshit27062002/why-i-collapsed-50-aws-lambdas-into-a-single-api-55624f7f5d0d?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Why I Collapsed 50+ AWS Lambdas Into a Single API&lt;/a&gt;&lt;br /&gt;
Harshit walks through consolidating 50+ Lambda functions into a single API to address cold start multiplication and VPC ENI exhaustion. I&#39;ve built A LOT of serverless APIs over the years, and I&#39;m a big proponent of this approach. There are still good reasons for single-purpose functions, but a Lambdalith using Hono for internal routing is extremely easy to reason about.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.fastcompany.com/91513823/ai-is-creating-the-first-generation-of-cognitively-outsourced-humans?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AI is creating the first generation of cognitively outsourced humans&lt;/a&gt;&lt;br /&gt;
The cognitive offloading problem we&#39;re facing with generative AI is definitely real. Confusing fluent AI output with actual understanding is where most people get fooled (and even worse if they believe its sycophancy), and I totally agree that we need to treat AI as a tool to sharpen judgment, not to replace it.&lt;/p&gt;
&lt;h3&gt;Podcasts, Videos, and more&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://www.youtube.com/watch?v=foYaB6_hd8w?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS MCP server | Serverless Office Hours&lt;/a&gt;&lt;br /&gt;
Praneeta Prakash and Claudiu Popa join Julian Wood to explore the AWS MCP Server. It combines natural language interfaces with AWS best practices through Agent SOPs. You&#39;ll see practical examples of provisioning infrastructure, troubleshooting, and building Lambda durable functions while maintaining IAM controls throughout.&lt;/p&gt;
&lt;h3&gt;New from AWS&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/amazon-athena-adding-cap-reservation-regions?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Athena launches Capacity Reservations in additional regions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/cloudwatch-centralization-datasource?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudWatch now supports multi-account and region log centralization based on data source&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/access-cluster-insights-opensearch?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Access Cluster Insights through the Amazon OpenSearch Service Console and Amazon EventBridge events&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/amazon-cloudwatch-infrequent-access-log-class?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudWatch Logs now supports data protection, OpenSearch PPL and OpenSearch SQL for the Infrequent Access ingestion class&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/amazon-timestream-for-influxdb-advanced-metrics?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Timestream for InfluxDB Now Supports Advanced Metrics&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/aws-jdbc-caching-with-valkey?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;The AWS Advanced JDBC Wrapper now supports automatic query caching with Valkey&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/aurora-dsql-connector-for-ruby?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Aurora DSQL launches connector that simplifies building Ruby applications&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/amazon-managed-service-flink-2-2?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Managed Service for Apache Flink now supports Apache Flink 2.2&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/transform-custom-new-codebase-analysis?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Transform custom announces general availability of automated codebase analysis&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/transform-custom-new-transformations?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Transform custom introduces new AWS-managed transformations to modernize code at scale&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/amazon-bedrock-agentcore-browser-policies-root-ca?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon Bedrock AgentCore adds support for Chrome policies and custom root CA&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/aws-security-agent-ondemand-penetration?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS Security Agent on-demand penetration testing is now generally available&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/aws-devops-agent-generally-available?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;AWS DevOps Agent is now generally available&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/cloudwatch-lookup-query-command?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Amazon CloudWatch Logs introduces lookup query command&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://aws.amazon.com/about-aws/whats-new/2026/03/aurora-dsql-rust-npgsql-connectors?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Aurora DSQL launches new connectors that simplify building .NET and Rust applications&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Developer Tools&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;https://github.com/gunnargrosch/durable-viz?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;gunnargrosch/durable-viz&lt;/a&gt;&lt;br /&gt;
Durable-viz parses AWS Lambda Durable Functions code to generate Mermaid flowcharts without requiring deployment. The tool supports TypeScript, JavaScript, Python, and Java, detecting all SDK primitives like step, invoke, parallel, map, and wait. It&#39;s available as an NPM package or VS Code extension with features like click-to-navigate and PNG export.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://dev.to/aws-builders/how-i-search-10000-aws-github-repos-in-10-seconds-4hi2?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;How I Search 10,000+ AWS GitHub Repos in 10 Seconds&lt;/a&gt; by Ajit&lt;br /&gt;
Ajit built a search engine that indexes over 10,000 AWS GitHub repositories using hybrid search (BM25 for exact matches, FAISS for semantic understanding). The system auto-indexes twice daily via EventBridge and uses Bedrock to classify repos across 22 metadata dimensions.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://heeki.medium.com/introducing-loom-an-agent-platform-66e7db019cdb?utm_source=newsletter&amp;utm_medium=email&amp;utm_content=offbynone&amp;utm_campaign=Off-by-none%3A%20Issue%20%23359&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Introducing Loom, an agent platform&lt;/a&gt; by Heeki Park&lt;br /&gt;
Heeki Park&#39;s Loom is an opinionated agent platform built on AWS that integrates Strands Agents SDK, Bedrock, and AgentCore with enterprise guardrails. The platform uses configuration-driven agent creation with mandatory tagging and RBAC/ABAC controls, avoiding runtime code generation for security.&lt;/p&gt;
&lt;h3&gt;Final Thoughts 🤔&lt;/h3&gt;
&lt;p&gt;It feels like infrastructure is starting to shift in a more fundamental way. Not just bolting AI onto existing systems, but weaving it into everything: orchestration, storage, compute, observability. Agents with memory, evaluation loops, persistent state. Voice interfaces that feel real-time. Even the way we think about databases and execution environments is starting to change.&lt;/p&gt;
&lt;p&gt;But we might be getting ahead of ourselves.&lt;/p&gt;
&lt;p&gt;Luca’s point that agents don’t actually know what “good” looks like, and Brian’s argument for preserving systemic diversity instead of letting AI push everything toward a monoculture, are worth paying attention to. If every system is built the same way, trained on the same data, and optimized for the same outputs, we’re not really innovating, we’re just converging.&lt;/p&gt;
&lt;p&gt;The stack is evolving fast. The question is whether we’re shaping it, or just letting it shape us.&lt;/p&gt;
&lt;p&gt;See you next week,&lt;br /&gt;
Jeremy&lt;/p&gt;
&lt;p&gt;I hope you enjoyed this newsletter. We&#39;re always looking for ideas and feedback to make it better and more inclusive, so please feel free to reach out to me via &lt;a href=&quot;https://bsky.app/profile/jeremydaly.com&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;Bluesky&lt;/a&gt;, &lt;a href=&quot;http://www.linkedin.com/in/jeremydaly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;LinkedIn&lt;/a&gt;, &lt;a href=&quot;http://twitter.com/jeremy_daly&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;X&lt;/a&gt;, or &lt;a href=&quot;mailto:contact@jeremydaly.com&quot;&gt;email&lt;/a&gt;.&lt;/p&gt;
</content>
  </entry>
</feed>