Skip to content

Richard Seroter's Architecture Musings

About
Daily Reading List
Contact Me

Author: Richard Seroter

Daily Reading List – June 11, 2026 (#803)

I think every one of my dozen meetings today was in a different room (and floor) in the office. That’s not a brag, although my step count today is outstanding.

[article] Your AI strategy has a trust problem, not a tooling problem. Terrific post. Most corporate structures are designed to slow you down, not speed you up. That’s on purpose. Making the needed changes is hard.

[blog] How to unlock true ROI in software development – a deep dive into the latest DORA research. Three good insights here. If you’re struggling to define ROI or you’re in the trough of high cost with no return, read this.

[blog] Loop Engineering. It’s the hot topic this week. No doubt it has a place in certain scenarios. I’m not sold (or want to be sold) that this is how engineering should be moving forward.

[article] Loop Engineering: Design the System That Prompts Agents. Got it. I wonder how “big” a loop should be. If we’re talking small PR-style loops, maybe. If we’re talking entire software construction kicked off by a user-defined “goal”, that seems non-serious to me.

[aricle] The Anthropic leader who built Claude Code says he ditched prompting — now he just writes loops. Last one specifically about loops, I promise. But it’s good to know what the discussion is all about.

[blog] How Gemini Managed Agents Works under the Hood. This is an implementation of loop engineering! You ask for something, and kick off a background process where your intent is turned into a plan that an agent loops on until completion.

[article] Employees spend more time managing AI than producing work. Makes sense. The work is changing. That doesn’t mean the current state is the permanent one.

[article] Engineering leadership lessons from LDX3 2026. Read some takeaways on newly-amplified friction, hiring, and management changes.

[blog] Report: GKE Inference Gateway delivers up to 92% faster AI responses. No comparison if you want fast inference on a cloud Kubernetes. I mean, there is a comparison, and it’s included here. But there’s a clear winner.

[article] AI teams now deploy 1,000 times a month. Your pipeline wasn’t built for that. If your pipelines aren’t groaning right now, and you’re doing a lot of AI-generated code, that’s a concern.

[blog] How to deploy a Google Agent Development Kit (ADK) agent to Google Cloud Run. Nice step-by-step. You can go from nothing to running agent pretty quickly.

[blog] The Unbundling and Bundling of the PaaS Market. Is PaaS back? It’s always a back and forth between independent services and opinionated stack bundles.

[blog] Growing the next generation of American workers. It’s the builder era, and that includes ALL types of builders. Love this investment.

Want to get this update sent to you every day? Subscribe to my RSS feed or subscribe via email below:

Type your email…

June 11, 2026
Daily Reading List – June 10, 2026 (#802)

A big reading list today. And I got to spend an hour+ this afternoon doing some fresh research into Google’s internal metrics, processes, and tools for agentic coding. Good day.

[blog] Doing nothing at work. Bravo. I’m wary of anyone who claims to be “busy” all the time with no flex in their schedule. That’s not a strategic way to work. And you probably miss most moments where you can make a real impact.

[blog] Routing AI responses between local runners. Model routing will be a hot topic this year, especially as people target different prompts/workloads for different models. Jason built a router to learn about them.

[blog] Developer experience is dead. Long live agent experience. It’s about that layer between the model and the codebase. Do you have the right AX so that a looping agent knows what’s important, what’s allowed, and what success looks like?

[blog] Lovable says it has hit $500M in annualized revenue, with 1 million new projects a week. There was a sense late last year that vibe coding platforms had peaked. Not so.

[blog] Build your own Flutter GenUI solution with Gemini structured outputs. More exploration of generative UIs and what goes into them.

[blog] The current impact of AI on engineering velocity. This post has many strong takeaways. It’s a good reminder for where to look if AI productivity gains aren’t showing up the way you expected.

[blog] Deep dive: How Lightning Engine delivers 4.9x faster Apache Spark performance. All the goodness of Spark, but crazy performance and price-performance for serverless or managed infrastructure. Not a bad deal.

[blog] DiffusionGemma: 4x faster text generation. Impressive stuff! This variant generates entire blocks of text at once.

[article] Anthropic says AI can turn software patches into exploits within hours. Even if you want nothing to do with AI, you better invest a lot in your security patching process because AI is making it easier to turn patches into exploits.

[blog] From Gemini CLI to Antigravity CLI: Automated OWASP Security Compliance and Agentic Remediation in Your Terminal. Speaking of security, here’s a look at how to do easier scanning for vulnerabilities while you’re building.

[blog] Fluid, natural voice translation with Gemini 3.5 Live Translate. My kids are growing up in a world where anyone can understand anyone, regardless of language. Amazing.

[blog] 5 Software Supply Chain Security Best Practices for Development Teams. This is for teams working with containers. Solid advice overall.

[blog] Mastering Hooks in Coding Agents. Are you doing much with hooks as a means of intercepting the agent’s flow of operation? This post will give you more awareness of when/how to use them.

[blog] Building with the Developer Knowledge API and Antigravity CLI. We should be past prompt-and-respond. Now it’s about really thinking through context and grounding that agents need to do quality work.

Want to get this update sent to you every day? Subscribe to my RSS feed or subscribe via email below:

Type your email…

June 10, 2026
Daily Reading List – June 9, 2026 (#801)

I like how much great content I’m seeing about practices, risks, and strategies for people using AI. It’s not “everything is awesome” or “everything is awful.” It’s about staying informed and focused on the desired outcomes.

[blog] In the Seams. If you’ve been around tech long enough, you’ve seen what Devin is calling out here.

[blog] Surviving the eviction: How to build interrupt-resilient AI workloads on GKE. All of this is outstanding advice to avoid any unexpected app behavior when running Kubernetes on ephemeral compute.

[article] Claude Code vs. Cursor vs. Codex vs. Antigravity — six months in. Even harder to predict the next six months! There convergence on some key concepts right now.

[blog] Modern Engineering Values. Fantastic piece. Really. Find details of a pragmatic way of working with LLMs, insights into the engineering values to hold on to, and the results you could see.

[blog] Paris Hilton is Android’s first icon in residence. I really, genuinely love this. You shouldn’t have to be an engineer to go from creative idea to implementation.

[article] 8 myths on software engineering and AI. These will sound familiar to many of you. But seeing them all together is useful.

[blog] Thoughts on starting new projects with LLM agents. I like the advice here: forcing the agents to do small CLs/commits, human in the loop the whole time, Go as the right language for agent-written projects.

[blog] Bringing the latest Gemini models to Apple developers. Easier access to models, along with Gemini in Xcode for help on coding tasks. That model access is courtesy of Firebase technologies.

[article] Companies Are Using AI for Efficiency. They Should Use It to Grow. Efficiency is a great reason to do automation or use AI. Full stop. But not the ONLY reason. Sounds like many execs are missing the top-line value of using AI to drive growth.

[blog] Antigravity Managed Agents Tutorial: Ship Production AI Agents. Simply fantastic writeup of this agent-as-a-service offering where you provide a prompt, and the service analyzes your objective, plans an approach, executes tools, produces results, and loops until complete.

[blog] The Intent Debt. Requirement docs, and such never really captured “intent.” Usually it was the result of discussion/agreement. Now with unattended AI loops, do we need better ways to share the thinking that goes into our decisions? Probably.

[article] Anthropic brings Mythos to the masses with Claude Fable 5, its most powerful generally available model ever. Your chance to try a Mythos-class model is here. Some safeguarded features aren’t opened up to the masses, just in case you wanted to try those. Anthropic’s blog, and one mentioning it’s available on Google Cloud.

Want to get this update sent to you every day? Subscribe to my RSS feed or subscribe via email below:

Type your email…

June 9, 2026
Daily Reading List – June 8, 2026 (#800)

Following last week’s intense focus on tokens, it’s interesting to see some folks doubling-down on token-intense practices such as loop engineering. While others, in today’s reading list, explore the important human role in AI work.

[blog] Do better research with NotebookLM. Sweet update. Better models, more types of output formats (e.g. PDF, Excel, Markdown), and more help getting started on research.

[blog] How I Actually Code (and Review) With AI in 2026. Thoughtful post, and actionable too. Steal some of these prompts/skills for your coding harness.

[blog] Using Agent Platform Memory Bank and Sessions from other runtimes. Services that support agents (like those providing memory) need to be available to any runtime, not tightly coupled to one.

[article] Embedding pipelines are the new ETL. When you do ingestion, chunking, and indexing for your embedding pipeline, are you doing the equivalent of extract-transform-load exercises?

[blog] What’s worth learning in an AI era? “Do I need to learn this” is a profound question to ask yourself right now. And how deep do we need to go?

[youtube-video] Software engineering at the tipping point. Wow, watch this. Seriously. Important talk for the AI era and why engineering matters more than ever.

[blog] Choosing Values for Robust Tests. Quick simple advice for us to follow if we want to avoid false confidence in our test suite.

[blog] How to Secure AI Agents: A Practical Overview for Development Teams. Check this out to see which four security domains Docker thinks you should pay attention to when delivering agents.

[blog] Is Valkey Ready to Replace Redis in 2026? Not a definitive “yes” nor a definitive “no.” This piece has a few useful considerations to factor in.

[blog] Coding Is No Longer the Constraint: Scaling Developer Experience to Teams and Agents at Spotify. Use fewer things, but be great at the tech you invest in. That’s Spotify’s engineering vibe, and it’s serving them well.

[blog] Deploying Hermes AI Agent and WebUI on GCP: A Step-by-Step Hands-On Guide. If you’ve got the tokens and personal agents like crave loops, run stuff like Hermes and OpenClaw.

[paper] Tokenomics: Quantifying Where Tokens Are Used in Agentic Software Engineering. Source paper for an article I shared a month or so ago. It’s more timely right now as people suffer under the weight of rapidly-consumed token budgets.

[article] The AI Agents Stack (2026 Edition). Nice job. It’s a good look at various layers and components that make up modern agents.

Want to get this update sent to you every day? Subscribe to my RSS feed or subscribe via email below:

Type your email…

June 8, 2026
Daily Reading List – June 5, 2026 (#799)

Just got home after an 18-hour travel day. Air France provided the best international wifi of my life. Super fast the whole time. See you Monday!

[blog] Announcing Angular v22. Three big features reach a stable status. The Angular team also delivered a handful of capabilities that make it better for AI apps and agentic coding.

[blog] Gemma 4 12B: The Developer Guide. Find some useful details about this small, portable, and mighty model.

[article] The token bill comes due: Inside the industry scramble to manage AI’s runaway costs. This conversation went from a growl to a roar this week. All off a sudden, token costs are a legit line item expense, and vendors are scrambling to respond accordingly. You know it’s serious when we get a new Foundation!

[blog] The Many Deaths and Rebirths of .NET. Has any programming language/framework had the funky history of .NET? I can’t think of one.

[article] Wells Fargo CEO: AI’s effect on employment is ‘complicated’. Some clear-eyed thinking here. Don’t upend or reconfigure your entire organization until you know what’s possible, and what you’re aiming for.

[blog] Migrating to Antigravity CLI. Moving on from the Gemini CLI? This has the key steps, and also flags a few pretty cool features of the Antigravity CLI.

[blog] What Bun Can Tell Us About AI, Open Source and Anthropic. What does it mean that this wildly popular JavaScript runtime has mostly AI contributors instead of human? This and more is discussed by Steve in this interesting post.

[blog] From Flutter to Backend: How to Build and Ship Production REST APIs with Dart and Shelf. This is a legit deep dive that goes beyond “hello world.” Middleware, authentication, data handling, and more.

[blog] Hybrid AI in Flutter: Routing Between On-Device and Cloud Models. It doesn’t have to be local OR cloud. You can use a mix of both with smart routing.

[blog] Build your own Flutter GenUI solution with Gemini structured outputs. One more Flutter piece today. It’s long, but I like the conclusion that the “thinking” was the hardest part.

[article] Engineering Leaders, You Should Be Worried If Your Team Isn’t Pushing Back. Silence from your team doesn’t mean everyone agrees. Or that the volume equals clarity. Good reminder!

[blog] Scaling AI Agents: A Step-by-Step Guide to Deploying ADK on GKE Autopilot. Very good end-to-end look of building, containerizing, deploying, and configuring an agent on Kubernetes.

[article] How People Are Really Using AI in 2026. Some red flags on losing our intentions and outsourcing our thinking. We’re also replacing human connections that provide emotional support. Not great. There are excellent uses, but we’re probably going to go too far before we pull back to the right place.

Want to get this update sent to you every day? Subscribe to my RSS feed or subscribe via email below:

Type your email…

June 5, 2026
Daily Reading List – June 4, 2026 (#798)

I enjoyed the day in Paris at our big Cloud event. Many great customer discussions, and keynote where all my live demos worked. Miracle! Flying home tomorrow, so may or may not have a list depending on airplane wifi.

[blog] Essential books for product builders—part 1. What a list. I added a couple of these to my own wish list.

[article] Budgeting your team’s code review capacity. Reviews still happen at human speed, even if you’re generating 10x the code. Of course some are applying AI to reviews as well. But then you lose all awareness of what your system does. Check this out for a take on the problem, and a solution.

[blog] Modular Monolith Boundaries Done Wrong. Start with ownership. That’s the guidance here.

[blog] The Future of Agents. Nobody knows what’s going to happen next, but I appreciate people who chew on that question. I agree with most of this!

[blog] Connecting AI agents with unstructured data using Google Cloud Storage MCP Servers. Start to imagine what you can do with an easy way for agents to store and retrieve all types of digital data (screenshots, PDFs).

[blog] My Agent Skill for Test-Driven Development. Good stuff. Is someone making it easy to start an agentic coding session with a pre-loaded set of default skills for the given situation? You’d add a TDD one for legit coding sessions.

[blog] Kaggle is making AI benchmark creation effortless. Build and work on these model tests from your local environment now.

[article] Reliability Engineering for Air-Gapped Systems. I’ve thought very little (not at all?) about this problem. This helped me develop some understanding of it.

[blog] How XP Made A Better AI Coder. Aja shares how some fundamentals of software come in handy now.

Want to get this update sent to you every day? Subscribe to my RSS feed or subscribe via email below:

Type your email…

June 4, 2026
Daily Reading List – June 3, 2026 (#797)

Fantastic day at our Google Cloud Summit Nordics event in Stockholm. I got to host some customers during the keynote, deliver a breakout on AI-first product delivery, and then meet with a handful of customers at all different levels of AI adoption. Flying to Paris right now ahead of another event tomorrow.

[article] How do coding agents compare to copilots for developer productivity? User effort was half with agents compared to copilots. Also less cognitive load, and less understanding of the code.

[paper] Twenty Years of Bigtable. Many imitators, nothing quite like the original. This talks about the product journey and improvements made over time.

[blog] The Speed of Prototyping in the Age of AI. Nice report about how the shape of work changes, and the kind of the work we do.

[blog] Why Hardened Images are Suddenly Everywhere. Great post. A bit of history, and a focus on what developers really care about. It’s scary out there!

[blog] Design Patterns Are Dead. Long Live Design Patterns. Which classic design patterns got eaten by the programming language, or matter less? Christina looks at this, and which ones survived.

[article] AI enthusiasts are in a race against time, AI skeptics are in a race against entropy. What an essay from Charity. These groups are talking past each other and increasingly frustrated. Charity tries to bring it back together on shared goals.

[blog] Introducing Gemma 4 12B: a unified, encoder-free multimodal model. Multimodal and laptop ready. Still Apache 2 licensed so party on. More here.

[article] How GitHub plans to win developers back. There’s still time. Lots to do. The usage-based billing probably won’t help, but those who grumble won’t find much relief elsewhere.

[article] AI’s brave new world of technical debt. A good challenge of conventional wisdom here. The latest thing isn’t always the safest thing.

[article] OpenAI launches new Codex tools for white-collar work. Not surprising to see them (and many platform companies) expanding reach to more types of users.

[blog] 6 Enterprise MCP Adoption Best Practices. Basically, “operational maturity” is the story here.

[article] Can Chainguard Save Open-Source Software From Mythos? Can Anyone? How businesses consume open source is broken, says Dan in this article.

Want to get this update sent to you every day? Subscribe to my RSS feed or subscribe via email below:

Type your email…

June 3, 2026
Daily Reading List – June 2, 2026 (#796)

Just wrapped up the day in Stockholm with more fun customer chats, and some rehearsals for the big event here tomorrow. And after a five hour dinner with people, my social battery is sub-zero.

[blog] State of SDLC Security 2026. The Wiz team shared a few data points in this blog post and links to a downloadable report.

[article] Coders are refusing to work without AI — and that could come back to bite them. It could. But I’d also refuse (a few years ago) to code without an IDE, or ask for anything but a cloud server. We get used to things that help us!

[blog] How we used Gemini to build Google I/O 2026. Creative teams everywhere should be using these tools. Apply your own ideas and produce output that would’ve seemed impossible just a year ago.

[blog] The Go language server can do some impressive code navigation. Some geeky love here for some thoughtful things we added for users of our language server.

[article] The DIY platform trap that’s burning out engineering teams. It’s the build versus buy debate. Building your own platform has benefits, and risks. I think the typical rule holds that you should only build differentiators.

[blog] Introducing the GKE standby buffer: Improve node startup times without blowing your budget. Over-provision to be safe, or endure a slow auto-scaling when the burst comes? There’s now a third option for Kubernetes users.

[blog] Why is test-driven development with agents so helpful for security? Aron shares some good thinking here. Can TDD be a security and stability gate for your AI coding agent?

[blog] Iterating on Frontend Design with Stitch and Antigravity CLI. This is the biggest change that AI has made for me. All of a sudden, I can work on parts of the stack (like the frontend) that were inaccessible to me before.

[article] Get a Good Return on Your AI Investments. We have plenty of anecdotal feedback about AI in software teams, but Nathen brings receipts.

[blog] Developer’s guide to Gemini Enterprise and A2UI integration. Instead of getting walls of text back from your AI chat tool, what if you could get rich, dynamic UIs composed on the fly? You can. Here’s how.

Want to get this update sent to you every day? Subscribe to my RSS feed or subscribe via email below:

Type your email…

June 2, 2026
Daily Reading List – June 1, 2026 (#795)

I’m settling into Stockholm after having yesterday to walk around and enjoy time with an old friend. Today was full of customer meetings and hanging out with startup founders.

[blog] How To Handle Conflict: 7 Secrets From History’s Master Strategist. Good post, especially for those of us that can be clumsy with our approach to conflict with others.

[article] How to stop the AI code generation treadmill. Interesting. This writer suggests that instead of piling more guardrails around AI-generated code, generate less code. Push the AI to select pre-built, pre-tested components. That assumes you have them!

[article] Open Source Ecosystems. A quick look at the role interdependencies play on the impact of open source.

[blog] Twelve Ways to Be Wrong About AI-Assisted Coding. Here are examples of bad measurements of value for your AI tools.

[article] Claude Mythos exposed a hard truth: Your enterprise patching process is way too slow. Talked to a colleague about this on Friday too. Attackers are exploting zero-days so much faster now, but enterprise patching still takes weeks. Five recommendations here.

[blog] Gemini Managed Agents or ADK? Autopilot or Cockpit? It’s not the same, but resembles a PaaS vs. Kubernetes debate. The answer is usually both. When you want one-off, fully-managed agents use Gemini Managed Agents. When you want flexibility and control, build agents with ADK.

[article] “Tokenmaxxing is real, expensive & it’s spreading”: AI budgets are exploding. It looks like products are emerging to help you be more judicious with token use. And figure out the impact/outcome of those tokens.

[blog] What’s 🔥 in Enterprise IT/VC #500. I read this every week, and congrats to Ed on reaching 500 editions. This one has some fantastic insights into the new stage of enterprise AI.

[article] What Makes a Good Productivity Metric? I like the questions this article asks, and the example our Google researchers provide.

[blog] From petabytes to predictions: Easy BigQuery insights in Google Sheets. I forgot about this feature, probably because I don’t live in spreadsheets. But Connected Sheets are a pretty awesome idea.

[article] The Keys to Succeeding Under a New Manager. Good advice, and reminder. I’ve switched managers a few times at Google, and I’ve gone into this latest one too casually.

[article] Vendor neutrality isn’t magic: A hard look at the OpenTelemetry ecosystem. This project may fly under the radar if you’re not a platform engineer, but OpenTelemetry is a great example of a widely adopted standard.

Want to get this update sent to you every day? Subscribe to my RSS feed or subscribe via email below:

Type your email…

June 1, 2026
One prompt, four (sub)agents, and ninety seconds to get a working app
I’m been thinking a lot about agent teams. You know, a set of AI agents that work together towards a goal. You might implement this within a single agent harness (see Garry Tan’s gstack for Claude Code) or with an orchestrator for multiple harnesses (see Google’s Scion project). I’ve been doing agent-at-a-time coding work thus far, but figured it was time to dive into more multi-agent workflows. The subagents natively built into Google Antigravity 2.0 and Antigravity CLI gave me the push.

Why use subagents at all? Can’t I just use one coding agent to run through my work? Don’t we reach decision fatigue faster because of the coordination overhead? Yes, the engineering work shifts from more sequential human tasks to assigning and reviewing the work of multiple AI agents. There’a tax, no doubt. But there are benefits to deconstructing the work in a way that an agent team can tackle it. You limit the change surface (by giving each agent a persona, MCPs, and skills to address an isolated piece of work), consume fewer tokens (by not processing the entire context for all tasks), and go faster (by parallelizing work that tolerates it).

Sound intimidating? It’s felt that way to me. But Claude Code and Antigravity CLI make it less scary. Let’s talk about Google Antigravity. This agent-first harness (used via a desktop app, CLI, or IDE) provides out-of-the-box support for pre-built subagents (e..g browser and research), creating custom subagents, communicating between subagents, and lifecycle management of subagents.

I spent last weekend playing around with prompts to create an agent team that could build a backend API, a corresponding frontend API, and some unit tests. To make this process go as fast as possible, I’m also basically doing YOLO mode where I let Antigravity run any terminal command and proceed without my intervention. Here’s the setup in the Antigravity desktop app.

So what’s the prompt? Here’s what I came up with.
```
Let's build a hotel room booking app for Seroter Hotels consisting of a Go backend API and a web frontend. 

First, launch the **Engineering Manager** agent to design the API and frontend, saving the design and a Mermaid diagram into an artifact called 'architecture.md'. 

Once the design is ready, launch three agents in parallel:
1. **Test Manager**: Write a simple API test plan and append it to 'architecture.md'.
2. **Backend Engineer**: Build a clean Go REST API with standard error handling based on the design.
3. **Frontend Engineer**: Build a responsive web UI using a simple CSS framework like Tailwind to interact with the API (skip UI testing).

As soon as the Test Manager finishes the plan, have them hand it off to the Backend Engineer, who reads the plan from 'architecture.md' and adds the Go tests to the code. After both engineers finish building, the Test Manager runs the tests. Finally, spin up both components and a browser so I can test the live app.
```
Let’s be clear. In real-life, you’d provide each subagent with significantly more context (tools, skills, data structures, persona characteristics) or trigger some back-and-forth so the subagent could gather a rich set of requirements from you. But this works to prove the point.

Here’s a video recording of the result, at 1.0x speed. It takes all of ninety seconds from when Antigravity starts working until the frontend and backend services are running. Below the video, I’ll deconstruct some key parts of the response.

Right after Antigravity got started, we saw the Implementation plan. This artifact defined each subagent and the sequence they would follow.

I steered work to a shared architecture.md file. When we view that, we see the decision log used by the agents.

As the subagents got to work, the main task list kept getting updated automatically.

Each subagent had its own context and workstream. Here, the backend agent records its implementation work.

It’s mesmerizing to watch the subagents start up and do their thing! On this right pane, you can see each subagent come up, printouts of each thing it’s doing, and finally when the root agent kills them off. I’m also able to track all the artifacts and any background tasks going on.

When it finished up, the main conversation recapped the final details and showed me how to access the app endpoints.

Because Antigravity knows how to work the Chrome browser, it also automatically launched a browser window and showed me the web front end. Looks great!

The Antigravity CLI supports virtually the same workflow.

I think it completed even faster than the desktop app! Same agent team. Same artifacts and workflow.

Pretty awesome. I’m sold. Download Google Antigravity, copy my prompt, make it better, try this for yourself. To me, building software has never been this fun.
June 1, 2026

←Previous Page

1 2 3 4 5 … 145

Richard Seroter's Architecture Musings

Loading Comments...

Write a Comment...

Email (Required)

Name (Required)

Website

Subscribe Subscribed
- Richard Seroter's Architecture Musings
- Already have a WordPress.com account? Log in now.