Spent a bit of time this weekend playing with using agents to build agents. How meta! Today, many of our engineering leads answered Gemini CLI questions in this Reddit AMA. What a wild time for builders.
[blog] How to Fix Your Context. Building on the previous items, how should you think smartly about creating and maintaining context used by an agent? Great post.
[blog] Go 1.25 interactive tour. These are always terrific posts. Anton publishes these assessments of Go releases by letting you test out the features right within the blog post. See his related (and interactive) look at the new JSON handling capabilities.
[blog] APIs Versioning. No new ground, but it’s a good topic to refresh yourself on from time to time. And a reminder for those who keep breaking APIs.
[blog] This Week in Open Source for June 27, 2025. I’m liking where this weekly update is going. It provides a broad look at the open source landscape and what’s happening.
[article] Why Software Migrations Fail: It’s Not the Code. I imagine that AI is going to “fix” parts of this, but it’s still a reminder that code updates aren’t the only thing you need to worry about when doing a migration project.
Want to get this update sent to you every day? Subscribe to my RSS feed or subscribe via email below:
It’s the end of a busy week. I’m looking forward to a hopefully-quiet weekend with the family and a few good books. Take a breather too!
[article] How much does AI impact development speed? Good look at some new research into how much AI impacts developers and their pace. Interesting that seniority and prior usage didn’t change the outcome.
[blog] Veo 3: A Detailed Prompting Guide. This is absolutely fantastic. Get all the right phrases and jargon to use when prompting a text-to-video model like Veo 3. Gold!
[blog] Coding agents have turned a corner. It’s fine when goofballs like me use these AI tools and offer guidance, but I really value the advice from capital-E Engineers like Brian. Here, he offers his direction on how to work with coding agents.
[blog] The rise of “context engineering.” The idea here is making sure the LLM has the right information and tools it needs to accomplish its task. It’s a system approach, versus thinking in prompts alone.
Another wild day. I’m doing some research into what other people think modern, AI-driven coding looks like. May turn my findings into a blog post. Either way, a new work style is forming.
[blog] Choosing the Right Deployment Path for Your Google ADK Agents. Fantastic post from Ayo that explores three agent hosts with different value propositions. You’ll likely debate their three types of platforms, regardless of which cloud you use.
[blog] 6 ways to become a database pro with the Gemini CLI. It’s a mistake to lump these agentic CLIs into a “coding tools” bucket. You can do a lot more than code apps. Karl shows some great data-focused examples here.
[blog] What Diff Authoring Time (DAT) reveals about developer experience. What’s going on from that moment a developer makes their first edit, until a pull request gets created? How do we measure that and improve the experience? Here’s analysis of some recent research.
[article] How To Prepare Your API for AI Agents. If you actually have an API strategy, you’re already ahead of others. This article has some advice for what to focus on.
[blog] I don’t care if my manager writes code. Should engineering managers be committing code alongside their reports? No, that doesn’t seem very wise or sustainable. But I do want my management to deeply know the tech the team is using.
Yesterday morning, we took the wraps off one of the most interesting Google releases of 2025. The Gemini CLI is here, giving you nearly unlimited access to Gemini from directly within the terminal. This is a new space, but there are other great solutions already out there. Why is this different? Yes, it’s good at multi-step reasoning, code generation, and creative tasks. Build apps, fix code, parse images, build slides, analyze content, or whatever. But what’s truly unique is that it’s fully open source, no cost to use, usable anywhere, and super extensible. Use Gemini 2.5 Pro’s massive context window (1M tokens), multimodality, and strong reasoning ability to do some amazing stuff.
Requirements? Have Node installed, and a Google account. That’s it. You get lots of free queries against our best models. You get more by being a cloud customer if you need it. Let’s have a quick look around, and then I’ll show you four prompts that demonstrate what it can really do.
The slash command shows me what’s available here. I can see and resume previous chats, configure the editor environment, leverage memory via context files like GEMINI.md, change the theme, and use tools. Choosing that option shows us the available tools such as reading files and folders, finding files and folders, performing Google searches, running Shell commands, and more.
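As a quick illustration of those context files, here’s a hypothetical GEMINI.md (the file name is the CLI’s convention; the contents below are just an example I made up, not from an actual project):

```markdown
# Project context for Gemini CLI

- This repo is a Vue.js frontend with a small Node backend.
- Prefer small, incremental changes, and explain each one.
- Run `npm test` before declaring a task finished.
```

The CLI folds instructions like these into its context on every run, so the agent keeps your project conventions in mind without you repeating them in each prompt.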
I’m only scratching the surface here, so don’t forget to check out the official repo, docs, and blog post announcement. But now, let’s walk through four prompts that you can repeat to experience the power of the Gemini CLI, and why each is a big deal.
Prompt #1 – Do some research.
Software engineering is more than coding. You spend time researching, planning, and thinking. I want to build a new app, but I’m not sure which frontend framework I should use. And I don’t want stale answers from an LLM that was trained a year ago.
I’ve got a new research report on JavaScript frameworks, and also want to factor in web results. My prompt:
What JavaScript framework should I use to build my frontend app? I want something simple, standards-friendly, and popular. Use @report.pdf for some context, but also do a web search. Summarize the results in a way that will help me decide.
The Gemini CLI figured out which tools to use, successfully folded the file into the prompt, and set off on its work: searching the web and preparing results.
The results were solid. I got tradeoffs and analysis of three viable options. The summary was helpful, and I could have kept going back and forth with clarifying questions. For architects, team leaders, and engineers, having a research partner in the terminal is powerful.
Why was this a big deal? This prompt showed the use of live Google Search, local (binary) file processing, and in-context learning for devs. These tools are changing how I do quick research.
Prompt #2 – Build an app.
These tools will absolutely change how folks build, fix, change, and modernize software. Let’s build something new.
I fed in this prompt, based on my new understanding of relevant JavaScript frameworks.
Let’s build a calendar app for my family to plan a vacation together. It should let us vote on weeks that work best, and then nominate activities for each day. Use Vue.js for the JavaScript framework.
Now to be sure, we didn’t build this to be excellent at one-shot results. Instead, it’s purposely built for an interactive back-and-forth with the software developer. You can start it in --yolo mode to have it automatically proceed without asking permission, and even with --b to run it headless with no interactivity. But I want to stay in control here. So I’m not in YOLO mode.
I quickly got back a plan, and was asked if I wanted to proceed.
Gemini CLI also asks me about running Shell commands. I can allow it once, allow it always, or cancel. I like these options. It’s fun watching Gemini make decisions and narrate what it’s working on. Once it’s done building directories, writing code, and evaluating its results, the CLI even starts up a server so that I can test the application. The first draft was functional, but not attractive, so I asked for a cleanup.
The next result was solid, and I could have continued iterating on new features along with look and feel.
Why was this a big deal? This prompt showed iterative code development, important security (request permission) features, and more. We’ll also frequently offer to pop you into the IDE for further coding. This will change how I understand or bootstrap most of the code I work with.
Prompt #3 – Do a quick deploy to the cloud.
I’m terrible at remembering the syntax and flags for various CLI tools. The right git command or Google Cloud CLI request? Just hopeless. The Gemini CLI is my solution. I can ask for what I want, and the Gemini CLI figures out the right type of request to make.
We added MCP as a first-class citizen, so I added the Cloud Run MCP server, as mentioned above. I also made this work without it, as the Gemini CLI figured out the right way to directly call the Google Cloud CLI (gcloud) to deploy my app. But, MCP servers provide more structure and ensure consistent implementation. Here’s the prompt I tried to get this app deployed. Vibe deployment, FTW.
Ship this code to Cloud Run in us-west1 using my seroter-project-base project. Don’t create a Dockerfile or container, but just deploy the source files.
The Gemini CLI immediately recognizes that a known MCP tool can help, and shows me the tool it chose.
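For context, the Gemini CLI registers MCP servers in its settings file (.gemini/settings.json). Here’s a sketch of what a Cloud Run MCP entry might look like (the server name and launch command here are assumptions based on the cloud-run-mcp project, not details pulled from this post):

```json
{
  "mcpServers": {
    "cloud-run": {
      "command": "npx",
      "args": ["-y", "https://github.com/GoogleCloudPlatform/cloud-run-mcp"]
    }
  }
}
```

Once an entry like this exists, the CLI discovers the server’s tools at startup and can pick them for relevant prompts.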
It got going and shipped my code successfully to Cloud Run using the MCP server. But the app didn’t start correctly. The Gemini CLI noticed by reading the service logs, and diagnosed the issue: the code didn’t specify which port to listen on. No problem.
It came up with a fix, made the code changes, and redeployed.
Why was this a big deal? We saw the extensibility of MCP servers, and the ability to “forget” some details of exactly how other tools and CLIs work. Plus we observed that the Gemini CLI did some smart reasoning and resolved issues on its own. This is going to change how I deploy, and how much time I spend (waste?) deploying.
Prompt #4 – Do responsible CI/CD to the cloud.
The third prompt was cool and showed how you can quickly deploy to a cloud target, even without knowing the exact syntax to make it happen. I got it working with Kubernetes too. But can the Gemini CLI help me do proper CI/CD, even if I don’t know exactly how to do it? In this case I do know how to set up Google Cloud Build and Cloud Deploy, but let’s pretend I don’t. Here’s the prompt.
Create a Cloud Build file that would build a container out of this app code and store it in Artifact Registry. Then create the necessary Cloud Deploy files that define a dev and production environment in Cloud Run. Create the Cloud Deploy pipeline, and then reference it in the Cloud Build file so that the deploy happens when a build succeeds. And then go ahead and trigger the Cloud Build. Pay very careful attention to how to create the correct files and syntax needed for targeting Cloud Run from Cloud Deploy.
The Gemini CLI started by asking me for some info from my Google Cloud account (project name, target region) and then created YAML files for Cloud Build and Cloud Deploy. It also put together a CLI command to instantiate a Docker repo in Artifact Registry. Now, I know that the setup for Cloud Deploy working with Cloud Run has some specific syntax and formatting. Even with my above command, I can see that I didn’t get syntactically correct YAML in the skaffold file.
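To make the moving parts concrete, here’s a rough sketch of a cloudbuild.yaml along those lines (the repo, app, and pipeline names are placeholders I made up; this isn’t the file the CLI generated):

```yaml
steps:
  # Build and push the container image to Artifact Registry
  - name: 'gcr.io/cloud-builders/docker'
    args: ['build', '-t', 'us-west1-docker.pkg.dev/$PROJECT_ID/my-repo/my-app:$SHORT_SHA', '.']
  - name: 'gcr.io/cloud-builders/docker'
    args: ['push', 'us-west1-docker.pkg.dev/$PROJECT_ID/my-repo/my-app:$SHORT_SHA']
  # Kick off the Cloud Deploy pipeline once the build succeeds
  - name: 'gcr.io/google.com/cloudsdktool/cloud-sdk'
    entrypoint: gcloud
    args:
      - deploy
      - releases
      - create
      - release-$SHORT_SHA
      - --delivery-pipeline=my-pipeline
      - --region=us-west1
      - --images=my-app=us-west1-docker.pkg.dev/$PROJECT_ID/my-repo/my-app:$SHORT_SHA
```

The last step is the handoff: each successful build creates a new Cloud Deploy release, which then promotes through the dev and production targets.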
I rejected the Gemini CLI’s request to do a deployment, since I knew it would fail. Then I gave it the docs URL for setting up Cloud Run with Cloud Deploy and asked it to make a correction.
That Skaffold file doesn’t look correct. Take a look at the docs (https://cloud.google.com/deploy/docs/deploy-app-run), and follow its guidance for setting up the service YAML files, and referencing the right Skaffold version at the top. Show me the result before pushing a change to the Cloud Deploy pipeline.
Fortunately, the Gemini CLI can do a web fetch and process the latest product documentation. I did a couple of turns and got what I wanted. Then I asked it to go ahead and update the pipeline and trigger Cloud Build.
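For reference, the shape those docs steer you toward looks roughly like this (file and service names are placeholders; check the docs for the current Skaffold schema version):

```yaml
# skaffold.yaml
apiVersion: skaffold/v4beta7
kind: Config
manifests:
  rawYaml:
    - run-service.yaml
deploy:
  cloudrun: {}
---
# run-service.yaml (Cloud Run targets use the Knative serving schema)
apiVersion: serving.knative.dev/v1
kind: Service
metadata:
  name: my-app
spec:
  template:
    spec:
      containers:
        - image: my-app
```

The gotcha my first attempt hit is right here: Cloud Run targets need the cloudrun deployer and a Knative-style service YAML, not a generic Kubernetes manifest.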
It failed at first because I didn’t have a Dockerfile, but after realizing that, it automatically created one and started the build again.
It took a few iterations of failed builds for the Gemini CLI to land on the right syntax. But it kept dutifully trying, making changes, and redeploying until it got it right. Just like I would have if I were doing it myself!
After that back and forth a few times, I had all the right files, syntax, container artifacts, and pipelines going.
Some of my experiments went faster than others, but that’s the nature of these tools, and I still did this faster overall than I would have manually.
Why was this a big deal? This showcased some sophisticated file creation, iterative improvements, and Gemini CLI’s direct usage of the Google Cloud CLI to package, deploy, and observe running systems in a production-like way. It’ll change how confident I am doing more complex operations.
Background agents, orchestrated agents, conversational AI. All of these will play a part in how we design, build, deploy, and operate software. What does that mean to your team, your systems, and your expectations? We’re about to find out.
We’ve been building up to today’s launch of the Gemini CLI. There were some inevitable hiccups on launch day, but it’s fun to be part of teams that make things people like using. Give it a try!
[blog] Gemini CLI: your open-source AI agent. This is a huge deal. Open source, free to use, lightweight, and super extensible. This is another reason I think software engineering is changing forever. Press coverage here, here, here, and here.
[blog] Getting Started with Gemini CLI. Here’s a spot-on guide for installing and flexing some of the most important parts of our powerful new tool.
A fun and frantic day. My kids are on Summer break, but there’s no slowdown for us working folks. That’s fine by me, as long as the work is interesting and impact is possible.
[blog] Using AI Right Now: A Quick Guide. Give this a read. Ethan does another great job explaining not only which AI he uses when, but how to act like a power user.
[blog] This Week in Open Source – Inaugural Post. To me, it seems like it’s been a while since an industry-shaking open source project came out from a big tech vendor. A2A is great, but still early. But there’s actually still a ton of OSS going on at Google, and I liked this recap.
[article] The AI Revolution Won’t Happen Overnight. It will not. I think the optimists (of which I am one) have predicted the right impact, but the timeline is off.
[blog] Our favorite moments from 20 years of Google Earth. Do you just take for granted that you can immediately call up nearly any place on the planet and see it in detail? I do. And that’s ridiculous. 20 years later, this is still a magic service.
Happy Monday. Summer is in full swing here in San Diego and we’ve somehow skipped the annual June Gloom. I’ve jinxed it, haven’t I.
[paper] What Makes a Good Natural Language Prompt? The difference between a good prompt and bad prompt is stark. Very different results! This paper looks at 21 prompting categories that determine the quality of output.
[blog] Google Cloud donates A2A to Linux Foundation. Excellent news! The major hyperscalers plus others are the founding members of this new project to advance agent-to-agent communication.
[blog] In Praise of “Normal” Engineers. Build great teams, don’t chase 10x engineers. If you have one, great, but your throughput is dependent on the overall software team’s performance.
[article] My 2025 system prompt. John put together a legit system prompt for use in LLM chat experiences when he wants direct, focused responses.
[blog] Colab Terminal Is Now Free For All Users. Long live the terminal! Now there’s more you can do, for free, from the Colab Terminal. Add packages, do git commands, and more.
Yesterday was a US holiday. It’s weird having a Thursday off and then going right back to work on Friday. But, I think we all survived. Got some work done, did a Twitter thread on coding assistance, and got hands-on with some upcoming tech. Not too shabby.
[blog] How We Use AI At Pulley. Excellent look at what it means to build with AI. From tools to updated practices, there’s a lot to learn from here.
[article] MCP Needs a Security Reality Check. MCP didn’t start with a lot of mature security considerations, but they’re quickly coming around.
[blog] Single vs Multi-Agent System? Great piece by Philipp. There’s competing advice out there, and he spends time sharing the pros/cons of each approach.
[blog] Every service should have a killswitch. Every code path doesn’t require this, but there’s definitely merit in having a way to turn off a hot operation.
I feel like I’m seeing the future of software engineering take shape. Maybe it’s where I work, but I think it’s more about what is happening around me. It’s exciting to see what we build and how we build both evolving so quickly.
[blog] We Are Better Than This. It’s absurd that Bob even needs to write this. But yes, the world has been noticeably lacking in solidarity and support for our Jewish friends and neighbors.
[blog] Writing documentation for AI: best practices. Do you need to do anything different from regular SEO in your docs to ensure that LLMs properly digest and process your data?
[blog] Modern (Go) application design. Here’s a good look at structures and how they help us build maintainable processes and systems.
[blog] The “Trust, But Verify” Pattern For AI-Assisted Engineering. Skipping AI assistance entirely is dumb. Implicitly trusting everything AI gives you is also dumb. Addy offers a helpful framing that should help us treat AI the right way.
[blog] Expanding ADK AI agent capabilities with tools. Agents need tools to do most anything interesting. Guillaume looks at the various types of tools you can use in the Agent Development Kit.
[blog] MCP and Agentic AI on Google Cloud Run. Your agents and MCP servers need to run somewhere. Services like Cloud Run offer a lightweight, flexible option.
[article] Why most developers hate their tech stack. Here’s a metric for you. How many of your folks like or dislike your tech stack? How does that prevent them from doing their best work?