I’m in Sunnyvale for my last trip of the year, and had an excellent day. I saw lots of colleagues and friends, and even got a couple of AI demos working during breaks. Also, I read a lot, which you’ll see below.
[blog] Deploy Gemma 2 LLM with Text Generation Inference (TGI) on Google Cloud GPU. If you’re using open LLMs, you’ll need some way to serve model predictions. The Hugging Face TGI toolkit is popular, and this post shows off how to use it.
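For a sense of what hitting a TGI server looks like once it's deployed: TGI exposes a `/generate` route that takes an `inputs` string plus generation `parameters`. This is a minimal sketch using only the standard library; the URL and parameter values are placeholders for your own deployment.

```python
import json
import urllib.request

TGI_URL = "http://localhost:8080/generate"  # placeholder; point at your TGI endpoint


def build_payload(prompt: str, max_new_tokens: int = 128) -> dict:
    """Build the JSON body TGI's /generate route expects."""
    return {
        "inputs": prompt,
        "parameters": {"max_new_tokens": max_new_tokens},
    }


def generate(prompt: str) -> str:
    """POST the prompt to TGI and return the generated text."""
    body = json.dumps(build_payload(prompt)).encode("utf-8")
    req = urllib.request.Request(
        TGI_URL, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["generated_text"]
```

The post itself covers the Google Cloud deployment side; this just shows the client's view of the serving endpoint.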
[article] Optimizing Java Applications on Kubernetes: Beyond the Basics. My friend Bruno shares some advice in this video (and transcript) from a recent InfoQ event.
[article] How To Add Persistence and Long-Term Memory to AI Agents. You can fake “memory” by continuing to pass all the history into the LLM on each request, but having a persistence layer makes sense.
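A minimal sketch of what that persistence layer might look like, using SQLite as the store (the table schema and class here are my own illustration, not the article's implementation): persist each turn, then reload recent history to pass to the LLM instead of carrying the whole transcript in application memory.

```python
import sqlite3


class ConversationStore:
    """Persist chat turns so an agent can reload history across sessions."""

    def __init__(self, path: str = ":memory:"):
        self.db = sqlite3.connect(path)
        self.db.execute(
            "CREATE TABLE IF NOT EXISTS messages ("
            "  session_id TEXT, role TEXT, content TEXT)"
        )

    def append(self, session_id: str, role: str, content: str) -> None:
        self.db.execute(
            "INSERT INTO messages (session_id, role, content) VALUES (?, ?, ?)",
            (session_id, role, content),
        )
        self.db.commit()

    def history(self, session_id: str, limit: int = 50) -> list:
        """Return the most recent turns, oldest first, ready to send to the LLM."""
        rows = self.db.execute(
            "SELECT role, content FROM messages WHERE session_id = ? "
            "ORDER BY rowid DESC LIMIT ?",
            (session_id, limit),
        ).fetchall()
        return [{"role": r, "content": c} for r, c in reversed(rows)]
```

The `limit` keeps the prompt bounded; a real agent would layer summarization or retrieval on top of this for true long-term memory.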
[blog] Communication Structures in a Growing Organization. As teams, orgs, and companies grow, the communication pattern needs to evolve. Jessica offers helpful guidance here.
[youtube-video] Simon Willison: The Future of Open Source and AI. You’d expect that a conversation between Logan and Simon would be interesting. You would be right.
[blog] Heroku Open Sources the Twelve-Factor App Definition. I didn’t realize it was closed source, or even considered “owned” by anyone, even Heroku. But now it’s a community-owned document.
[blog] Unlocking the power of time-series data with multimodal models. Might you start using generative AI for legit data analysis? This blog from Google Research might spark some ideas for you.
[site] The State of Frontend, 2024. I don’t think I saw this before. Check out these survey results for a glimpse into what devs are using to build the frontend.
[blog] Comparative Analysis of OWASP Top 10 for LLM Applications (2023 vs. 2025). What are the latest risks to mitigate with LLMs? Here are the vulnerabilities to protect yourself against.
[article] Which IDEs do software engineers love, and why? IDEs are among the stickiest tools for developers. But are preferences changing before our eyes? It seems like it.
[blog] Tracing with Langtrace and Gemini. This is a solution to a problem you wouldn’t have had a couple of years ago. But now you may be figuring out latency in your LLM calls, and need some traceability.
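Langtrace ships its own SDK, but the core idea — wrap each LLM call in a timed span so you can see where latency goes — can be sketched generically (the span list and decorator below are illustrative, not Langtrace's actual API):

```python
import functools
import time

SPANS = []  # stand-in for a real trace exporter


def traced(name: str):
    """Decorator that records wall-clock latency of each call as a 'span'."""
    def wrap(fn):
        @functools.wraps(fn)
        def inner(*args, **kwargs):
            start = time.perf_counter()
            try:
                return fn(*args, **kwargs)
            finally:
                SPANS.append({
                    "name": name,
                    "latency_ms": (time.perf_counter() - start) * 1000,
                })
        return inner
    return wrap


@traced("gemini.generate")
def call_llm(prompt: str) -> str:
    time.sleep(0.01)  # placeholder for the real model call
    return "echo: " + prompt
```

A real tracing setup would also capture token counts, nested spans for retrieval steps, and export to a backend, but the timing wrapper is the heart of it.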
[blog] Streamline Kubernetes cluster management with new Amazon EKS Auto Mode. I’ll be tracking what AWS is up to at re:Invent this week. In particular, which Google Cloud features they’re finally adopting! In this case, it’s a more automated Kubernetes. Also, hybrid cluster management. Both are things we’ve had for years.
[blog] PayPal’s Real-Time Revolution: Migrating to Google Cloud for Streaming Analytics. We need more stories of smart tech companies who swap self-managed for commercial products after realizing that homegrown solutions weren’t differentiated any longer.
Want to get this update sent to you every day? Subscribe to my RSS feed or subscribe via email below: