Author: Richard Seroter

  • More than serverless: Why Cloud Run should be your first choice for any new web app.

    I’ll admit it, I’m a PaaS guy. Platform-as-a-Service is an ideal abstraction for those who don’t get joy from fiddling with infrastructure. From Google App Engine, to Heroku, to Cloud Foundry, I’ve appreciated attempts to deliver runtimes that make it easier to ship and run code. Classic PaaS-type services were great at what they did. The problem with all of them—and this includes first-generation serverless products like AWS Lambda—was that they were limited. Some of the necessary compromises were well-meaning and even healthy: build 12-factor apps, create loose coupling, write less code and orchestrate managed services instead. But in the end, all these platforms, while successful in various ways, were too constrained to take on a majority of apps for a majority of people. Times have changed.

    Google Cloud Run started as a serverless product, but it’s more of an application platform at this point. It’s reminiscent of a PaaS, but much better. While not perfect for everything—don’t bring Windows apps, always-on background components, or giant middleware—it’s becoming my starting point for nearly every web app I build. There are ten reasons why Cloud Run isn’t limited by PaaS-t constraints, is suitable for devs at every skill level, and can run almost any web app.

    1. It’s for functions AND apps.
    2. You can run old AND new apps.
    3. Use by itself AND as part of a full cloud solution.
    4. Choose simple AND sophisticated configurations.
    5. Create public AND private services.
    6. Scale to zero AND scale to 1.
    7. Do one-off deploys AND set up continuous delivery pipelines.
    8. Own aspects of security AND offload responsibility.
    9. Treat as post-build target AND as upfront platform choice.
    10. Rely on built-in SLOs, logs, metrics AND use your own observability tools.

    Let’s get to it.

    #1. It’s for functions AND apps.

    Note that Cloud Run also has “jobs” for run-to-completion batch work. I’m focusing solely on Cloud Run web services here.

    I like “functions.” Write short code blocks that respond to events and perform an isolated piece of work. There are many great use cases for this.

    The new Cloud Run functions experience makes it easy to bang out a function in minutes. It’s baked into the CLI and UI. Once I decide to create a function, I only need to pick a service name, region, language runtime, and whether access to the function is authenticated.

    Then, I see a browser-based editor where I can write, test, and deploy my function. Simple, and something most of us equate with “serverless.”

    But there’s more. Cloud Run does apps too. That means instead of a few standalone functions to serve a rich REST endpoint, you’re deploying one Spring Boot app with all the requisite listeners. Instead of serving out a static site, you could return a full web app with server-side capabilities. You’ve got nearly endless possibilities when you can serve any container that accepts HTTP, HTTP/2, WebSockets, or gRPC traffic.

    Use either abstraction, but stay above the infrastructure and ship quickly.
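    Cloud Run’s only real contract with your container is simple: listen for HTTP on the port given in the PORT environment variable (8080 by default). Here’s a minimal sketch of a service honoring that contract with only the Python standard library; the handler and greeting text are purely illustrative.

```python
# Minimal web service following Cloud Run's container contract:
# listen on the port from the PORT env var (default 8080), speak plain HTTP.
import os
from http.server import BaseHTTPRequestHandler, HTTPServer

class Handler(BaseHTTPRequestHandler):
    def do_GET(self):
        body = b"Hello from Cloud Run!"
        self.send_response(200)
        self.send_header("Content-Type", "text/plain")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):
        pass  # keep per-request logging quiet in this demo

def serve():
    port = int(os.environ.get("PORT", "8080"))  # Cloud Run injects PORT
    HTTPServer(("", port), Handler).serve_forever()

if __name__ == "__main__":
    serve()
```

    Package that in any container image with a Python base, and a `gcloud run deploy` takes it from there.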

    Docs: Deploy container images, Deploy functions, Using gRPC, Invoke with an HTTPS request
    Code labs to try: Hello Cloud Run with Python, Getting Started with Cloud Run functions

    #2. You can run old AND new apps.

    This is where the power of containers shows up, and why many previous attempts at PaaS didn’t break through. It’s ok if a platform only supports new architectures and new apps. But then you’re accepting that you’ll need an additional stack for EVERYTHING ELSE.

    Cloud Run is a great choice because you don’t HAVE to start fresh to use it. Deploy from source in an existing GitHub repo or from cloned code on your machine. Maybe you’ve got an existing Next.js app sitting around that you want to deploy to Cloud Run. Run a headless CMS. Does your old app require local volume mounts for NFS file shares? Easy to do. Heck, I took a silly app I built 4 1/2 years ago, deployed it from Docker Hub, and it just worked.

    Of course, Cloud Run shines when you’re building new apps. Especially when you want fast experimentation with new paradigms. With its new GPU support, Cloud Run lets you do things like serve LLMs via tools like Ollama. Or deploy generative AI apps based on LangChain or Firebase Genkit. Build powerful web apps in Go, Java, Python, .NET, and more. Cloud Run’s clean developer experience and simple workflow makes it ideal for whatever you’re building next.

    Docs: Migrate an existing web service, Optimize Java applications for Cloud Run, Supported runtime base images, Run LLM inference on Cloud Run GPUs with Ollama
    Code labs to try: How to deploy all the JavaScript frameworks to Cloud Run, Django CMS on Cloud Run, How to run LLM inference on Cloud Run GPUs with vLLM and the OpenAI Python SDK

    #3. Use by itself AND as part of a full cloud solution.

    There aren’t many tech products that everyone seems to like. But folks seem to really like Cloud Run, and it regularly wins over the Hacker News crowd! Some classic PaaS solutions were lifestyle choices; you had to be all in. Use the platform and its whole way of working. Powerful, but limiting.

    You can choose to use Cloud Run all by itself. It’s got a generous free tier, doesn’t require complicated HTTP gateways or routers to configure, and won’t force you to use a bunch of other Google Cloud services. Call out to databases hosted elsewhere, respond to webhooks from SaaS platforms, or just serve up static sites. Use Cloud Run, and Cloud Run alone, and be happy.

    And of course, you can use it along with other great cloud services. Tack on a Firestore database for a flexible storage option. Add a Memorystore caching layer. Take advantage of our global load balancer. Call models hosted in Vertex AI. If you’re using Cloud Run as part of an event-driven architecture, you might also use built-in connections to Eventarc to trigger Cloud Run services when interesting things happen in your account—think a file uploaded to object storage, a user role deleted, or a database backup completed.

    Use it by itself or “with the cloud”, but either way, there’s value.

    Docs: Hosting webhooks targets, Connect to a Firestore database, Invoke services from Workflows
    Code labs to try: How to use Cloud Run functions and Gemini to summarize a text file uploaded to a Cloud Storage bucket

    #4. Choose simple AND sophisticated configurations.

    One reason PaaS-like services are so beloved is because they often provide a simple onramp without requiring tons of configuration. “cf push” to get an app to Cloud Foundry. Easy! Getting an app to Cloud Run is simple too. If you have a container, it’s a single command:

    rseroter$ gcloud run deploy go-app --image=gcr.io/seroter-project-base/go-restapi

    If all you have is source code, it’s also a single command:

    rseroter$ gcloud run deploy node-app --source .

    In both cases, the CLI asks me to pick a region and whether I want requests authenticated, and that’s it. Seconds later, my app is running.

    This works because Cloud Run sets a series of smart, reasonable default settings.

    But sometimes you do want more control over service configuration, and Cloud Run opens up dozens of possible settings. What kind of sophisticated settings do you have control over?

    • CPU allocation. Do you want CPU to be always on, or quit when idle?
    • Ingress controls. Do you want VPC-only access or public access?
    • Multi-container services. Add a sidecar.
    • Container port. The default is 8080, but set it to whatever you want.
    • Memory. The default value is 512 MiB per instance, but you can go up to 32 GiB.
    • CPU. It defaults to 1, but you can go below 1, or up to 8.
    • Healthchecks. Define startup or liveness checks that ping specific endpoints on a schedule.
    • Variables and secrets. Define environment variables that get injected at runtime. Same with secrets that get mounted at runtime.
    • Persistent storage volumes. There’s ephemeral scratch storage in every Cloud Run instance, but you can also mount volumes from Cloud Storage buckets or NFS shares.
    • Request timeout. The default value is 5 minutes, but you can go up to 60 minutes.
    • Max concurrency. A given service instance can handle more than one request. The default value is 80, but you can go up to 1000!
    • and much more!

    You can do something simple, you can do something sophisticated, or a bit of both.
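    To make the “variables and secrets” settings concrete, here’s a hedged sketch of how a service might read both at runtime: env vars arrive as ordinary process environment, and secrets mounted as volumes show up as files. The env var name and mount path are illustrative assumptions, not anything Cloud Run prescribes.

```python
# Sketch: consuming Cloud Run runtime configuration.
# Environment variables are ordinary process environment;
# secrets mounted as volumes appear as files on disk.
import os

def load_config(secret_path="/secrets/db-password"):  # path is illustrative
    config = {
        "greeting": os.environ.get("GREETING", "hello"),  # env var with a default
        "db_password": None,
    }
    if os.path.exists(secret_path):
        with open(secret_path) as f:
            config["db_password"] = f.read().strip()
    return config
```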

    Docs: Configure container health checks, Maximum concurrent requests per instance, CPU allocation, Configure secrets, Deploying multiple containers to a service (sidecars)
    Code labs to try: How to use Ollama as a sidecar with Cloud Run GPUs and Open WebUI as a frontend ingress container

    #5. Create public AND private services.

    One of the challenges with early PaaS services was that they just sat on the public internet. That’s no good once you get to serious, internal-facing systems.

    First off, Cloud Run services are public by default. You control the authentication level (anonymous access, or authenticated user) and need to explicitly set that. But the service itself is publicly reachable. What’s great is that this doesn’t require you to set up any weird gateways or load balancers to make it work. As soon as you deploy a service, you get a reachable address.

    Awesome! Very easy. But what if you want to lock things down? This isn’t difficult either.

    Cloud Run lets me specify that I’ll only accept traffic from my VPC networks. I can also choose to securely send messages to IPs within a VPC. This comes into play as well if you’re routing requests to a private on-premises network peered with a cloud VPC. We even just added support for adding Cloud Run services to a service mesh for more networking flexibility. All of this gives you a lot of control to create truly private services.
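    When calling a locked-down service, the caller proves its identity with a Google-signed ID token in the Authorization header. A hedged sketch, assuming the google-auth library and ambient service-account credentials; the service URL is whatever Cloud Run assigned you.

```python
# Sketch: invoking a private Cloud Run service from another workload.
# Cloud Run validates a Google-signed ID token whose audience is the
# service URL. fetch_id_token is google-auth's documented helper for this.
import urllib.request

def build_request(url, id_token):
    # Private Cloud Run services expect "Authorization: Bearer <ID_TOKEN>"
    return urllib.request.Request(
        url, headers={"Authorization": f"Bearer {id_token}"})

def call_private_service(url):
    # Requires the google-auth package and service-account credentials.
    import google.auth.transport.requests
    import google.oauth2.id_token
    token = google.oauth2.id_token.fetch_id_token(
        google.auth.transport.requests.Request(), url)  # audience = service URL
    return urllib.request.urlopen(build_request(url, token))
```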

    Docs: Private networking and Cloud Run, Restrict network ingress for Cloud Run, Cloud Service Mesh
    Code labs to try: How to configure a Cloud Run service to access an internal Cloud Run service using direct VPC egress, Configure a Cloud Run service to access both an internal Cloud Run service and public Internet

    #6. Scale to zero AND scale to 1.

    I don’t necessarily believe that cloud is more expensive than on-premises—regardless of some well-publicized stories—but keeping idle cloud services running isn’t helping your cost posture.

    Google Cloud Run truly scales to zero. If nothing is happening, nothing is running (or costing you anything). However, when you need to scale, Cloud Run scales quickly. Like, a-thousand-instances-in-seconds quickly. This is great for bursty workloads that don’t have a consistent usage pattern.

    But you probably also want an affordable way to keep a consistent pool of compute online to handle a steady stream of requests. No problem. Set the minimum instance count to 1 (or 2, or 10) and keep instances warm. And set concurrency high for apps that can handle it.

    If you don’t have CPU always allocated, but keep a minimum instance online, we actually charge you significantly less for that “warm” instance. And you can apply committed use discounts when you know you’ll have a service running for a while.

    Run bursty workloads or steadily-used workloads all in a single platform.

    Docs: About instance autoscaling in Cloud Run services, Set minimum instances, Load testing best practices
    Code labs to try: Cloud Run service with minimum instances

    #7. Do one-off deploys AND set up continuous delivery pipelines.

    I mentioned above that it’s easy to use a single command or single screen to get an app to Cloud Run. Go from source code or container to running app in seconds. And you don’t have to set up any other routing middleware or cloud networking to get a routable service.

    Sometimes you just want to do a one-off deploy without all the ceremony. Run the CLI, use the Console UI, and get on with life. Amazing.

    But if that was your only option, you’d feel constrained. So you can use something like GitHub Actions to deploy to Cloud Run. Most major CI/CD products support it.

    Another great option is Google Cloud Deploy. This managed service takes container artifacts and deploys them to Google Kubernetes Engine or Google Cloud Run. It offers some sophisticated controls for canary deploys, parallel deploys, post-deploy hooks, and more.

    Cloud Deploy has built-in support for Cloud Run. A basic pipeline (defined in YAML, but also configured via point-and-click in the UI if you want) might show three stages for dev, test, and prod.

    When the pipeline completes, we see three separate Cloud Run instances deployed, representing each stage of the pipeline.

    You want something more sophisticated? Ok. Cloud Deploy supports Cloud Run canary deployments. You’d use this if you want a subset of traffic to go to the new instance before deciding to cut over fully.

    This is taking advantage of Cloud Run’s built-in traffic management feature. When I check the deployed service, I see that after advancing my pipeline to 75% of production traffic for the new app version, the traffic settings are properly set in Cloud Run.

    Serving traffic in multiple regions? Cloud Deploy makes it possible to ship a release to dozens of places simultaneously. Here’s a multi-target pipeline. The production stage deploys to multiple Cloud Run regions in the US.

    When I checked Cloud Run, I saw instances in all the target regions. Very cool!

    If you want a simple deploy, do that with the CLI or UI. Nothing stops you. However, if you’re aiming for a more robust deployment strategy, Cloud Run readily handles it through services like Cloud Deploy.
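    As a rough illustration, the canary scenario above might be declared in a Cloud Deploy pipeline like this. This is a sketch based on the clouddeploy.yaml configuration schema; the pipeline and target names are invented.

```yaml
# clouddeploy.yaml (sketch): three-stage pipeline with a canary on prod
apiVersion: deploy.cloud.google.com/v1
kind: DeliveryPipeline
metadata:
  name: web-app-pipeline   # invented name
serialPipeline:
  stages:
  - targetId: dev
  - targetId: test
  - targetId: prod
    strategy:
      canary:
        runtimeConfig:
          cloudRun:
            automaticTrafficControl: true  # let Cloud Deploy shift Cloud Run traffic
        canaryDeployment:
          percentages: [25, 75]  # advance through 25%, then 75%, then 100%
```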

    Docs: Use a canary deployment strategy, Deploy to multiple targets at the same time, Deploying container images to Cloud Run
    Code labs to try: How to Deploy a Gemini-powered chat app on Cloud Run, How to automatically deploy your changes from GitHub to Cloud Run using Cloud Build

    #8. Own aspects of security AND offload responsibility.

    One reason you choose managed compute platforms is to outsource operational tasks. It doesn’t mean you’re not capable of patching infrastructure, scaling compute nodes, or securing workloads. It means you don’t want to, and there are better uses of your time.

    With Cloud Run, you can drive aspects of your security posture, and also let Cloud Run handle key aspects on your behalf.

    What are you responsible for? You choose an authentication approach, including public or private services. This includes control of how you want to authenticate developers who use Cloud Run. You can authenticate end users, internal or external ones, using a handful of supported methods.

    It’s also up to you to decide which service account the Cloud Run service instance should impersonate. This controls what a given instance has access to. If you want to ensure that only containers with verified provenance get deployed, you can also choose to turn on Binary Authorization.

    So what are you offloading to Cloud Run and Google Cloud?

    You can outsource protection from DDoS and other threats by turning on Cloud Armor. The underlying infrastructure beneath Cloud Run is completely managed, so you don’t need to worry about upgrading or patching any of that. What’s also awesome is that if you deploy Cloud Run services from source, you can sign up for automatic base image updates. This means we’ll patch the OS and runtime of your containers. Importantly, it’s still up to you to patch your app dependencies. But this is still very valuable!

    Docs: Security design overview, Introduction to service identity, Use Binary Authorization, Configure automatic base image updates
    Code labs to try: How to configure a Cloud Run service to access an internal Cloud Run service using direct VPC egress, How to connect a Node.js application on Cloud Run to a Cloud SQL for PostgreSQL database

    #9. Treat as post-build target AND as upfront platform choice.

    You might just want a compute host for your finished app. You don’t want to have to pick that host up front, and just want a way to run your app. Fair enough! There aren’t “Cloud Run apps”; they’re just containers. That said, there are general tips that make an app more suitable for Cloud Run than not. But the key is, for modern apps, you can often choose to treat Cloud Run as a post-build decision.

    Or, you can design with Cloud Run in mind. Maybe you want to trigger Cloud Run based on a specific Eventarc event. Or you want to capitalize on Cloud Run concurrency so you code accordingly. You could choose to build based on a specific integration provided by Cloud Run (e.g. Memorystore, Firestore, or Firebase Hosting).

    There are times that you build with the target platform in mind. In other cases, you want a general purpose host. Cloud Run is suitable for either situation, which makes it feel unique to me.

    Docs: Optimize Java applications for Cloud Run, Integrate with Google Cloud products in Cloud Run, Trigger with events
    Code labs to try: Trigger Cloud Run with Eventarc events

    #10. Rely on built-in SLOs, logs, metrics AND use your own observability tools.

    If you want it to be, Cloud Run can feel like an all-in-one solution. Do everything from one place. That’s how classic PaaS was, and there was value in having a tightly-integrated experience. From within Cloud Run, you have built-in access to logs, metrics, and even setting up SLOs.

    The metrics experience is powered by Cloud Monitoring. I can customize the event types, dashboards, time windows, and more. This even includes the ability to set up uptime checks that periodically ping your service and let you know if everything is ok.

    The embedded logging experience is powered by Cloud Logging and gives you a view into all your system and custom logs.

    We’ve even added an SLO capability where you can define SLIs based on availability, latency, or custom metrics. Then you set up service level objectives for service performance.

    While all these integrations are terrific, you don’t have to use only them. You can feed metrics and logs into Datadog. Same with Dynatrace. You can also write out OpenTelemetry or Prometheus metrics and consume them however you want.
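    One small, practical hook worth knowing: Cloud Run forwards whatever your container writes to stdout/stderr into Cloud Logging, and a single JSON object per line gets parsed into a structured log entry. A sketch using Cloud Logging’s documented special fields (“severity”, “message”); the extra field is arbitrary.

```python
# Sketch: structured logging on Cloud Run. Each line printed to stdout
# as one JSON object becomes a structured entry in Cloud Logging;
# "severity" and "message" are special fields it understands.
import json
import sys

def log(severity, message, **fields):
    entry = {"severity": severity, "message": message, **fields}
    print(json.dumps(entry), file=sys.stdout, flush=True)
    return entry
```

    Any extra fields you attach land in the entry’s payload, queryable from the Logs Explorer.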

    Docs: Monitor Health and Performance, Logging and viewing logs in Cloud Run, Using distributed tracing

    Kubernetes, virtual machines, and bare metal boxes all play a key role for many workloads. But you also may want to start with the highest abstraction possible so that you can focus on apps, not infrastructure. IMHO, Google Cloud Run is the best around and satisfies the needs of most any modern web app. Give it a try!

  • Daily Reading List – September 6, 2024 (#392)

    It was officially a 4-day workweek, but felt like a regular week. Lots going on, and plenty of things to do. But I greatly prefer that to the alternative! Have a great weekend, y’all.

    [blog] Serving Stable Diffusion with RayServe on GKE Autopilot. How would you make this text-to-image model available to other apps in your environment? William gives us a step by step for getting it going on Kubernetes.

    [blog] Coaching Feedback. I’m familiar with the SHARE model for giving feedback, but don’t always remember to use it. This is a good reminder to break it out more often.

    [blog] Google named a leader in the Forrester Wave: AI/ML Platforms, Q3 2024. You like us, you really like us. It’s cool to see Google as the only hyperscale cloud in the leader section.

    [blog] Getting 🍨 Ice Cream 🍦 Recommendations at Scale with Gemini, Embeddings, and Vector Search. Alok really likes ice cream. He’s also great at AI/ML and helps us understand the role of embeddings in creating a recommendation engine.

    [article] Cycle Time. Most of you are trying to shrink the time it takes to go from idea to working software in production. But what activity starts the “cycle time” clock? And when is the software considered “shipped”?

    [blog] Securing Generative AI: Defending Against Prompt Injection. I thought this was good advice for a problem most of us hadn’t even thought much about yet.

    [guide] Enterprise application with Oracle Database on Compute Engine. There’s more going on than just AI stuff. Here’s a good new guide on hosting a highly available app that depends on Oracle databases. All on VMs.

    [blog] Gemma explained: PaliGemma architecture. This is an open vision-language model that produces a text response from image or text input.

    [article] InfoQ AI, ML and Data Engineering Trends Report – September 2024. What’s “late majority” versus “early adopters” in this fast moving space? Here’s one lens on it.

    Want to get this update sent to you every day? Subscribe to my RSS feed or subscribe via email below:

  • Daily Reading List – September 5, 2024 (#391)

    It was a good day. I had productive meetings, one epiphany, and a chance to write. In the reading list below, you’ll find some tech deep dives, but also a few pieces that’ll help you with strategic thinking.

    [blog] Using Node-based pricing on GKE Autopilot. Fully managed Kubernetes is a good deal. William talks about the couple of ways (pod based, node based) to pay, and how the new Custom Compute Class gives you a very flexible way to define workload priorities.

    [article] TikTok Releases Tool to Improve Monorepo Performance. Google famously has a monorepo, but there’s work to be done to make it usable for every developer. This article explores a new tool that helps devs pull subsets of files.

    [blog] Building LLMs from the Ground Up: A 3-hour Coding Workshop. Folks are learning AI from lots of places, including YouTube. Great video here from Sebastian.

    [blog] Looks Matter (When It Comes to Software Products). Good product design isn’t a nice-to-have; it’s critical for long term success.

    [blog] Best practices for cost-efficient Kafka clusters. Lots of details here, whether you’re self-hosting or using a managed environment for event stream processing.

    [blog] AI Security frameworks in a nutshell — Part 1. Sita does a great job looking into the industry and government-led frameworks that matter for security folks. Also check out part 2.

    [blog] From Idea to Reality: Building the Instant Web with Gemini (Part 1). How can you go from wireframe to website with the help of an LLM? Thu Ya has a good example of the iterative process.

    [blog] Why “AI” projects fail. The amusing rant here claims that AI projects fail because folks “do AI” to avoid the harder work of identifying and fixing real problems.

    [blog] Telemetry in Go 1.23 and beyond. I like the transparency and insights provided by the Go team related to user-provided telemetry about Go usage.

    [blog] How to consistently output JSON with the Gemini API using controlled generation. Gemini was among the first LLMs to offer structured output in JSON, and this post explains why it matters and how to use it.

  • Daily Reading List – September 4, 2024 (#390)

    I can’t come up with any interesting intro today, so I asked Gemini for a joke about open source software. “Why did the open source software go to therapy? It had too many unresolved issues.” AI isn’t taking my job any time soon.

    [article] New LLM Pre-training and Post-training Paradigms. What sorts of pre-training and post-training is available to LLMs? And how do leading open models employ (or not employ) these approaches? Great writeup.

    [guide] Select a managed container runtime environment. Which type of managed compute service makes sense for your next app? This new architecture guide may help you decide.

    [article] What’s Behind Elastic’s Unexpected Return to Open Source? More on this somewhat-surprising move to make Elasticsearch more open again.

    [blog] What are the most common bugs in LLM-generated code? It’s good to see and digest this. And it reinforces my belief that you should know how to code before depending too heavily on these AI assisted tools.

    [article] Why We Shouldn’t Romanticize Failure. Ah, maybe we shouldn’t be so quick to crave a “fail fast” and “celebrate failure” culture? It sounds like we over-estimate our resilience.

    [blog] BigQuery and Anthropic’s Claude: A powerful combination for data-driven insights. There’s some nice integration here between a great LLM and a terrific analytics platform.

    [article] What to Do When You Know More Than Your Boss. You should know more than your boss in many areas. This is an article about knowledge sharing.

    [blog] A retryable JUnit 5 extension for flaky tests. If you’re starting to invoke LLMs in your apps, you might want to rethink your testing strategy. Guillaume wanted retry-able tests to account for non-deterministic responses.

  • Daily Reading List – September 3, 2024 (#389)

    I had a good 3-day weekend and will now struggle all week to remember what day it actually is. Today’s reading list offers some controversy (“founder mode!”), survey data (“Python users!”), and a little intrigue (“web3 heists!”).

    [blog] Measuring meaningful availability / uptime of Wise. How does availability differ from uptime, and how does this fintech company look at reporting those values? Educational read.

    [blog] Founder Mode. This one had the socials buzzing over the weekend, with lots of contrasting takes. I liked it. “Manager mode” versus “founder mode” can even apply within existing companies; you see the creators of teams or divisions choose either path.

    [blog] Mastering Controlled Generation with Gemini 1.5: Schema Adherence for Developers. You want a blueprint for how responses come back from an LLM? Gemini has offered JSON mode for a while, and this lets you define a schema for the LLM response.

    [blog] Svelte adoption guide: Overview, examples, and alternatives. I hear good things about this frontend framework, and this is a big post with tons of details.

    [blog] Your ultimate guide to the latest in generative AI on Vertex AI. You don’t need to follow every announcement in tech. I mean, you’re reading this daily post, so you’re probably fairly up to date. But, I like these sorts of recap blogs that give you a single place to catch up.

    [site] Python Developers Survey 2023 Results. Some fresh survey results here which convey insights into developer choices in frameworks, tools, clouds, and more.

    [blog] DeFied Expectations — Examining Web3 Heists. I know there are books about this topic, but wow. There’s a lot of money and a lot of creative attacks at play.

    [paper] Generative Verifiers: Reward Modeling as Next-Token Prediction. New paper from DeepMind that looks at training models as verifiers of LLM responses.

    [blog] Flutter Vs React Native : Performance Benchmarks you can’t miss ! 🔥⚡️ [Part -1]. I haven’t seen many benchmarks like this, and it’s useful to see where each framework lines up on head-to-head comparisons.

  • Daily Reading List – August 30, 2024 (#388)

    If you’re in the US, there’s a three-day weekend in front of you. Enjoy it!

    [article] How to Craft a Memorable Message, According to Science. We forget most of what we hear or come across. Can you ensure folks don’t forget what you tell them? Read this.

    [blog] Debate over “open source AI” term brings new push to formalize definition. It’s good to have an agreed-upon definition of open source AI, as folks are using that term to refer to models that don’t fit the traditional open source definition.

    [article] What a day in the life of a Technical Writer in the energy industry looks like — Guest post by Bonnie Denham. I see our tech writers in action every day, but this is a good look at the types of activities the role might entail.

    [blog] Feature Flags are more than just Toggles. There are many ways to implement feature flags, and Derek encourages us to think more broadly than just conditional statements in code.

    [blog] Google Cloud launches Memorystore for Valkey, a 100% open-source key-value service. Life finds a way. When previously open products get license changes, new options emerge. Valkey is a solid alternative to Redis, and I’m glad we’re offering it as a managed service.

    [blog] Top Five Platform Engineering Books for 2024. Getting your team to think about a platform engineering approach? These books can put you in the right frame of mind.

  • Daily Reading List – August 29, 2024 (#387)

    Big reading list today! It includes some tech dives, inspiring text-to-image AI examples, and some strong opinions about JavaScript frameworks and software estimation.

    [blog] What is the Kubernetes “Claim” model? File this under “things you don’t HAVE to know, but are useful nuggets to store away.” Brian provides context into what these “requests” into Kubernetes mean.

    [blog] Long context prompting tips. Small tips from Anthropic, but potentially very impactful ones.

    [article] How Platform Engineering Enables the 10,000-Dev Workforce. Big post, but lots of good coverage of this topic. Why do platform engineering? How do you measure impact?

    [article] Generative AI coding startup Magic lands $320M investment from Eric Schmidt, Atlassian and others. AI coding tools are so hot right now! Magic raised a ton. And announced they are training models on Google Cloud.

    [blog] A Java Language Cumulative Feature Rollup. If you haven’t checked out Java since version 8, you’ll like this recap of everything important that’s happened since then.

    [blog] Gemma explained: RecurrentGemma architecture. This isn’t the “standard” LLM architecture, and is worth reading about.

    [blog] Building Out 🍨 Ice Cream 🍦 Product Assets at Scale with Gemini. Come up with creative product descriptions and summarize reviews with AI. Fun demo.

    [article] Is Your Organizational Transformation Veering Off Course? How do you navigate those turning points to get things back on track? There’s good advice here.

    [blog] Get more photorealistic with Imagen 3. I’m still wow-ed by AI-powered image generation, especially those of living creatures. This post shows some remarkable results.

    [blog] A developer’s guide to getting started with Imagen 3 on Vertex AI. Here’s some useful advice on prompting text to image models like Imagen 3.

    [article] Developers Rail Against JavaScript ‘Merchants of Complexity’. The use of frameworks is a topic that can spark wildly different opinions. This piece shares skepticism of the value.

    [blog] Software estimates have never worked and never will. It’s always been “confident guessing”, especially for any estimate longer than 2 days. DHH says to look at “budgets” instead.

    [blog] Elasticsearch is Open Source, Again. I think this is the first of the “switched our open license to closed” vendors to actually go back to something more open. Kudos!

  • Daily Reading List – August 28, 2024 (#386)

    I paid for yesterday’s light-meeting day with a heavy-meeting day today. Well-played, calendar gods. But I also read some great content, and even had time for some quick demos about AI-generated data insights and attached volumes on a serverless app.

    [blog] New in Gemini: Custom Gems and improved image generation with Imagen 3. Here’s a good update for those that want personalized assistants, or some premium image generation.

    [blog] Gemini Chat App. Simon used Claude to write a small app that uses our latest Gemini 1.5 model versions. He also opines that people who don’t see value in using AI assistance for programming are missing something.

    [article] Why Cynics Are Less Likely to Succeed. It’s not hard to be cynical, but operating in a mode of trust and cooperation is not just good for your mental well-being, it’s better for your career.

    [blog] What is the Open Source Alternative to CockroachDB? When license changes happen, other vendors/projects jump in. Denis at Yugabyte offers up a case for using their database as a drop-in replacement.

    [blog] Building an AI-Powered CLI with Golang and Google Gemini. I’m definitely seeing more organic usage of the Gemini models. I guess that’s what happens when “quality models” meets “wildly generous free tier.”

    [blog] Managing Angular. This is a high level view from the product lead for the popular JavaScript framework. OSS management is quite the job, whether you’re solo or working inside a big tech firm.

    [article] Speak, Code, Deploy: What if voice was your primary tool for coding? I dunno. I’m hoping speech-to-text and chatbots are transient interfaces with AI. At least for the masses that don’t need that for accessibility reasons. I personally don’t want to talk to my computer, or be stuck “chatting” to get my work done.

    [blog] Get started with the new generally available features of Gemini in BigQuery. *This* is how I want my AI. Melted into the products I use. These are great features for smarter analytics.

    [article] Applying AI to the SDLC: New Ideas and Gotchas! – Leveraging AI to Improve Software Engineering. Good talk and transcript for those considering a more delivery-wide view of AI assistance in their software teams.

  • Daily Reading List – August 27, 2024 (#385)

    I had a very light meeting day today, which messed with my head. But, it was great to answer all my email, write a blog post, do some research, and work on upcoming presentations.

    [blog] Routines and habit stacking. Tom looks at incorporating goals into current routines, and piggybacking on existing success.

    [blog] Level up your codebase with Gemini’s long context window in Vertex AI. I love this example from Karl. He shows us exactly how to take a large codebase and use Gemini to send prompts like “provide a getting started guide” and “implement this feature.”

    [article] Does Market Share Still Matter? Do “market leaders” have the most efficiency, market power, and quality? Or do highly digital firms have similar profitability to the market leaders? Interesting research.

    [article] Profitable on day one! What does it even mean to be “profitable”? Jason encourages us to use the term correctly.

    [blog] A Year of Project IDX. If you haven’t checked this out, at least give it a scan. IDX is an interesting developer environment, and I’ve used it to build a few apps.

    [blog] What conditions make developers thrive most? This looks at recent research about where devs don’t just perform, but thrive. Four key dimensions come into play.

    [blog] Friction Logs. This is about the process of using products, recording the experience and papercuts that come with it, and sending that feedback to those who fix it.

    [article] Why AI can’t spell ‘strawberry’. Such an interesting problem! I tried this scenario with the latest Gemini Flash models we released today, and it did indeed answer correctly.

    [blog] How DoorDash is pushing experimentation boundaries with interleaving designs. Sophisticated stuff, but looks like a useful strategy for getting better signals earlier.

  • 4 ways to pay down tech debt by ruthlessly removing stuff from your architecture

    4 ways to pay down tech debt by ruthlessly removing stuff from your architecture

    What advice do you get if you’re lugging around a lot of financial debt? Many folks will tell you to start purging expenses. Stop eating out at restaurants, go down to one family car, cancel streaming subscriptions, and sell unnecessary luxuries. For some reason, I don’t see the same aggressive advice when it comes to technical debt. I hear soft language around “optimization” or “management” versus assertive stances that take a meat cleaver to your architectural excesses.

    What is architectural debt? I’m thinking about bloated software portfolios where you’re carrying eight products in every category. Brittle automation that only partially works and still requires manual workarounds and black magic. Unique customizations to packaged software that’s now keeping you from being able to upgrade to modern versions. Also half-finished “ivory tower” designs where the complex distributed system isn’t fully in place, and may never be. You might have too much coupling, too little coupling, unsupported frameworks, and all sorts of things that make deployments slow, maintenance expensive, and wholesale improvements impossible.

    This stuff matters. The latest Stack Overflow developer survey shows that the most common frustration is the “amount of technical debt.” It’s wasting up to eight hours a week for each developer! Numbers two and three relate to stack complexity. Your code and architectural tech debt is slowing down your release velocity, creating attrition among your best employees, and limiting how much you can invest in new tech areas. It’s well past time to simplify by purging architecture components that have built up (and calcified) over time. Let’s write bigger checks to pay down this debt faster.

    Explore these four areas, all focused on simplification. There are obvious tradeoffs and costs with each suggestion, but you’re not going to make meaningful progress by being timid. Note that there are other dimensions to fixing tech debt besides simplification, but it’s the one I see discussed least often. I’ll use Google Cloud to offer some examples of how you might specifically tackle each, given we’re the best cloud for those making a firm shift away from legacy tech debt.

    1. Stop moving so much data around.

    If you zoom out on your architecture, how many components do you have that get data from point A to point B? I’d bet that you have lots of ETL pipelines to consolidate data into a warehouse or data lake, messaging and event processing solutions to shunt data around, and even API calls that suck data from one system into another. That’s a lot of machinery you have to create, update, and manage every day.

    Can you get rid of some of this? Can you access more of the data where it rests, versus copying it all over the place? Or use software that acts on data in different ways without forcing you to migrate it for further processing? I think so.

    Let’s see some examples.

    Perform analytical queries against data sitting in different places? Google Cloud supports that with BigQuery Omni. We run BigQuery in AWS and Azure so that you can access data at rest, and not be forced to consolidate it in a single data lake. Here, I have an Excel file sitting in an Azure blob storage account. I could copy that data over to Google Cloud, but that’s more components for me to create and manage.

    Rather, I can set up a pointer to Azure from within BigQuery, and treat it like any other table. The data is processed in Azure, and only summary info travels across the wire.

    You might say “that’s cool, but I have related data in another cloud, so I’d have to move it anyway to do joins and such.” You’d think so. But we also offer cross-cloud joins with BigQuery Omni. Check this out. I’ve got that employee data in Azure, but timesheet data in Google Cloud.

    With a single SQL statement, I’m joining data across clouds. No data movement required. Less debt.
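    As a sketch of what that cross-cloud join can look like (the dataset, table, and column names here are hypothetical):

```sql
-- `azure_ds.employees` stands in for a BigQuery Omni external table over
-- Azure Blob Storage; `hr.timesheets` is a native BigQuery table.
SELECT
  e.employee_id,
  e.name,
  SUM(t.hours) AS total_hours
FROM azure_ds.employees AS e   -- data stays in Azure
JOIN hr.timesheets AS t        -- data lives in Google Cloud
  ON e.employee_id = t.employee_id
GROUP BY e.employee_id, e.name;
```

    BigQuery treats the Azure-backed table like any other, so the join is just standard SQL.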

    Enrich data in analytical queries from outside databases? You might have ETL jobs in place to bring reference data into your data warehouse to supplement what’s already there. That may be unnecessary.

    With BigQuery’s Federated Queries, I can reach live into PostgreSQL, MySQL, Cloud Spanner, and even SAP Datasphere sources. Access data where it rests. Here, I’m using the EXTERNAL_QUERY function to retrieve data from a Cloud SQL database instance.

    I could use that syntax to perform joins, and do all sorts of things without ever moving data around.
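    For instance, a federated join might look like this sketch (the connection ID and table names are made up; EXTERNAL_QUERY itself is the real BigQuery function):

```sql
-- The inner statement runs against the Cloud SQL instance identified by
-- the connection; only the result rows come back to BigQuery.
SELECT o.order_id, o.total, c.region
FROM sales.orders AS o
JOIN EXTERNAL_QUERY(
  'my-project.us.my-cloudsql-connection',
  'SELECT customer_id, region FROM customers;'
) AS c
  ON o.customer_id = c.customer_id;
```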

    Perform complex SQL analytics against log data? Does your architecture have data copying jobs for operational data? Maybe to get it into a system where you can perform SQL queries against logs? There’s a better way.

    Google Cloud Log Analytics lets you query, view, and analyze log data without moving it anywhere.
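    As a rough sketch (the project name is hypothetical; `_AllLogs` is the Log Analytics view over the default log bucket), a query might look like:

```sql
-- Count the last day's log entries by severity, directly over the log view.
SELECT severity, COUNT(*) AS entries
FROM `my-project.region-us._Default._AllLogs`
WHERE timestamp > TIMESTAMP_SUB(CURRENT_TIMESTAMP(), INTERVAL 1 DAY)
GROUP BY severity
ORDER BY entries DESC;
```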

    You can’t avoid moving data around. It’s often required. But I’m fairly sure that through smart product selection and some redesign of the architecture, you could eliminate a lot of unnecessary traffic.

    2. Compress the stack by removing duplicative components.

    Break out the chainsaw. Do you have multiple products for each software category? Or too many fine-grained categories full of best-of-breed technology? It’s time to trim.

    My former colleague Josh McKenty used to say something along the lines of “if it’s emerging, buy a few; if it’s mature, no more than two.”

    You don’t need a dozen project management software products. Or more than two relational database platforms. In many cases, you can use multi-purpose services and embrace “good enough.”

    There should be a fifteen-day cooling off period before you buy a specialized vector database. Just use PostgreSQL. Or any number of existing databases that now support vector capabilities. Maybe you can even skip RAG-based solutions (and infrastructure) altogether for certain use cases and just use Gemini with its long context window.

    Do you have a half-dozen different event buses and stream processors? Maybe you don’t need all that? Composite services like Google Cloud Pub/Sub can be a publish/subscribe message broker, apply a log-like approach with a replay-able stream, and do push-based notifications.

    You could use Spanner Graph instead of a dedicated graph database, or Artifact Registry as a single place for OS and application packages.

    I’m keen on the new continuous queries for BigQuery where you can do stream analytics and processing as data comes into the warehouse. Enrich data, call AI models, and more. Instead of a separate service or component, it’s just part of the BigQuery engine. Turn off some stuff?

    I suspect that this one is among the hardest for folks to act upon. We often hold onto technology because it’s familiar, or even because of misplaced loyalty. But be bold. Simplify your stack by getting rid of technology that’s no longer differentiated. Make a goal of having 30% fewer software products or platforms in your architecture in 2025.

    3. Replace hyper-customized software and automation with managed services and vanilla infrastructure.

    Hear me out. You’re not that unique. There are a handful of things that your company does which are the “secret sauce” for your success, and the rest is the same as everyone else.

    More often than not, you should be fitting your team to the software, not your software to the team. I’ve personally configured and extended packaged software to a point that it was unrecognizable. For what? Because we thought our customer service intake process was SO MUCH different than anyone else’s? It wasn’t. So much tech debt happens because we want to shape technology to our existing requirements, or we want to avoid “lock-in” by committing to a vendor’s way of doing things. I think both are misguided.

    I read a lot of annual reports from public companies. I’ve never seen “we slayed at Kubernetes this year” called out. Nobody cares. A cleverly scripted, hyper-customized setup that looks like the CNCF landscape diagram is more boat anchor than accelerator. Consider switching to a fully automated, managed cluster like GKE Autopilot. Pay per pod, and get automatic upgrades, secure-by-default configurations, and a host of GKE Enterprise features to create sameness across clusters.

    Or thank-and-retire that customized or legacy workflow engine (code framework, or software product) that only four people actually understand. Use a nicely API-enabled managed product with useful control-flow actions, or a full-fledged cloud-hosted integration engine.

    You probably don’t need a customized database, caching solution, or even CI/CD stack. These are all super mature solution spaces, where whatever is provided out of the box is likely suitable for what you really need.

    4. Tone it down on the microservices and distributed systems.

    Look, I get excited about technology and want to use all the latest things. But it’s often overkill, especially in the early (or late) stages of a product.

    You simply don’t need a couple dozen serverless functions to serve a static web app. Simmer down. Or a big complex JavaScript framework when your site has a pair of pages. So much technical debt comes from over-engineering systems to use the latest patterns and technology, when the classic ones will do.

    Smash most of your serverless functions back into an “app” hosted in Cloud Run. Fewer moving parts, and all the agility you want. Use vanilla JavaScript where you can. Use small, geo-located databases until you MUST do cross-region or global replication. Don’t build “developer platforms” and IDPs until you actually need them.
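    To make that concrete, here’s a minimal sketch (Python standard library only; the route names and handler logic are hypothetical) of several former single-purpose functions collapsed into one HTTP app that Cloud Run could host:

```python
# Several formerly separate "serverless functions" become routes in one app.
import json
import os
from urllib.parse import parse_qs
from wsgiref.simple_server import make_server


def resize(params):
    """Formerly a standalone 'resize image' function."""
    return {"op": "resize", "width": int(params.get("w", "100"))}


def receipt(params):
    """Formerly a standalone 'send receipt' function."""
    return {"op": "receipt", "order": params.get("order", "")}


# One dispatch table instead of N separately deployed functions.
ROUTES = {"/resize": resize, "/receipt": receipt}


def app(environ, start_response):
    """A plain WSGI app: Cloud Run will run any HTTP server in a container."""
    params = {k: v[0] for k, v in parse_qs(environ.get("QUERY_STRING", "")).items()}
    handler = ROUTES.get(environ.get("PATH_INFO", "/"))
    if handler is None:
        start_response("404 Not Found", [("Content-Type", "text/plain")])
        return [b"not found"]
    start_response("200 OK", [("Content-Type", "application/json")])
    return [json.dumps(handler(params)).encode()]


if __name__ == "__main__":
    # Cloud Run injects the PORT environment variable; default to 8080 locally.
    make_server("", int(os.environ.get("PORT", "8080")), app).serve_forever()
```

    Each former function is now just a route: one build, one deploy, one thing to monitor.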

    I’m not going all DHH on you, but most folks would be better off defaulting to more monolithic systems running on a server or two. We’ve all over-distributed too many services and created unnecessarily complex architectures that are now brittle or impossible to understand. If you need the scale and resilience of distributed systems RIGHT NOW then go build one. But most of us have gotten burned from premature optimization because we assumed that our system had to handle 100x user growth overnight.

    Wrap Up

    Every company has tech debt, whether the business is 100 years old or started last week. Google has it, big banks have it, governments have it, and YC companies have it. And “managing it” is probably a responsible thing to do. But sometimes, when you need to make a step-function improvement in how you work, incremental changes aren’t good enough. Simplify by removing the cruft, and take big cuts out of your architecture to do it!