I’m refreshed (and sunburned) after a long holiday weekend. Today wasn’t as packed with meetings as usual, so it was also great to crank out a blog post (in addition to this one), empty my inbox, and take some AI training. And get to read a lot, as you’ll see below.
[article] Arriving at ‘Hello World’ in enterprise AI. Top-down selling of AI into enterprises is a tough route. You also need bottom-up enthusiasm and wins to get real traction.
[blog] 10 Tools For Building MCP Servers. We’re going to overdo it on MCP servers, aren’t we? Seems inevitable. If you want to join the gold rush, here’s a list of frameworks that get you there faster.
[article] How has AI impacted engineering leadership in 2025? Good insights, although AI usage data collected in March is already woefully dated. And I wonder if we’re working off a common definition of “developer productivity.” Probably not.
[book] Agentic Design Patterns. My colleague is writing this book out in the open in a series of Google Docs. Anyone can view or offer suggestions. Terrific content!
You’ve vibe coded an app? Congrats. That’s cool. Maybe you’re completely new to programming, or you’ve been doing it awhile. It’s possible you’ve seen some of the horror stories online of folks vibe coding something, deploying it, and then watching it collapse as folks flooded and hacked it. Yikes. It’s important to pay attention to quality attributes. Sometimes called non-functional requirements, these are key aspects of a well-built software system. They include traits like usability, maintainability, security, reliability, scalability, availability, and flexibility. While you might expect (hope?) that your vibe coding service created software that always adheres to these concepts, you shouldn’t assume so. And either way, these are important characteristics to understand if you want to mature as a software creator.
You can get AI to help here, though. I used Google AI Studio and Gemini 2.5 Pro to generate some prompts you could apply after you’ve built an app using Bolt, Vercel’s v0, Lovable, or Gemini Canvas. Maybe you plug those generated prompts into the Gemini CLI after you’ve downloaded the code from your vibe coding session. Here’s my initial prompt, which you could take and adjust however you want:
I’ve vibe coded an app, and now want to make sure it’s of high enough quality to ship to production. I’m not a professional developer. Suggest 3-5 comprehensive and understandable LLM prompts I should now offer my agentic CLI to check for software quality attributes like usability, reliability, performance, observability, security, repeatability, scalability, availability, flexibility, and maintainability.
Below is the direct output from Gemini, including the proposed prompts that harden your software while making it more fault tolerant and scalable.
Here are 4 comprehensive prompts designed to be fed to your agentic CLI. They are structured to cover all the quality attributes you mentioned in a logical, step-by-step manner.
The Strategy: From Code to Cloud
We’ll use a layered approach. Think of it like building a house:
The Blueprint & Foundation: Is the code itself clean, organized, and easy to understand? (Maintainability, Flexibility)
The Walls & Locks: Is the application secure and can it handle errors gracefully? (Security, Reliability)
The Engine & Plumbing: Can the app perform under load and tell you when it’s sick? (Performance, Scalability, Observability)
The Assembly Line: Can you build and deploy it consistently every single time? (Repeatability, Availability, Scalability)
Prompt 1: The Code Quality & Maintainability Audit
Goal: This prompt focuses on cleaning up the existing “vibe code.” It makes your app easier for you (and your AI agent) to understand, modify, and extend in the future.
Attributes Covered: Maintainability, Flexibility, Repeatability (at the code level).
Act as a senior software engineer and mentor conducting a thorough code review of my entire project. Your goal is to improve its long-term maintainability, flexibility, and readability. Analyze the whole codebase and provide the following:
1. **Refactoring for Clarity:** Identify the top 5 functions or classes that are too long, complex, or have unclear names ("code smells"). For each, provide a refactored version that is simpler, follows the Single Responsibility Principle, and has clearer naming.
2. **Configuration & Secrets:** Scan the code for any hardcoded configuration values (like API endpoints, database strings, or magic numbers). Suggest extracting them into a centralized configuration file (e.g., `config.js`, `.env`, or `settings.py`) and provide the template for this file. Flag any plain-text secrets immediately.
3. **Dependency Review:** List all external libraries and dependencies. Point out any that are deprecated, have known major issues, or could be replaced by a more standard/modern alternative.
4. **Automated Quality Gates:** Generate a configuration file for a standard linter and code formatter for my project's language (e.g., `.eslintrc.json` and `.prettierrc` for JavaScript/TypeScript, or `pyproject.toml` for Python with Black and Ruff). This ensures future code stays clean.
5. **Documentation:** Generate a template for a `README.md` file that includes a project description, setup instructions for a new developer, and an explanation of the core project structure.
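If you're wondering what the "centralized configuration" suggestion in item 2 actually produces, here's a minimal sketch in TypeScript, assuming a Node app. The file name `config.ts` and variable names like `DATABASE_URL` are placeholders I picked, not something your vibe coding tool will output verbatim.

```ts
// config.ts - one audited place that reads environment variables,
// instead of hardcoding endpoints and secrets throughout the codebase.
// Pair it with a .env file (never committed) and a .env.example template.

function requireEnv(name: string): string {
  const value = process.env[name];
  if (!value) {
    throw new Error(`Missing required environment variable: ${name}`);
  }
  return value;
}

export const config = {
  apiBaseUrl: requireEnv("API_BASE_URL"),
  databaseUrl: requireEnv("DATABASE_URL"),
  // Optional values can fall back to a sensible default.
  port: Number(process.env.PORT ?? 8080),
};
```

The point is that every secret and endpoint lives in one reviewable spot, which is exactly what the prompt nudges the agent toward.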
Prompt 2: The Security & Reliability Hardening
Goal: This prompt switches focus to making your app robust against attacks and resilient to failure. It assumes the code is now cleaner from Prompt 1.
Act as a paranoid but helpful security and reliability engineer. Your mission is to identify and help me fix potential security vulnerabilities and sources of unreliability in my application. Analyze the entire codebase and provide a report with actionable code suggestions for the following:
1. **Security Vulnerability Scan (OWASP Top 10):**
   * **Input Validation:** Find all points where the application accepts user input (API endpoints, forms, etc.). Check for potential injection vulnerabilities (SQL, NoSQL, Command).
   * **Cross-Site Scripting (XSS):** Check if output to the user is properly sanitized or escaped.
   * **Authentication/Authorization:** Review how users are authenticated and how their permissions are checked. Look for common flaws.
   * **Insecure Dependencies:** Scan my `package.json`, `requirements.txt`, etc., for dependencies with known security vulnerabilities (CVEs) and suggest updated, secure versions.
2. **Error Handling & Reliability:**
   * Identify all critical code paths (e.g., database calls, external API requests, file I/O).
   * Pinpoint areas lacking proper error handling (e.g., missing `try...catch` blocks or unchecked errors).
   * For each area, suggest adding robust error handling that prevents the app from crashing and provides a clear error message or fallback.
3. **Availability Checkpoint:**
   * Suggest creating a simple health check endpoint (e.g., `/healthz` or `/status`). This endpoint should return a `200 OK` status if the app is running and can connect to its essential services (like the database). Provide the code for this endpoint.
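To make the "availability checkpoint" in item 3 concrete, here's roughly what I'd expect a generated endpoint to look like. This is a sketch that assumes an Express app and some `db` client with a cheap query method; your vibe-coded stack will differ.

```ts
import express from "express";
import { db } from "./db"; // hypothetical database client used by your app

const app = express();

// Health check: return 200 only if the process is up AND it can reach
// its essential dependencies (here, the database).
app.get("/healthz", async (_req, res) => {
  try {
    await db.query("SELECT 1"); // cheap round trip to verify connectivity
    res.status(200).json({ status: "ok" });
  } catch (err) {
    res.status(503).json({ status: "unavailable", error: String(err) });
  }
});
```

Load balancers and platforms like Cloud Run or Kubernetes can poll an endpoint like this to decide whether your instance should keep receiving traffic.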
Prompt 3: The Performance, Scalability & Observability Tune-Up
Goal: Now that the app is clean and secure, let’s make it fast and ensure you can see what’s happening inside it when it’s running.
Act as a Site Reliability Engineer (SRE) focused on performance and observability. Your goal is to ensure my application can handle growth and that I can diagnose problems in production. Analyze the codebase and suggest improvements in these areas:
1. **Performance Bottlenecks:**
   * **Database Queries:** Identify any database queries performed inside loops (N+1 query problem). Suggest how to optimize them into a single, more efficient query.
   * **Heavy Computations:** Find any computationally expensive operations or inefficient algorithms that could block the main thread or slow down responses. Suggest optimizations or asynchronous execution.
   * **Data Handling:** Look for places where the app loads very large amounts of data into memory at once. Suggest using pagination, streaming, or chunking.
2. **Observability - Logging & Metrics:**
   * **Structured Logging:** Review my current logging (or lack thereof). Propose a structured logging strategy (e.g., JSON format). Refactor 3-5 key `console.log` or `print` statements to use this new structured logger, including important context like user ID or request ID.
   * **Key Metrics:** Identify the 3 most important metrics for my application's health (e.g., API request latency, error rate, number of active users). Show me where and how to instrument the code to capture these metrics, even if it's just with a logging statement for now.
3. **Scalability Review:**
   * Identify anything that would prevent me from running multiple instances of this application (horizontal scaling). This usually involves checking for in-memory state that should be moved to a shared store like a database or Redis (e.g., session stores, caches, locks).
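Of everything in this prompt, the structured logging change in item 2 is often the quickest win. A minimal sketch in TypeScript, using plain `console.log` as the transport (a real setup might swap in a library like pino or winston):

```ts
// logger.ts - emit one JSON object per log line so your platform
// (Cloud Run, Kubernetes, etc.) can index, filter, and alert on fields.
type LogFields = Record<string, unknown>;

function log(level: "info" | "warn" | "error", message: string, fields: LogFields = {}) {
  console.log(
    JSON.stringify({
      severity: level,
      message,
      timestamp: new Date().toISOString(),
      ...fields, // context like requestId or userId goes here
    })
  );
}

// Before: console.log("payment failed for " + userId)
// After: a searchable event with the context attached
log("error", "payment failed", { userId: "u-123", requestId: "r-456", amountCents: 1999 });
```

Once logs carry a request ID, tracing a single user's path through your app gets dramatically easier.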
Bonus Prompt 4: The Production-Ready Deployment Blueprint
Goal: This prompt bridges the gap between your laptop and a real server. It makes your deployment process automatic, reliable, and repeatable.
Act as a DevOps specialist. My application is now clean, secure, and performant, but I need a professional and repeatable way to deploy it. Based on my project's language and framework, generate the necessary files and configurations to containerize it and set up a basic deployment pipeline.
1. **Containerization:** Create a multi-stage `Dockerfile`. The first stage should build the application (installing dev dependencies and compiling assets), and the final stage should be a lean, production-optimized image containing only the necessary runtime files and dependencies. Add comments explaining each step.
2. **Local Development Environment:** Create a `docker-compose.yml` file. This file should define my application's service and any other services it depends on (like a database or a cache). This will allow any developer to spin up the entire stack with a single command (`docker-compose up`).
3. **Configuration Management:** Create a `.env.example` file that lists all the environment variables the application needs to run (from Prompt 1), with placeholder values. This file serves as a template for production, staging, and development environments.
4. **Continuous Integration (CI):** Generate a basic CI pipeline configuration file for GitHub Actions (`.github/workflows/ci.yml`). This pipeline should trigger on every push to the main branch and automatically:
   * Install dependencies.
   * Run the linter and code formatter checks (from Prompt 1).
   * (If you have them) Run any automated tests.
   * Build the Docker image to ensure it works.
By running your app through these four prompts, you’ll have systematically addressed the most critical software quality attributes, transforming your “vibe-coded” project into something you can be confident shipping to production.
Is this fun to do? I dunno. Maybe not as much fun as watching an agentic service crafting software right before your eyes. But it’s the right thing to do.
[article] When a PM takes over engineering. I haven’t seen this talked about much, and there’s some good advice here for those coming into engineering management from product.
[blog] Most Effective Infrastructure as Code (IaC) Tools. There’s going to be some implicit bias here because Pulumi is doing the assessment, but I thought this was a fair look at the overall landscape.
[blog] New Gemini tools for students and educators. This is likely the domain where AI can be the most useful, and also the most dangerous. It’s fantastic as a learning and personalization tool. Unprecedented. It’s also made it easier to only learn how to prompt.
[blog] Agentic Coding Recommendations. Meaty set of recommendations here. We’re all learning right now, and what works for one person may not work for you. But study what folks are up to.
[blog] Vibe Learning is Underrated. Worthwhile read. AI makes learning new things feel less intimidating, as we can ask “dumb questions” or go on tangents without feeling guilty.
It was a fun day in Sunnyvale learning from the Redmonk folks. We talked about “developers” all day, and picked up some good insights. Before that, lots of reading.
[blog] Gen AI Evaluation Service — An Overview. I’ve mentioned here a few times that building skills in evals is a good idea. Mete spends time in this post exploring one eval service for models, tools, and agents.
[article] The AI-Native Software Engineer. My colleague Addy published a super playbook on incorporating AI deeply into the software engineering discipline. We published complementary posts at almost exactly the same time today.
[blog] Go should be more opinionated. The point here is about application layout and structure. I’ve definitely seen a few approaches, and not sure the language should dictate one. But, a smart default is a good idea.
[blog] Docs for AI agents. Here’s a good mishmash of thoughts about the docs agents use to plan and do their work on your codebase.
[blog] Why We Replaced Kafka with gRPC for Service Communication. Have you fallen in love with a tool/service and started using it for everything? We’ve all been there. I liked this look at switching from Kafka to gRPC for many use cases.
[blog] Everything I know about good system design. Patterns evolve, the tech changes, and what’s “good” doesn’t stay the same. But the items called out here represent some solid design truths.
The “what” and the “how” in software engineering occasionally change at the same time. Often, one triggers the other. The introduction of mainframes ushered in batch practices that capitalized on the scarcity of computing power. As the Internet took off, developers needed to quickly update their apps and Agile took hold. Mobile computing and cloud computing happened, and DevOps emerged shortly thereafter. Our current moment seems different as the new “what” and “how” are happening simultaneously, but independently. The “what” that’s hot right now is AI-driven apps. Today’s fast-developing “how” is AI-native software engineering. I’m seeing all sorts of teams adopt AI to change how they work. What are they doing that you’re not?
AI natives always start (or end) with AI. The team at Pulley says “the typical workflow involves giving the task to an AI model first (via Cursor or a CLI program) to see how it performs, with the understanding that plenty of tasks are still hit or miss.” Studying a domain or competitor? Start with Gemini Deep Research or another AI research service. Find yourself stuck in an endless debate over some aspect of design? While you argued, the AI natives built three prototypes with AI to prove out the idea. Googlers are using it to build slides, debug production incidents, and much more. You might say “but I used an LLM before and it hallucinated while generating code with errors in it.” Stop it, so do you. Update your toolchain! Anybody seriously coding with AI today is using agents. Hallucinations are mostly a solved problem with proper context engineering and agentic loops. This doesn’t mean we become intellectually lazy. Learn to code, be an expert, and stay in charge. But it’s about regularly bringing AI in at the right time to make an impact.
I use the Gemini CLI to:
– debug production incidents,
– activate and actuate changes on my CI/CD,
– do risk analysis of changes and postmortems,
– write slides and docs,
– run capacity planning estimates.
— Ramón Medrano Llamas (@rmedranollamas) June 26, 2025
AI natives switched to spec-driven development. It’s not about code-first. Heck, we’re practically hiding the code! Modern software engineers are creating (or asking AI for) implementation plans first. My GM at Google, Keith Ballinger, says he starts projects by “ask[ing] the tool to create a technical design (and save to a file like arch.md) and an implementation plan (saved to tasks.md).” Former Googler Brian Grant wrote a piece where he explained creating 8,000-character instructions that steered the agent towards the goal. Those folks at Pulley say that they find themselves “thinking less about writing code and more about writing specifications – translating the ideas in my head into clear, repeatable instructions for the AI.” These design specs have massive follow-on value. Maybe the spec is used to generate the requirements doc. Or the first round of product documentation. It might produce the deployment manifest, marketing message, and training deck for the sales field. Today’s best engineers are great at documenting intent that, in turn, spawns the technical solution.
I have been testing and using Gemini CLI in the last weeks. Here's what my setup looks like:
– For complex tasks, I *never* ask for code first. My initial prompt is to create a plan "Create a detailed implementation plan for [FEATURE, BUG]".
– Create multiple hierarchical GEMINI… pic.twitter.com/4brvJqv8nt
AI natives have different engineer and team responsibilities. With AI agents, you orchestrate. You remain responsible for every commit into main, but focus more on defining and “assigning” the work to get there. Legitimate work is directed to background agents like Jules. Or give the Gemini CLI the task of chewing through an analysis or starting a code migration project. Either way, build lots of the right tools and empower your agents with them. Every engineer is a manager now. And the engineer needs to intentionally shape the codebase so that it’s easier for the AI to work with. That means rule files (e.g. GEMINI.md), good READMEs, and such. This puts the engineer into the role of supervisor, mentor, and validator. AI-first teams are smaller, able to accomplish more, and capable of compressing steps of the SDLC and delivering better quality, faster. AI-native teams have “almost eliminated engineering effort as the current bottleneck to shipping product.”
AI Agents turn everyone into a manager. We thought the future of software was AI reviewing and editing our work. But in fact the opposite is true. The future of software is to kick off tasks to AI Agents and review, edit, and orchestrate their work. https://t.co/dAlgt3IlRD
There are many implications to all this. Quality is still paramount. Don’t create slop. But achieving the throughput, breadth, and quality your customers demand requires a leap forward in your approach. AI is overhyped and under-hyped at the same time, and it’s foolish to see AI as the solution to everything. But it’s objectively valuable to adopt a new approach. Many teams have already made the shift and have learned to continuously evaluate and incorporate new AI-first approaches. It’s awesome! If you’re ignoring AI entirely, you’re not some heroic code artisan; you’re just being unnecessarily stubborn and falling behind. Get uncomfortable, reassess how you work, and follow the lead of some AI-native pioneers blazing the trail.
Spent a small bit of time this weekend playing with using agents to build agents. How meta! Today, many of our engineering leads answered Gemini CLI questions in this Reddit AMA. What a wild time for builders.
[blog] How to Fix Your Context. Building on the previous items, how should you think smartly about creating and maintaining context used by an agent? Great post.
[blog] Go 1.25 interactive tour. These are always terrific posts. Anton publishes these assessments of Go releases by letting you test out the features right within the blog post. See his related (and interactive) look at the new JSON handling capabilities.
[blog] APIs Versioning. No new ground, but it’s a good topic to refresh yourself on from time to time. And a reminder for those who keep breaking APIs.
[blog] This Week in Open Source for June 27, 2025. I’m liking where this weekly update is going. It provides a broad look at the open source landscape and what’s happening.
[article] Why Software Migrations Fail: It’s Not the Code. I imagine that AI is going to “fix” parts of this, but it’s still a reminder that code updates aren’t the only thing you need to worry about when doing a migration project.
It’s the end of a busy week. I’m looking forward to a hopefully-quiet weekend with the family and a few good books. Take a breather too!
[article] How much does AI impact development speed? Good look at some new research into how much AI impacts developers and their pace. Interesting that seniority and prior usage didn’t change the outcome.
[blog] Veo 3: A Detailed Prompting Guide. This is absolutely fantastic. Get all the right phrases and jargon to use when prompting a text-to-video model like Veo 3. Gold!
[blog] Coding agents have turned a corner. It’s fine when goofballs like me use these AI tools and offer guidance, but I really value the advice from capital-E Engineers like Brian. Here, he offers his direction on how to work with coding agents.
[blog] The rise of “context engineering.” The idea here is making sure the LLM has the right information and tools it needs to accomplish its task. It’s a system approach, versus thinking in prompts alone.
Another wild day. I’m doing some research into what other people think modern, AI-driven coding looks like. May turn my findings into a blog post. Either way, a new work style is forming.
[blog] Choosing the Right Deployment Path for Your Google ADK Agents. Fantastic post from Ayo that explores three agent hosts with different value propositions. You’ll likely debate their three types of platforms, regardless of which cloud you use.
[blog] 6 ways to become a database pro with the Gemini CLI. It’s a mistake to lump these agentic CLIs into a “coding tools” bucket. You can do a lot more than code apps. Karl shows some great data-focused examples here.
[blog] What Diff Authoring Time (DAT) reveals about developer experience. What’s going on from that moment a developer makes their first edit, until a pull request gets created? How do we measure that and improve the experience? Here’s analysis of some recent research.
[article] How To Prepare Your API for AI Agents. If you actually have an API strategy, you’re already ahead of others. This article has some advice for what to focus on.
[blog] I don’t care if my manager writes code. Should engineering managers be committing code alongside their reports? No, that doesn’t seem very wise or sustainable. But I do want my management to deeply know the tech the team is using.
Yesterday morning, we took the wraps off one of the most interesting Google releases of 2025. The Gemini CLI is here, giving you nearly unlimited access to Gemini directly within the terminal. This is a new space, but there are other great solutions already out there. Why is this different? Yes, it’s good at multi-step reasoning, code generation, and creative tasks. Build apps, fix code, parse images, build slides, analyze content, or whatever. But what’s truly unique is that it’s fully open source, no cost to use, usable anywhere, and super extensible. Use Gemini 2.5 Pro’s massive context window (1M tokens), multimodality, and strong reasoning ability to do some amazing stuff.
Requirements? Have Node installed, and a Google account. That’s it. You get lots of free queries against our best models. You get more by being a cloud customer if you need it. Let’s have a quick look around, and then I’ll show you four prompts that demonstrate what it can really do.
The slash command shows me what’s available here. I can see and resume previous chats, configure the editor environment, leverage memory via context files like GEMINI.md, change the theme, and use tools. Choosing that option shows us the available tools such as reading files and folders, finding files and folders, performing Google searches, running Shell commands, and more.
I’m only scratching the surface here, so don’t forget to check out the official repo, docs, and blog post announcement. But now, let’s walk through four prompts that you can repeat to experience the power of the Gemini CLI, and why each is a big deal.
Prompt #1 – Do some research.
Software engineering is more than coding. You spend time researching, planning, and thinking. I want to build a new app, but I’m not sure which frontend framework I should use. And I don’t want stale answers from an LLM that was trained a year ago.
I’ve got a new research report on JavaScript frameworks, and also want to factor in web results. My prompt:
What JavaScript framework should I use to build my frontend app? I want something simple, standards-friendly, and popular. Use @report.pdf for some context, but also do a web search. Summarize the results in a way that will help me decide.
The Gemini CLI figured out which tools to use, successfully pulled the file into the prompt context, and started off on its work: searching the web and preparing results.
The results were solid. I got tradeoffs and analysis for three viable options. The summary was helpful, and I could have continued going back and forth with clarifying questions. For architects, team leaders, and engineers, having a research partner in the terminal is powerful.
Why was this a big deal? This prompt showed the use of live Google Search, local (binary) file processing, and in-context learning for devs. These tools are changing how I do quick research.
Prompt #2 – Build an app.
These tools will absolutely change how folks build, fix, change, and modernize software. Let’s build something new.
I fed in this prompt, based on my new understanding of relevant JavaScript frameworks.
Let’s build a calendar app for my family to plan a vacation together. It should let us vote on weeks that work best, and then nominate activities for each day. Use Vue.js for the JavaScript framework.
Now to be sure, we didn’t build this to be excellent at one-shot results. Instead, it’s purposely built for an interactive back-and-forth with the software developer. You can start it in --yolo mode to have it automatically proceed without asking permission to do things, and even with -b to run it headless, assuming no interactivity. But I want to stay in control here. So I’m not in YOLO mode.
I quickly got back a plan, and was asked if I wanted to proceed.
Gemini CLI also asks me about running Shell commands. I can allow it once, allow it always, or cancel. I like these options. It’s fun watching Gemini make decisions and narrate what it’s working on. Once it’s done building directories, writing code, and evaluating its results, the CLI even starts up a server so that I can test the application. The first draft was functional, but not attractive, so I asked for a cleanup.
The next result was solid, and I could have continued iterating on new features along with look and feel.
Why was this a big deal? This prompt showed iterative code development, important security (request permission) features, and more. We’ll also frequently offer to pop you into the IDE for further coding. This will change how I understand or bootstrap most of the code I work with.
Prompt #3 – Do a quick deploy to the cloud.
I’m terrible at remembering the syntax and flags for various CLI tools. The right git command or Google Cloud CLI request? Just hopeless. The Gemini CLI is my solution. I can ask for what I want, and the Gemini CLI figures out the right type of request to make.
We added MCP as a first-class citizen, so I plugged in the Cloud Run MCP server, as mentioned above. I also made this work without it, as the Gemini CLI figured out the right way to directly call the Google Cloud CLI (gcloud) to deploy my app. But MCP servers provide more structure and ensure a consistent implementation. Here’s the prompt I used to get this app deployed. Vibe deployment, FTW.
Ship this code to Cloud Run in us-west1 using my seroter-project-base project. Don’t create a Dockerfile or container, but just deploy the source files.
The Gemini CLI immediately recognizes that a known MCP tool can help, and shows me the tool it chose.
It got going, and shipped my code successfully to Cloud Run using the MCP server. But the app didn’t start correctly. The Gemini CLI noticed that by reading the service logs, and diagnosed the issue. We didn’t provide a reference for which port to listen on. No problem.
It came up with a fix, made the code changes, and redeployed.
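If you're curious what that kind of fix typically amounts to: Cloud Run tells the container which port to listen on through the `PORT` environment variable, so the server just needs to read it instead of hardcoding a value. Here's a sketch of the pattern, assuming an Express server; the exact change in your project will depend on your framework.

```ts
import express from "express";

const app = express();
app.get("/", (_req, res) => {
  res.send("vacation calendar");
});

// Cloud Run injects PORT at runtime; fall back to 8080 for local runs.
const port = Number(process.env.PORT ?? 8080);
app.listen(port, () => {
  console.log(`listening on ${port}`);
});
```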
Why was this a big deal? We saw the extensibility of MCP servers, and the ability to “forget” some details of exactly how other tools and CLIs work. Plus we observed that the Gemini CLI did some smart reasoning and resolved issues on its own. This is going to change how I deploy, and how much time I spend (waste?) deploying.
Prompt #4 – Do responsible CI/CD to the cloud.
The third prompt was cool and showed how you can quickly deploy to a cloud target, even without knowing the exact syntax to make it happen. I got it working with Kubernetes too. But can the Gemini CLI help me do proper CI/CD, even if I don’t know exactly how to do it? In this case I do know how to set up Google Cloud Build and Cloud Deploy, but let’s pretend I don’t. Here’s the prompt.
Create a Cloud Build file that would build a container out of this app code and store it in Artifact Registry. Then create the necessary Cloud Deploy files that defines a dev and production environment in Cloud Run. Create the Cloud Deploy pipeline, and then reference it in the Cloud Build file so that the deploy happens when a build succeeds. And then go ahead trigger the Cloud Build. Pay very careful attention for how to create the correct files and syntax needed for targeting Cloud Run from Cloud Deploy.
The Gemini CLI started by asking me for some info from my Google Cloud account (project name, target region) and then created YAML files for Cloud Build and Cloud Deploy. It also put together a CLI command to instantiate a Docker repo in Artifact Registry. Now, I know that the setup for Cloud Deploy working with Cloud Run has some specific syntax and formatting. Even with my above command, I can see that I didn’t get syntactically correct YAML in the skaffold file.
I rejected the request of the Gemini CLI to do a deployment, since I knew it would fail. Then I gave it the docs URL for setting up Cloud Run with Cloud Deploy and asked it to make a correction.
That Skaffold file doesn’t look correct. Take a look at the docs (https://cloud.google.com/deploy/docs/deploy-app-run), and follow its guidance for setting up the service YAML files, and referencing the right Skaffold version at the top. Show me the result before pushing a change to the Cloud Deploy pipeline.
Fortunately, the Gemini CLI can do a web fetch and process the latest product documentation. I did a couple of turns and got what I wanted. Then I asked it to go ahead and update the pipeline and trigger Cloud Build.
It failed at first because I didn’t have a Dockerfile, but after realizing that, it automatically created one and started the build again.
It took a few iterations of failed builds for the Gemini CLI to land on the right syntax. But it kept dutifully trying, making changes, and redeploying until it got it right. Just like I would have done if I were doing it myself!
After that back and forth a few times, I had all the right files, syntax, container artifacts, and pipelines going.
Some of my experiments went faster than others, but that’s the nature of these tools, and I still did this faster overall than I would have manually.
Why was this a big deal? This showcased some sophisticated file creation, iterative improvements, and Gemini CLI’s direct usage of the Google Cloud CLI to package, deploy, and observe running systems in a production-like way. It’ll change how confident I am doing more complex operations.
Background agents, orchestrated agents, conversational AI. All of these will play a part in how we design, build, deploy, and operate software. What does that mean to your team, your systems, and your expectations? We’re about to find out.
We’ve been building up to today’s launch of the Gemini CLI. There were some inevitable hiccups on launch day, but it’s fun to be part of teams that make things people like using. Give it a try!
[blog] Gemini CLI: your open-source AI agent. This is a huge deal. Open source, free to use, lightweight, and super extensible. This is another reason I think software engineering is changing forever. Press coverage here, here, here, and here.
[blog] Getting Started with Gemini CLI. Here’s a spot-on guide for installing and flexing some of the most important parts of our powerful new tool.