Daily Reading List – November 25, 2024 (#448)

Happy Monday. It’s a short week here in the States, and I seemed to fit a week’s worth of meetings into today. I’ll do reading lists through Wednesday!

[blog] Skip the RAG workflows with Gemini’s 2M context window and the Context Cache. You don’t have to skip them entirely, but there are definitely scenarios now where RAG is unnecessary.

[article] QCon San Francisco 2024 Day 1: Architectures, Rust, AI/ML for Engineers, Sociotech Resilience. QCons are probably the best non-vendor conferences in our industry. They grab the best speakers and have some of the smartest topics. Also check out the day 2 recap.

[blog] AlphaQubit tackles one of quantum computing’s biggest challenges. This probably has zero impact on your work or anything in your life today, but it’s still freakin’ cool.

[blog] Introducing the Model Context Protocol. It wouldn’t be surprising to see that we’re at a point in the AI ecosystem where open standards get proposed, and adopted.

[blog] AI-Powered Updates–Issue Grouping, Autofix, Anomaly Detection, and more. Look for vendor solutions that use AI to complement their core value prop. Sentry seems to be doing that.

[blog] Redacting sensitive information when using Generative AI models. How do you filter out sensitive data from your LLM API calls? Guillaume shows off one technique that works well.

[article] KPMG fuels Google Cloud practice with $100M investment. Smart folks over there at KPMG 🙂

[article] Start Presentations on the Second Slide. I like this advice. Too many folks spend excessive time on the setup and lose the audience before they get to the good stuff.

[blog] GoMLX: ML in Go without Python. Terrific post from Eli who shows that machine learning is expanding beyond it’s Python base.

[blog] Deno v. Oracle: Canceling the JavaScript Trademark. Popcorn, popped. This will be one to watch, and would benefit the industry if Deno gets their way.

[blog] Re-Invoke: Tool invocation rewriting for zero-shot tool retrieval. Here’s the latest from Google Research. It looks at how to get LLMs to retrieve the most relevant tools for a downstream agent to use.

[blog] Make IAM for GKE easier to use with Workload Identity Federation. Accessing cloud services from workloads running in Kubernetes? You don’t want to have to impersonate accounts, or embed credentials in the workload. Other options? This shows a great one.

Want to get this update sent to you every day? Subscribe to my RSS feed or subscribe via email below:

Comments

One response to “Daily Reading List – November 25, 2024 (#448)”

  1. […] Daily Reading List – November 25, 2024 (#448) (Richard Seroter) […]

Leave a reply to Dew Drop – November 26, 2024 (#4309) – Morning Dew by Alvin Ashcraft Cancel reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.