In this issue, April 28, 2026 View it in your browser.

Orchestrating AI, AI Agent Memory, Cassandra Upgrades, Vault 2.0, AWS Ends WorkMail, PDF Extraction, pnpm 11 RC, tRPC APIs, Kotlin Room 3.0, Slack Rebuild, EDA for Banking, Observability

InfoQ Certified Architect Program: Last chance to join the May online cohort (closes May 6).

When you are the most senior technical person on your team, honest feedback on your architecture is hard to find. This 5-week online working group led by Luca Mezzalira is a small cohort of senior architects pressure-testing each other's real decisions in a confidential setting. Enroll now.

 

Sponsored by YugabyteDB

Designing Data Layers for Agentic AI: Patterns for State, Memory, and Coordination at Scale - Sponsored by YugabyteDB

Designing Data Layers for Agentic AI: Patterns for State, Memory, and Coordination at Scale

AI agents create new architectural challenges: shared memory, cross-agent state, and auditability. This session explores data layer patterns —conversation state, knowledge persistence, coordination — and tradeoffs in consistency, latency, and cost at scale using AWS and YugabyteDB. Live Webinar, May 12th, 2026 — Save Your Seat.

https://res.infoq.com/podcasts/engineering-stable-secure-scalable-platforms/en/smallimage/the-infoq-podcast-logo-thumbnail-1775134657783.jpg

Engineering Stable, Secure and Scalable Platforms: A Conversation with Matthew Liste

In this podcast, Michael Stiefel spoke to Matthew Liste about building and managing software platforms. Platform services act as the basis for application development, and must always be stable, secure, and scalable. Scaling these systems is particularly difficult because unknown resource contention often causes them to break. (Podcast)

TOP AI, ML & Data Engineering NEWS HEADLINES

  1. Subagents in Gemini CLI Enable Task Delegation and Parallel Agent Workflows

  2. Designing Memory for AI Agents: inside Linkedin’s Cognitive Memory Agent

  3. Cloudflare Introduces Project Think: a Durable Runtime for AI Agents

Orchestrating Agentic and Multimodal AI Pipelines with Apache Camel

In this article, author Vignesh Durai discusses how agentic and multimodal AI systems can be engineered using Apache Camel and LangChain4j technologies. The key components in the solution include LLM-based reasoning, retrieval-augmented generation (RAG), and image classification. (Article)

Deepfakes, Disinformation, and AI Content Are Taking Over the Internet

Shuman Ghosemajumder explains how generative AI has transformed from a creative curiosity into a high-scale tool for disinformation and fraud. He shares insights on "Disinformation Automation," the fallacy of CAPTCHA in an AI world, and why engineering leaders must adopt zero-trust "cyber fusion" strategies to defend against automated attacks that mimic human behavior with chilling accuracy. (Presentation with transcript included)

Dynamic Moments: Weaving LLMs into Deep Personalization at DoorDash

Sudeep Das and Pradeep Muthukrishnan explain the shift from static merchandising to dynamic, moment-aware personalization at DoorDash. They share how LLMs generate natural-language "consumer profiles" and content blueprints, while traditional deep learning handles last-mile ranking. This hybrid approach allows the platform to adapt to short-lived user intent and massive catalog abundance. (Presentation with transcript included)

Sponsored by Eon

BigQuery at Scale - Sponsored by Eon

BigQuery Day May 19th: For Teams Running It. And the Leaders Asking About It.

Running BigQuery means fielding the same questions from leadership on repeat. Can we recover if something breaks? Are we ready for AI? Why is the bill growing? BigQuery Day is a free one-day event on May 19, presented by Google Cloud and Eon, where practitioners from Google, L.L.Bean, and more will walk through what's actually working in production. Register Now.

TOP DevOps NEWS HEADLINES

  1. Yelp Achieves Zero-Downtime Upgrade of over 1,000 Cassandra Nodes

  2. HashiCorp Vault 2.0 Marks Shift to IBM Lifecycle with New Identity Federation

  3. GitHub Acknowledges Recent Outages, Cites Scaling Challenges and Architectural Weaknesses

Grafana Rearchitects Loki with Kafka and Ships a CLI to Bring Observability into Coding Agent

At GrafanaCON 2026 in Barcelona, Grafana Labs announced Grafana 13 with the new Loki Kafka-backed architecture at the ingestion layer and the AI Observability in Grafana Cloud to monitor and evaluate AI systems in real time. In particular, the new CLI called GCX was announced, designed to surface Grafana Cloud data inside agentic development environments. (News)

Sponsored by Ably Realtime

AI UX that HTTP was never designed for - Sponsored by Ably Realtime

AI UX that HTTP was never designed for

Agents need streaming that resumes, sessions that persist, and state that survives a reconnect. Most stacks have the back-end handled. The transport layer between the agent and the user is where AI UX breaks. Discover the fix →

TOP Cloud NEWS HEADLINES

  1. AWS Ends WorkMail and Moves App Runner to Maintenance Mode

  2. Cloudflare Optimizes Edge Stack for High-Core CPUs Instead of Large Cache

  3. Cloudflare Sandboxes Reach General Availability, Giving AI Agents Persistent Isolated Environments

When a Cloud Region Fails: Rethinking High Availability in a Geopolitically Unstable World

Sovereign fault domains are failure boundaries defined by legal, political, or physical jurisdiction rather than hardware topology. The article maps geopolitical events to known distributed-systems failure modes, argues multi-region should replace multi-AZ as the HA baseline for systems crossing jurisdictions, and outlines design patterns, chaos experiments, and an ALE model to justify the spend. (Article)

TOP Java NEWS HEADLINES

  1. Google ADK for Java 1.0 Introduces New App and Plugin Architecture, External Tools Support, and More

  2. Java News Roundup: OpenJDK JEPs, Jakarta EE 12, Spring Framework, Micrometer, Camel, JBang

Redesigning Banking PDF Table Extraction: A Layered Approach with Java

PDF table extraction often looks easy until it fails in production. Real bank statements can be messy, with scanned pages, shifting layouts, merged cells, and wrapped rows that break standard Java parsers. This article shares how we redesigned the approach using stream parsing, lattice/OCR, validation, scoring, and selective ML to make extraction more reliable in real banking systems. (Article)

TOP Web Development NEWS HEADLINES

  1. React Navigation 8.0 Alpha with Native Bottom Tabs, Reworked TypeScript Inference and History

  2. pnpm 11 Release Candidate: ESM Distribution, Supply Chain Defaults and a New Store Format

  3. Pretext.js Bypasses DOM Layout Reflow, Enabling Advanced UX Patterns at 120 FPS

Building Production-Ready tRPC APIs: The TypeScript Alternative to Apollo Federation

This article details our migration from Apollo Federation to a TypeScript-based tRPC stack, which resulted in an 89% reduction in bugs and 67% faster response times. It also covers the mistakes we made, the unexpected performance gains, and an overview of the production architecture we use today to handle 2.4 million daily requests with 99.97% uptime. (Article)

Google Introduces Room 3.0: a Kotlin-First, Async, Multiplatform Persistence Library

Room 3.0 is a major update to Android's persistence library that introduces breaking changes in key areas. The new release focuses on modernizing Android persistence layer around Kotlin Multiplatform and expands platform support to include JavaScript and WebAssembly. (News)

Sponsored by Guardsquare

Lessons Learned from Security Incidents in Mobile Apps - Sponsored by Guardsquare

Lessons Learned from Security Incidents in Mobile Apps

On May 12, Security Researcher Jan Seredynski will walk through recent mobile app breaches across banking, food delivery, and e-commerce, breaking down how they happened and what teams can do differently. Register now.

TOP Architecture & Design NEWS HEADLINES

  1. Dropbox Collaborates with GitHub to Reduce Monorepo Size from 87GB to 20GB

  2. Cloudflare Outlines MCP Architecture as Enterprises Confront Security and Governance Risks

  3. Anthropic Introduces Managed Agents to Simplify AI Agent Deployment

  4. Slack Rebuilds Notification System, Reports 5X Increase in Settings Engagement

How to Build an Exchange: Sub Millisecond Response Times and 24/7 Uptimes in the Cloud

Frank Yu shares Coinbase’s engineering philosophy for building resilient, fair, and fast financial exchanges. He explains the power of a single-threaded architecture combined with the Raft consensus algorithm to maintain 24/7 availability. He discusses how determinism enables zero-downtime rolling deployments and the ability to replay production logs for perfect bug reproduction. (Presentation with transcript included)

Event-Driven Patterns for Cloud-Native Banking - What Works, What Hurts?

Chris Tacey-Green discusses the shift from synchronous commands to asynchronous events within highly regulated environments. He explains the critical role of Inbox and Outbox patterns in preventing data loss, the nuances of event versioning, and how to maintain decoupling between domains. He shares "battle-tested" principles for implementing fault tolerance and managing eventual consistency. (Presentation with transcript included)

TOP Culture & Methods NEWS HEADLINES

  1. How Observability and Telemetry Can Enhance the Practice of Software Engineering

Panel: Building a Culture that Works

The panelists share insights on evolving company culture. They discuss leveraging feedback loops, lending social capital, and the friction between legacy bureaucracy and agile engineering. The panel explains how to maintain cohesion in remote teams and use interviews to uncover the true "unmanicured" culture of a firm. (Presentation with transcript included)

SPONSORED CONTENT

Latest Sponsored Resources

document AI-Native Software Delivery - Download the eBook (By O'Reilly)

document The InfoQ Trends Reports 2025 eMag

document Architecture Through Different Lenses 2025

document Scalable Enterprise Java for the Cloud - Download the eBook