Latest Articles

Blog

Insights on production AI, platform engineering, and building systems that scale

Insights from Production

Notes on building AI systems that actually work, drawn from 34 years of enterprise systems experience and a platform currently processing 100K+ operations per month. No hype, no vendor pitches, no theory disconnected from reality.

AI Partnership Programming: Two Approaches to Staying in the Driver's Seat
Video · Jan 22, 2026 · 13 min read

The promise of AI-assisted development comes packaged with impressive statistics: “AI writes 50% of our code” or “10x productivity gains.” These numbers create a seductive narrative where developers offload coding work to AI while focusing on higher-level tasks, or perhaps...

Building a Multi-Tenant CDN with CloudFront Functions and KeyValueStore
AWS · Dec 17, 2025 · 5 min read

Serving multiple tenants from a single CloudFront distribution is a cost-effective way to scale a SaaS platform. Rather than spinning up separate infrastructure for each customer, you can use CloudFront Functions combined with KeyValueStore (KV) to dynamically route requests based on hostname—all at the edge, with sub-millisecond latency. In this...

Agent Security and Prompt Injection: How to Safely Integrate AI Tools
Video · Nov 20, 2025 · 3 min read

The ability of Large Language Models (LLMs) to control applications via tool calls (functions) is revolutionary, but it introduces serious security risks, primarily from prompt injection. Prompt injection occurs when a user or outside data source (like a LinkedIn profile’s “About” section) injects malicious...

Application Control via LLM Conversation: Fusing the UX/UI Boundary
Video · Jul 21, 2024 · 3 min read

Welcome to the recap of my July 2024 presentation at the Vegas Tech Alley AI Meetup. This talk explores a different paradigm for application design: making the LLM conversation the primary method of control and navigation, effectively fusing the boundaries between the user interface...

Building a Scalable AI Infrastructure: Kubernetes, NVIDIA GPUs, and Beyond
Aug 27, 2023 · 6 min read

Building a scalable, efficient AI infrastructure is fraught with challenges, particularly when dealing with GPU-optimized models for text, image, and audio generation. This post describes our experience building a scalable multi-tenant AI system, emphasizing the management of GPU resources, virtualization, queue structuring, and the...

Preparing aiAgent v1 for Production
Apr 21, 2023 · 2 min read

The aiAgent project has been in development for some time, and our team is excited to share the progress made so far. This post will provide a detailed update on the project, focusing on the areas currently under active development. Our goal is to have a stable version 1 release...