Blog

Insights on production AI, platform engineering, and building systems that scale

Insights from Production

Notes on building AI systems that actually work—drawn from 34 years of enterprise systems and a platform currently processing 100K+ monthly operations. No hype, no vendor pitches, no theory disconnected from reality.

Screen-AI: Building a Native macOS App Overnight Without Knowing Swift

Development

Feb 11, 2026 12 min read

Screen-AI: Building a Native macOS App Overnight Without Knowing Swift

Screen-AI: Building a Native macOS App Overnight Without Knowing Swift I don’t know Swift. I’ve never built a macOS application. Last night I started a new project at around 10pm. By this morning I had a native desktop app that watches my screen, maintains a running AI conversation about what...

AI Partnership Programming: Two Approaches to Staying in the Driver's Seat

Video

Jan 22, 2026 13 min read

AI Partnership Programming: Two Approaches to Staying in the Driver's Seat

AI Partnership Programming: Two Approaches to Staying in the Driver’s Seat The promise of AI-assisted development comes packaged with impressive statistics: “AI writes 50% of our code” or “10x productivity gains.” These numbers create a seductive narrative where developers offload coding work to AI while focusing on higher-level tasks—or perhaps...

Jan 06, 2026 4 min read

Building Voice into AI Applications: Streaming TTS and Multi-Voice Audio Production

Building Voice into AI Applications What if your AI-generated story could sound like an audiobook with a full cast? What if each character had a distinct, recognizable voice—and when one picks up a guitar, you actually hear the song? We’ve been deep in audio development at Kusog AI, integrating text-to-speech...

Building a Multi-Tenant CDN with CloudFront Functions and KeyValueStore

AWS

Dec 17, 2025 5 min read

Building a Multi-Tenant CDN with CloudFront Functions and KeyValueStore

Serving multiple tenants from a single CloudFront distribution is a cost-effective way to scale a SaaS platform. Rather than spinning up separate infrastructure for each customer, you can use CloudFront Functions combined with KeyValueStore (KV) to dynamically route requests based on hostname—all at the edge, with sub-millisecond latency. In this...

Agent Security and Prompt Injection: How to Safely Integrate AI Tools

Video

Nov 20, 2025 3 min read

Agent Security and Prompt Injection: How to Safely Integrate AI Tools

🛡️ Agent Security and Prompt Injection The capabilities of Large Language Models (LLMs) to control applications via tool calls (functions) are revolutionary. However, this introduces serious security risks, primarily from Prompt Injection. Prompt injection occurs when a user or outside data source (like a LinkedIn profile’s “About” section) injects malicious...

Application Control via LLM Conversation: Fusing the UX/UI Boundary

Video

Jul 21, 2024 3 min read

Application Control via LLM Conversation: Fusing the UX/UI Boundary

🗣️ Application Control via LLM Conversation Welcome to the recap of my July 2024 presentation at the Vegas Tech Alley AI Meetup. This talk explores a different paradigm for application design: making the LLM conversation the primary method of control and navigation, effectively fusing the boundaries between the user interface...

The Architecture of AI Personalities: Roles, RAG, and Orchestration

Video

Jun 25, 2024 3 min read

The Architecture of AI Personalities: Roles, RAG, and Orchestration

🤖 The Architecture of AI Personalities Hello everyone, and welcome! This post covers the talk I gave at the Vegas Tech Alley AI Meetup in June 2024, where we explored the concept of AI personalities: what they are, what value they provide, and how they are constructed to orchestrate complex...

Semantic Networks in the LLM World: Structuring Unstructured Data for AI

Video

Mar 22, 2024 3 min read

Semantic Networks in the LLM World: Structuring Unstructured Data for AI

🕸️ Semantic Networks in the LLM World Welcome to the recap of my talk at the Vegas Tech Alley AI Meetup, where we explored the powerful, long-standing concept of Semantic Networks and how modern LLMs are finally making them practical for real-world application. A Semantic Network is a way of...

Aug 27, 2023 6 min read

Building a Scalable AI Infrastructure: Kubernetes, NVIDIA GPUs, and Beyond

The journey to creating a scalable and efficient AI infrastructure is fraught with challenges, particularly when dealing with GPU-optimized models for text, image, and audio generation. This blog delves into our experience of building a scalable multi-tenant AI system, emphasizing the management of GPU resources, virtualization, queue structuring, and the...

Apr 21, 2023 2 min read

Preparing aiAgent v1 for Production

The aiAgent project has been in development for some time, and our team is excited to share the progress made so far. This post will provide a detailed update on the project, focusing on the areas currently under active development. Our goal is to have a stable version 1 release...