Latest Articles

Blog

Insights on production AI, platform engineering, and building systems that scale

Insights from Production

Notes on building AI systems that actually work—drawn from 34 years of enterprise systems and a platform currently processing 100K+ monthly operations. No hype, no vendor pitches, no theory disconnected from reality.

Building a Scalable AI Infrastructure: Kubernetes, NVIDIA GPUs, and Beyond
Aug 27, 2023 6 min read

Building a scalable, efficient AI infrastructure is challenging, particularly when serving GPU-optimized models for text, image, and audio generation. This post covers our experience building a scalable multi-tenant AI system, emphasizing the management of GPU resources, virtualization, queue structuring, and the...
Preparing aiAgent v1 for Production
Apr 21, 2023 2 min read

The aiAgent project has been in development for some time, and our team is excited to share the progress made so far. This post will provide a detailed update on the project, focusing on the areas currently under active development. Our goal is to have a stable version 1 release...