• Building a Scalable AI Infrastructure: Kubernetes, NVIDIA GPUs, and Beyond

    Building a scalable and efficient AI infrastructure is challenging, particularly when serving GPU-optimized models for text, image, and audio generation. This post describes our experience building a scalable multi-tenant AI system, with emphasis on GPU resource management, virtualization, queue structuring, and adaptability to diverse computing environments.

• Preparing aiAgent v1 for Production

    The aiAgent project has been in development for some time, and our team is excited to share the progress so far. This post gives a detailed update on the project, focusing on the areas under active development. Our goal is a stable version 1 release of aiAgent by the end of May, which we will then begin deploying to production systems.
