How agents broke app-level infrastructure - Evan Boyle

Introduction to AI Infrastructure Challenges 00:03

  • Evan Boyle introduces the topic of how agents affect app-level infrastructure, particularly focusing on the compute layer.
  • He shares his background as the founder of Genisx and experience with cloud and developer tools.

The Evolution of AI Engineering 00:15

  • Engineers start with simple prompts, leading to successful funding rounds but eventually face complex workflows.
  • Transitioning from AI engineering to data engineering is necessary due to the challenges of context management.

Infrastructure Limitations 01:10

  • Traditional web infrastructure is ill-suited for modern AI applications which often require longer response times.
  • Current AI applications can have latency of a few seconds, unlike the sub-second response times in web 2.0.

Workflow Management Challenges 03:36

  • Users frequently encounter issues due to infrastructure limitations, leading to unreliable applications.
  • Existing tools for long-running workflows often require complex setups that many developers find cumbersome.

Serverless Limitations 04:09

  • Current serverless providers impose timeouts and limitations on long-running processes, complicating user experience.
  • The importance of resumability in applications is highlighted, especially when users refresh or navigate away from the page.

User Experience in AI Applications 04:48

  • Use cases demonstrate the need for background processing to avoid user drop-off during long tasks.
  • The necessity of keeping users informed about task progress to enhance engagement and satisfaction is emphasized.

Development of an Open Source Library 07:28

  • Evan discusses the creation of a component model for building workflows that are infrastructure-aware and reusable.
  • The anti-framework design promotes composition and reusability over abstraction.

Workflow Execution and Debugging 09:29

  • Workflows can automatically create REST APIs that support both synchronous and asynchronous calls.
  • Detailed tracing and error handling are integrated, allowing developers to debug easily and manage retries.

Architectural Considerations 10:04

  • Separation of API and compute layers is essential for scalability and flexibility in handling workflows.
  • Use of Redis streams allows for resumability and easier management of user interactions with workflows.

Lessons for Building Agentic Workflows 12:11

  • Start simple and plan for long-running processes without overcomplicating initial designs.
  • Emphasizes the importance of error handling, user experience, and the intricacies of deploying workflows.

Conclusion and Resources 13:14

  • Evan encourages developers to explore GenSx on GitHub for pre-built infrastructure solutions.
  • He concludes with gratitude for the opportunity to present and wishes the audience success in their projects.