Michael Albada introduces himself as a principal applied scientist at Microsoft, mentioning his work on Security Copilot and background at Uber and startups
The talk is a summary of a forthcoming 300-page O’Reilly book on building applications with AI agents, with sample code available in the book
The presentation will cover the promise and challenges of AI agents, core components for building effective systems, and common pitfalls
The Promise and Challenges of Agentic Systems 01:12
There is a significant increase (254% in three years) in startups labeling themselves as agentic or focused on building agents
While there is excitement and investment, agentic systems are challenging, often requiring multiple tool calls and operating in complex environments
Current benchmarks show substantial progress (50-70% on complex tasks), but perfection is unrealistic, especially for edge cases
It’s easy to achieve initial prototypes with 70% accuracy, but addressing the "long tail" of complex scenarios is much harder
An agent is defined as an entity that can reason, act, communicate, and adapt to solve tasks, building on foundation models
Agency in systems is a spectrum, not a binary — it's more about utility than achieving maximum agency
Effectiveness of the system must not be sacrificed for additional agency; previous generations like Robotic Process Automation were effective but brittle
Agentic systems promise adaptability to changing inputs, but must maintain a high level of performance
Foundation models can leverage exposed tools (via APIs) for expanded functionality but this adds risk and requires careful selection of exposed actions
Tools are invoked in a loop: model generates output, tool is called, observation is fed back, repeating until the final output is produced
Avoid a one-to-one mapping from all APIs to agent tools; too many tools reduce overall accuracy and cause confusion
Tools should be logically grouped, clearly described, and feel like single human-facing actions
Multi-agent systems are useful when a single agent becomes overwhelmed by too many tools; tools are grouped and delegated to specialized agents with a coordinator
Agent-to-agent protocols aim for cooperation among agents built by different teams, but this is still an emerging area with technical and security challenges