A significant portion of enterprise data (90%) exists in unstructured formats like PDFs and Word documents, requiring human interpretation for decision-making.
AI agents are now capable of reasoning over large amounts of unstructured data, performing analysis, research, and taking actions autonomously.
Two main categories of agents are identified: assistive agents (help users retrieve information) and automation agents (perform routine tasks with minimal human input).
The concept of a Document MCP server is introduced as a means to equip AI agents with the necessary tools for understanding and manipulating documents.
Challenges related to complex document formats are acknowledged, stressing the importance of accurate processing.
Two main UX categories for agents are discussed: assistant-based (chat-oriented, more human involvement) and automation interfaces (structured, less human oversight).
Examples of use cases include financial data normalization and invoice reconciliation.
Case studies illustrate the use of document agents in financial due diligence and enterprise search, showcasing their ability to process unstructured data and provide insights.
Automation agents are utilized to streamline tedious tasks, significantly reducing the time required for document processing.
LlamaIndex positions itself as a customizable platform for automating document workflows, inviting further discussion and engagement from the audience.