Thompson Reuters' initial focus for AI assistants was on being helpful, referencing information accurately, and providing citations.
Recently, the priority has shifted from simple helpfulness to productivity, expecting AI to produce output, make judgments, and decisions on behalf of users.
High-stakes environments (law, tax, global trade, risk, and fraud) require higher accuracy due to the severe consequences of errors.
Thompson Reuters has a long history in these fields, with a strong base of domain expertise and proprietary content.
Evaluation (eval) of AI systems is complex and crucial for building trust, hindered by AI's non-deterministic nature.
Both users and internal teams struggle with inconsistency in evaluation; human experts’ assessments can vary by over 10% on identical cases.
Referencing reliable source material grows harder as agency increases; tracing decision drift and building robust guardrails requires deep domain expertise.
Rigorous evaluation rubrics are developed, but user preference remains a key north star in assessing if systems are improving.
Legacy applications, often seen as outdated, are now valuable as decomposable assets—agents can utilize their highly tuned domain logic as tools.
The concept of MVP (minimum viable product) can be limiting; building out the whole system first often yields better understanding and results.
Product Demonstrations and Application Examples 12:01
Demonstration of a tax workflow where AI automates document ingestion, data extraction, mapping to tax engines, applying tax law rules, validating results, and generating returns.
Success in automation is due to legacy tools (tax engines, validation engines) which AI can leverage for calculations and error checking.
Legal research application: AI uses litigation research tools to search, validate, and compare legal documents, statutes, citations, and blogs.
The AI tracks its evidence, writes intermediate notes, and compiles a final report with explicit, traceable citations and risk flags.
These examples highlight the benefits of decomposing legacy systems and building full-fledged products that agents can operate within.