Logan Kilpatrick: Windsurf Acquisition, Gemini 3, Agentic Browsing, Veo 4, and more!

Windsurf Acquisition & Developer Ecosystem 00:56

  • The Windsurf team is joining DeepMind directly, reflecting the growing opportunity in the developer space and changing methods of software creation.
  • The acquisition was primarily a talent play; Google did not acquire any product, IP, or customers—those went to Cognition.
  • DeepMind is increasingly focusing on developer products alongside foundational research, indicating the importance of developers within the organization.
  • There is concern about the trend of team-only acquisitions leaving products and some startup employees behind, which may not be optimal for the startup ecosystem, but overall, the environment for AI startups is positive with many promising outcomes.

Future of AI UI & Form Factors 05:02

  • The ultimate form factor for AI assistants will likely be a mix of interfaces—chat, audio, background asynchronous agents, and potentially hardware like smart glasses.
  • Glasses are seen as a promising interaction layer due to their ability to capture visual context and provide models with the same situational awareness as users.
  • Most productive work still happens on computers and phones; glasses may not be necessary for all use cases.
  • Voice is an increasingly important UI; Google's models have advanced in both understanding and generating high-fidelity, lifelike audio.
  • Contextual awareness—AI adapting its interface (voice, text, or visual) based on the user's environment (e.g., car, desktop)—is a key emerging capability.

Agentic Browsing and Web Interaction 13:49

  • There is a vision of AI agents acting as intermediaries between users and the web, possibly reducing the need for users to visit many traditional websites.
  • However, authentic, human-generated content and trusted perspectives remain highly valued; AI summaries are helpful but do not replace authoritative or personal viewpoints.
  • As AI lowers the barrier for generating credible-looking content, the value of authenticity and trusted sources increases, making search even more important.
  • Human agency and the desire to make personal choices mean AI will often supplement, not replace, human-driven browsing/research, especially for important tasks.
  • For complex, time-consuming tasks like travel planning and research, fully AI-driven agents could introduce entirely new user behaviors that did not previously exist.

Models vs. Systems: Evolution and Scaffolding 19:59

  • Early AI models needed significant scaffolding (additional logic, guardrails, orchestration) for production deployment, but improving model capabilities are reducing this need.
  • Reasoning models and integrated tool use are driving further automation—models are increasingly agentic out-of-the-box.
  • As core models gain capabilities, the boundary between “model” and “scaffolded system” is shifting; while old scaffolding is superseded, new needs for complex capabilities arise as models evolve.
  • Builders should expect to adapt quickly and focus only on scaffolding essential for their current use cases, acknowledging that models will continue to improve and “eat” more tasks natively.

AI’s Impact on Developer Population & Skills 24:41

  • AI makes engineers more productive and is expected to increase, rather than decrease, the total number of developers.
  • Learning to code builds systems thinking and perseverance—skills not likely to go away even as abstraction levels rise.
  • Unfamiliarity with coding remains a hurdle for many; AI tools should aim to educate and empower, not just automate—bringing millions of new developers into the fold.
  • The idea of natural language as the sole source of truth, compiled into code, is discussed but seen as impractical for high-stakes applications due to the loss of determinism and control.

Generative Media: Veo V3 and Beyond 30:12

  • Google V3 (Veo) image/video model, now available via API, enables creation of high-fidelity audio and video, dramatically lowering the barrier for creative production.
  • Tens of millions of videos and billions of views have been generated since launch, with both casual creators and major brands using the technology.
  • The fusion of AI-generated video with audio “brings creations to life” and enables a new range of use cases, from marketing to hyper-personalized content.
  • The progression from V2 to V3 represents a significant qualitative leap, and ongoing advancements (toward V4) are expected to include longer, more coherent video generation and lower costs.
  • Human taste, authenticity, and agency become even more important in a landscape where creation is democratized; AI amplifies but does not replace expert creativity.

Model Architectures & Speed 38:14

  • Diffusion models offer extremely fast inference and are useful for certain code and UI generation use cases, though quality may be lower than transformers.
  • Google is evaluating whether diffusion models will scale as well as transformers for future applications.
  • Model speed is often underappreciated but critical for user experience and iteration; faster models can keep users in the flow during tasks like coding.

Model Scaling: Pretraining, Post-training, Reasoning 40:28

  • AI model advancement now expands on three axes: pretraining, post-training, and reasoning (via RL).
  • Improvements in pretraining effects are magnified by post-training and reasoning, yielding nonlinear gains and making continued scaling along all axes highly advantageous.
  • Gemini 3 is teased as “exciting,” but details are not revealed. User feedback requests focus on more tools, better speed, and enhanced usability in AI Studio.

Google’s Differentiators & Competition 43:04

  • The AI competition landscape is driven by shared aims to improve the world, with Google leveraging advantages in scale and research breadth.
  • DeepMind’s work spans beyond LLMs into foundational science, with some research (e.g., protein folding) earning high recognition and contributing back to core models.
  • Google must build AI products usable by billions—including many unfamiliar with generative AI—via core products like Search, giving unique responsibilities and challenges.
  • Key differentiators for Google include proprietary TPU chips, which support efficient model deployment to vast user bases cost-effectively, and state-of-the-art advancements in multimodal understanding, reasoning, and coding.
  • Continued investment in scaling and research positions Google for long-term influence in the market.