Agent security is about what happens when AI systems can browse, use tools, remember state, and take actions across multiple steps. The security boundary moves from a chat response to a longer workflow with identity, permissions, memory, and operational consequences.
Agent Security
Controls and attack paths for browsing, tool use, memory, identity, and action-taking agents.
- What tools the agent can reach and under which identity
- How memory, plans, and previous steps influence later actions
- What approvals or reversibility exist when the agent gets it wrong
- Unsafe tool use and hidden privilege expansion
- Prompt injection flowing into planning and execution
- Long-running workflows accumulating risky state or momentum
- Teams shipping assistant-to-agent product transitions
- Practitioners studying autonomy and tool use
- Operators responsible for controls around high-impact actions
Current notes, events, and source material
These items are included because they add useful evidence, framing, implementation detail, or upcoming context for teams working in this area.
GAISS 2026: IEEE GenAI for Secure Systems
GAISS 2026 is an IEEE conference at the University of Texas at Austin focused on generative AI for secure systems, including red teaming, blue-team automation, governance, and agentic secure AI.
IAPP P.S.R. + AI Governance Global 2026
IAPP Privacy. Security. Risk. + AI Governance Global 2026 brings privacy, cybersecurity law, technology, and AI governance professionals together in Seattle.
OpenAI DevDay 2026
OpenAI DevDay 2026 is scheduled for September 29 in San Francisco and is OpenAI’s primary developer event for platform updates.
AGNTCon + MCPCon Europe 2026
AGNTCon + MCPCon Europe 2026 brings agent and MCP builders to Amsterdam to cover agent architectures, protocols, infrastructure, security, observability, and interoperability.
Black Hat USA 2026 AI Summit
Black Hat USA 2026 includes an AI Summit and security briefings in Las Vegas focused on how artificial intelligence is changing digital defense.
Play video
Claude Fable Blocked - 11 Quiet Details on What’s Next
Claude Fable 5 banned, but what’s the bigger story. We go through 11 under-reported details, so you have the context to see what’s coming next for your use of AI. From whether the ban will last, what the possible motives are, what the model can actually do, and some wild over-extrapolations going on. Check out my fast-
Powering the next era of Confidential AI
We are thrilled to collaborate with Apple on its expanded Private Cloud Compute (PCC) systems announced this week at WWDC 2026.
New Attacks Trick OpenClaw AI Agent Into Running Code and Leaking Secrets
Two security teams have shown, in separate research published this week, that OpenClaw, the popular self-hosted AI agent, can be driven to run attacker-controlled code or hand over sensitive data through ordinary-looking inputs. Imperva buried instructions inside shared contacts, vCards, and location pins that the agen
Turn specs into evals for any agent with ASSERT
Adaptive Spec-driven Scoring for Evaluation and Regression Testing (ASSERT) is an open-source framework for converting natural language behavior requirements into executable evaluations of AI models and agents. The post Turn specs into evals for any agent with ASSERT appeared first on Microsoft Security Blog .
Investing in multi-agent AI safety research
Google DeepMind and partners announce a $10M funding call for multi-agent safety research.
Anthropic Releases Claude Fable 5, Its Most Powerful AI Yet, With Cyber Safeguards
On June 9, Anthropic released Claude Fable 5, the most capable model it has ever made, generally available. It also did something unusual: it shipped one model as two products, split not by capability but by a layer of safety classifiers. Fable 5 goes to the public. Its twin, Claude Mythos 5, the same underlying model
Play video
Claude Fable 5 - Full 319 page Breakdown
Fable 5 is out - and it’s good, very good. But beyond the splashy demos, I want to bring you the 20+ nuggets from the 319 page system card, which I read in full, all day, plus benchmarks you may not have noticed. https://assemblyai.com/aiexplained Plus two worrying trends inside the ‘mind’ of Claude, how OpenAI counter
Detecting and containing AI-powered threats with Google Security Operations agents
Learn how Google Security Operations works in concert with AI Threat Defense to monitor, detect, and respond to threats, particularly from code you do not own or can not patch.
Reconstructing AI activity in investigations
Learn how to investigate AI activity in Microsoft 365 Copilot and Azure AI services using a structured, telemetry-driven approach. This playbook helps security teams reconstruct events, assess data exposure, and detect potential threats faster. The post Reconstructing AI activity in investigations appeared first on Mic
AI brands as bait: How threat actors are using the AI hype in social engineering
As threat actors operationalize AI to accelerate attacks, they are also leveraging the wider global interest around AI itself as a social engineering lure. The post AI brands as bait: How threat actors are using the AI hype in social engineering appeared first on Microsoft Security Blog .
Securing CI/CD in an agentic world: Claude Code Github action case
Microsoft Threat Intelligence identified a prompt injection pathway in Claude Code GitHub Action that allowed access to workflow secrets under specific conditions. This research examines the attack chain, responsible disclosure process, Anthropic's mitigation, and guidance for securing AI-powered CI/CD workflows. The p
Updating the taxonomy of failure modes in agentic AI systems: What a year of red teaming taught us
A surge in real-world attacks against agentic AI systems is reshaping how we think about risk. Based on 12 months of red teaming, this update introduces seven new failure modes, from supply chain compromise to goal hijacking, and the practical mitigations teams need now. The post Updating the taxonomy of failure modes
Preinstall to persistence: Inside the Red Hat npm Miasma credential-stealing campaign
A large-scale npm supply chain attack compromised over 90 versions of @redhat-cloud-services packages, silently infecting CI/CD environments and developer systems. The malicious code steals credentials from GitHub, cloud platforms, and local machines, then spreads like a worm by republishing trusted packages. Discover
Microsoft Build 2026: Securing code, agents, and models across the development lifecycle
Discover how Microsoft enables fast, secure AI development with MDASH and new security capabilities. The post Microsoft Build 2026: Securing code, agents, and models across the development lifecycle appeared first on Microsoft Security Blog .
Gartner Security & Risk Management Summit 2026
Gartner Security & Risk Management Summit 2026 brings CISOs and security leaders together in National Harbor, Maryland, with tracks covering AI, cyber risk, application security, data security, operations, privacy, and governance.
Cloud CISO Perspectives: How to build an AI-ready security program for the public sector
From industrial control systems to decades-old municipal databases, here’s our CISO guidance to prep AI-ready security programs for the public sector.
Play video
New Claude Opus 4.8: 15 Things You May’ve Missed
The ‘best’ generally available AI model just dropped, but there is plenty I bet you missed about what it is, how it performs, and what the release tells us. 15 highlights from the 244 page system card, plus private testing, leader interview and more. AI Insiders ($9!): https://www.patreon.com/AIExplained Chapters: 00:0
ACM CAIS 2026
ACM CAIS 2026 is a research-focused conference on compound AI architectures, optimization, deployment, and agentic AI systems in San Jose, California.
Project Glasswing: An initial update
Anthropic reports early Project Glasswing results using Mythos Preview with infrastructure partners and external testers, including large-scale vulnerability discovery and a cautious disclosure posture.
ChatGPT Enterprise & Edu Codex release notes: May 2026
OpenAI’s Enterprise and Edu release notes describe Codex updates including goal mode, browser improvements, locked computer use, app-window context, admin analytics, and plugin sharing status.
Gemini 3.5: frontier intelligence with action
Gemini 3.5 is built to help you execute complex, agentic workflows.
Play video
GPT 5.5 Arrives, DeepSeek V4 Drops, and the Compute War Intensifies
GPT 5.5 full analysis, plus DeepSeek V4 paper highlights, comparisons with Mythos, a vibe-coded game w/ GPT Image 2, and 50 data-points you wouldn’t get from just reading the headlines. https://80000hours.org/aiexplained Check out my fast-growing (!) app, free to use, and code INSIDER15 for paid tiers: https://lmcounci
Play video
AgentCraft: Putting the Orc in Orchestration — Ido Salomon
AI Engineer session on AgentCraft: Putting the Orc in Orchestration, presented by Ido Salomon. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agents need more than a chat - Jacob Lauritzen, CTO Legora
AI Engineer session on Agents need more than a chat - Jacob Lauritzen, CTO Legora. It adds practical context for how teams are building and operating AI systems in production.
Play video
Full Workshop: Build Your Own Deep Research Agents - Louis-François Bouchard, Paul Iusztin, Samridhi
AI Engineer session on Full Workshop: Build Your Own Deep Research Agents - Louis-François Bouchard, Paul Iusztin, Samridhi. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Future of MCP — David Soria Parra, Anthropic
AI Engineer session on The Future of MCP, presented by David Soria Parra, Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Harness Engineering: How to Build Software When Humans Steer, Agents Execute — Ryan Lopopolo, OpenAI
AI Engineer session on Harness Engineering: How to Build Software When Humans Steer, Agents Execute, presented by Ryan Lopopolo, OpenAI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Paperclip: Open Source Human Control Plane for AI Labor — Dotta Bippa
AI Engineer session on Paperclip: Open Source Human Control Plane for AI Labor, presented by Dotta Bippa. It adds practical context for how teams are building and operating AI systems in production.
Play video
From Chaos to Choreography: Multi-Agent Orchestration Patterns That Actually Work — Sandipan Bhaumik
AI Engineer session on From Chaos to Choreography: Multi-Agent Orchestration Patterns That Actually Work, presented by Sandipan Bhaumik. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agentic Engineering: Working With AI, Not Just Using It — Brendan O'Leary
AI Engineer session on Agentic Engineering: Working With AI, Not Just Using It, presented by Brendan O'Leary. It adds practical context for how teams are building and operating AI systems in production.
Play video
Your Insecure MCP Server Won't Survive Production — Tun Shwe, Lenses
AI Engineer session on Your Insecure MCP Server Won't Survive Production, presented by Tun Shwe, Lenses. It adds practical context for how teams are building and operating AI systems in production.
Play video
Bending a Public MCP Server Without Breaking It — Nimrod Hauser, Baz
AI Engineer session on Bending a Public MCP Server Without Breaking It, presented by Nimrod Hauser, Baz. It adds practical context for how teams are building and operating AI systems in production.
Play video
Judge the Judge: Building LLM Evaluators That Actually Work with GEPA — Mahmoud Mabrouk, Agenta AI
AI Engineer session on Judge the Judge: Building LLM Evaluators That Actually Work with GEPA, presented by Mahmoud Mabrouk, Agenta AI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Platforms for Humans and Machines: Engineering for the Age of Agents — Juan Herreros Elorza
AI Engineer session on Platforms for Humans and Machines: Engineering for the Age of Agents, presented by Juan Herreros Elorza. It adds practical context for how teams are building and operating AI systems in production.
Play video
Claude Mythos: Highlights from 244-page Release
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Assessing Claude Mythos Preview’s cybersecurity capabilities
Claude Mythos Preview is a new general-purpose language model that is strikingly capable at computer security tasks. This post provides technical details for researchers and practitioners who want to understand exactly how we have been testing this model, and what we have found over the past month. We hope this will sh
Detecting and analyzing prompt abuse in AI tools
Microsoft Incident Response explains how to detect prompt abuse using logging, telemetry, and incident response workflows.
Designing AI agents to resist prompt injection
OpenAI frames prompt injection as an agent-security problem that increasingly resembles social engineering rather than simple string matching.
Reverse engineering Claude's CVE-2026-2796 exploit
This post dives deep into how Claude wrote an exploit for one of the vulnerabilities it found in Firefox.
Play video
Deadline Day for Autonomous AI Weapons & Mass Surveillance
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Gemini 3.1 Pro and the Downfall of Benchmarks: Welcome to the Vibe Era of AI
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
MITRE ATLAS OpenClaw Investigation Discovers New and Likeliest Techniques
MITRE maps incidents in an open-source agentic ecosystem to ATLAS techniques, showing how AI-first systems create distinct attacker paths.
Play video
The Two Best AI Models/Enemies Just Got Released Simultaneously
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
LLM-discovered 0-days
AI models can now find high-severity vulnerabilities at scale. This is a moment to empower defenders. We're now using Claude to find and help fix vulnerabilities in open source software.
Play video
Your MCP Server is Bad (and you should feel bad) - Jeremiah Lowin, Prefect
AI Engineer session on Your MCP Server is Bad (and you should feel bad) - Jeremiah Lowin, Prefect. It adds practical context for how teams are building and operating AI systems in production.
Play video
Spec-Driven Development: Agentic Coding at FAANG Scale and Quality — Al Harris, Amazon Kiro
AI Engineer session on Spec-Driven Development: Agentic Coding at FAANG Scale and Quality, presented by Al Harris, Amazon Kiro. It adds practical context for how teams are building and operating AI systems in production.
Play video
Why Agent Hype can fall short of reality — Joel Becker, METR
AI Engineer session on Why Agent Hype can fall short of reality, presented by Joel Becker, METR. It adds practical context for how teams are building and operating AI systems in production.
Play video
Claude Agent SDK [Full Workshop] — Thariq Shihipar, Anthropic
AI Engineer session on Claude Agent SDK [Full Workshop], presented by Thariq Shihipar, Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Identity for AI Agents - Patrick Riley & Carlos Galan, Auth0
AI Engineer session on Identity for AI Agents - Patrick Riley & Carlos Galan, Auth0. It adds practical context for how teams are building and operating AI systems in production.
Play video
OpenAI + @Temporalio : Building Durable, Production Ready Agents - Cornelia Davis, Temporal
AI Engineer session on OpenAI + @Temporalio : Building Durable, Production Ready Agents - Cornelia Davis, Temporal. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building durable Agents with Workflow DevKit & AI SDK - Peter Wielander, Vercel
AI Engineer session on Building durable Agents with Workflow DevKit & AI SDK - Peter Wielander, Vercel. It adds practical context for how teams are building and operating AI systems in production.
Play video
Automating Large Scale Refactors with Parallel Agents - Robert Brennan, OpenHands
AI Engineer session on Automating Large Scale Refactors with Parallel Agents - Robert Brennan, OpenHands. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Intelligent Research Agents with Manus - Ivan Leo, Manus AI (now Meta Superintelligence)
AI Engineer session on Building Intelligent Research Agents with Manus - Ivan Leo, Manus AI (now Meta Superintelligence). It adds practical context for how teams are building and operating AI systems in production.
AI Models on Realistic Cyber Ranges
In a recent evaluation of AI models’ cyber capabilities, current Claude models can now succeed at multistage attacks on networks with dozens of hosts using only standard, open-source tools, instead of the custom tools needed by previous generations.
Finding Bugs with Claude and Property-based Testing
Ensuring that programs are bug-free is one of the most challenging aspects of software engineering. We developed an agent that can efficiently identify bugs in large software projects. Our agent infers general properties of code that should be true, and then applies property-based testing. After extensive manual valida
Play video
Anthropic: Our AI just created a tool that can ‘automate all white collar work’, Me:
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Experimenting with AI to Defend Critical Infrastructure
AI could help defenders of critical infrastructure identify the vulnerabilities that attackers might exploit—and close them before they are exploited. Anthropic has partnered with Pacific Northwest National Laboratory (PNNL) to explore this defensive application of AI, demonstrating both the potential of AI-accelerated
Play video
Hacking Subagents Into Codex CLI — Brian John, Betterup
AI Engineer session on Hacking Subagents Into Codex CLI, presented by Brian John, Betterup. It adds practical context for how teams are building and operating AI systems in production.
Play video
Don't Build Agents, Build Skills Instead — Barry Zhang & Mahesh Murag, Anthropic
AI Engineer session on Don't Build Agents, Build Skills Instead, presented by Barry Zhang & Mahesh Murag, Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Hard Won Lessons from Building Effective AI Coding Agents — Nik Pash, Cline
AI Engineer session on Hard Won Lessons from Building Effective AI Coding Agents, presented by Nik Pash, Cline. It adds practical context for how teams are building and operating AI systems in production.
Play video
Infra that fixes itself, thanks to coding agents — Mahmoud Abdelwahab, Railway
AI Engineer session on Infra that fixes itself, thanks to coding agents, presented by Mahmoud Abdelwahab, Railway. It adds practical context for how teams are building and operating AI systems in production.
Play video
Making Codebases Agent Ready — Eno Reyes, Factory AI
AI Engineer session on Making Codebases Agent Ready, presented by Eno Reyes, Factory AI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Government Agents: AI Agents Meet Tough Regulations — Mark Myshatyn, Los Alamos National Lab
AI Engineer session on Government Agents: AI Agents Meet Tough Regulations, presented by Mark Myshatyn, Los Alamos National Lab. It adds practical context for how teams are building and operating AI systems in production.
Play video
Katelyn Lesse — Evolving Claude APIs for Agents, Anthropic
AI Engineer session on Katelyn Lesse, presented by Evolving Claude APIs for Agents, Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Enterprise Deep Research: The Next Killer App for Enterprise AI — Ofer Mendelevitch, Vectara
AI Engineer session on Enterprise Deep Research: The Next Killer App for Enterprise AI, presented by Ofer Mendelevitch, Vectara. It adds practical context for how teams are building and operating AI systems in production.
Play video
From Stateless Nightmares to Durable Agents — Samuel Colvin, Pydantic
AI Engineer session on From Stateless Nightmares to Durable Agents, presented by Samuel Colvin, Pydantic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Developing Taste in Coding Agents: Applied Meta Neuro-Symbolic RL — Ahmad Awais, CommandCode
AI Engineer session on Developing Taste in Coding Agents: Applied Meta Neuro-Symbolic RL, presented by Ahmad Awais, CommandCode. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agent Reinforcement Fine Tuning — Will Hang & Cathy Zhou, OpenAI
AI Engineer session on Agent Reinforcement Fine Tuning, presented by Will Hang & Cathy Zhou, OpenAI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agents are Robots Too: What Self-Driving Taught Me About Building Agents — Jesse Hu, Abundant
AI Engineer session on Agents are Robots Too: What Self-Driving Taught Me About Building Agents, presented by Jesse Hu, Abundant. It adds practical context for how teams are building and operating AI systems in production.
Play video
Developer Experience in the Age of AI Coding Agents — Max Kanat-Alexander, Capital One
AI Engineer session on Developer Experience in the Age of AI Coding Agents, presented by Max Kanat-Alexander, Capital One. It adds practical context for how teams are building and operating AI systems in production.
Play video
Proactive Agents — Kath Korevec, Google Labs
AI Engineer session on Proactive Agents, presented by Kath Korevec, Google Labs. It adds practical context for how teams are building and operating AI systems in production.
Play video
Future-Proof Coding Agents — Bill Chen & Brian Fioca, OpenAI
AI Engineer session on Future-Proof Coding Agents, presented by Bill Chen & Brian Fioca, OpenAI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Backlog.md: Terminal Kanban Board for Managing Tasks with AI Agents — Alex Gavrilescu, Funstage
AI Engineer session on Backlog.md: Terminal Kanban Board for Managing Tasks with AI Agents, presented by Alex Gavrilescu, Funstage. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Unbearable Lightness of Agent Optimization — Alberto Romero, Jointly
AI Engineer session on The Unbearable Lightness of Agent Optimization, presented by Alberto Romero, Jointly. It adds practical context for how teams are building and operating AI systems in production.
Play video
What the Freakiness of 2025 in AI Tells Us About 2026
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Continuously hardening ChatGPT Atlas against prompt injection attacks
OpenAI describes using automated red teaming and reinforcement learning to discover agent prompt injection attacks before they appear in the wild.
Play video
Gemini Exponential, Demis Hassabis' ‘Proto-AGI’ coming, but …
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Building a Production-Ready AI Security Foundation
Google Cloud outlines a defense-in-depth view of AI security spanning application controls, data protections, and infrastructure isolation.
Play video
Is GPT-5.1 Really an Upgrade? But Models Can Auto-Hack Govts, so … there’s that
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Did you miss these 2 AI stories? A *Real* LLM-crafted Breakthrough + Continual Learning Blocked?
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Sora 2 - It will only get more realistic from here
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Building an Agentic Platform — Ben Kus, CTO Box
AI Engineer session on Building an Agentic Platform, presented by Ben Kus, CTO Box. It adds practical context for how teams are building and operating AI systems in production.
Play video
An ‘AI Bubble’? What Altman Actually said, the Facts and Nano Banana
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Scaling AI Agents Without Breaking Reliability — Preeti Somal, Temporal
AI Engineer session on Scaling AI Agents Without Breaking Reliability, presented by Preeti Somal, Temporal. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agents vs Workflows: Why Not Both? — Sam Bhagwat, Mastra.ai
AI Engineer session on Agents vs Workflows: Why Not Both?, presented by Sam Bhagwat, Mastra.ai. It adds practical context for how teams are building and operating AI systems in production.
Play video
Piloting agents in GitHub Copilot - Christopher Harrison, Microsoft
AI Engineer session on Piloting agents in GitHub Copilot - Christopher Harrison, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Ship Agents that Ship: A Hands-On Workshop - Kyle Penfound, Jeremy Adams, Dagger
AI Engineer session on Ship Agents that Ship: A Hands-On Workshop - Kyle Penfound, Jeremy Adams, Dagger. It adds practical context for how teams are building and operating AI systems in production.
Play video
Pipecat Cloud: Enterprise Voice Agents Built On Open Source - Kwindla Hultman Kramer, Daily
AI Engineer session on Pipecat Cloud: Enterprise Voice Agents Built On Open Source - Kwindla Hultman Kramer, Daily. It adds practical context for how teams are building and operating AI systems in production.
Play video
[Full Workshop] Building Conversational AI Agents - Thor Schaeff, ElevenLabs
AI Engineer session on [Full Workshop] Building Conversational AI Agents - Thor Schaeff, ElevenLabs. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Applications with AI Agents — Michael Albada, Microsoft
AI Engineer session on Building Applications with AI Agents, presented by Michael Albada, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building the platform for agent coordination — Tom Moor, Linear
AI Engineer session on Building the platform for agent coordination, presented by Tom Moor, Linear. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Agents at Cloud Scale — Antje Barth, AWS
AI Engineer session on Building Agents at Cloud Scale, presented by Antje Barth, AWS. It adds practical context for how teams are building and operating AI systems in production.
Play video
Your Coding Agent Just Got Cloned And Your Brain Isn't Ready - Rustin Banks, Google Jules
AI Engineer session on Your Coding Agent Just Got Cloned And Your Brain Isn't Ready - Rustin Banks, Google Jules. It adds practical context for how teams are building and operating AI systems in production.
Play video
How to Secure Agents using OAuth — Jared Hanson (Keycard, Passport.js)
AI Engineer session on How to Secure Agents using OAuth, presented by Jared Hanson (Keycard, Passport.js). It adds practical context for how teams are building and operating AI systems in production.
Play video
From Self-driving to Autonomous Voice Agents — Brooke Hopkins, Coval
AI Engineer session on From Self-driving to Autonomous Voice Agents, presented by Brooke Hopkins, Coval. It adds practical context for how teams are building and operating AI systems in production.
Play video
How we hacked YC Spring 2025 batch’s AI agents — Rene Brandel, Casco
AI Engineer session on How we hacked YC Spring 2025 batch’s AI agents, presented by Rene Brandel, Casco. It adds practical context for how teams are building and operating AI systems in production.
Play video
Multi Agent AI and Network Knowledge Graphs for Change — Ola Mabadeje, Cisco
AI Engineer session on Multi Agent AI and Network Knowledge Graphs for Change, presented by Ola Mabadeje, Cisco. It adds practical context for how teams are building and operating AI systems in production.
Play video
Software Development Agents: What Works and What Doesn't - Robert Brennan, OpenHands
AI Engineer session on Software Development Agents: What Works and What Doesn't - Robert Brennan, OpenHands. It adds practical context for how teams are building and operating AI systems in production.
Play video
OpenAI on Securing Code-Executing AI Agents — Fouad Matin (Codex, Agent Robustness)
AI Engineer session on OpenAI on Securing Code-Executing AI Agents, presented by Fouad Matin (Codex, Agent Robustness). It adds practical context for how teams are building and operating AI systems in production.
Play video
A2A & MCP Workshop: Automating Business Processes with LLMs — Damien Murphy, Bench
AI Engineer session on A2A & MCP Workshop: Automating Business Processes with LLMs, presented by Damien Murphy, Bench. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building a Smarter AI Agent with Neural RAG - Will Bryk, Exa.ai
AI Engineer session on Building a Smarter AI Agent with Neural RAG - Will Bryk, Exa.ai. It adds practical context for how teams are building and operating AI systems in production.
Play video
Genie 3: The World Becomes Playable (DeepMind)
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Building Effective Voice Agents — Toki Sherbakov + Anoop Kotha, OpenAI
AI Engineer session on Building Effective Voice Agents, presented by Toki Sherbakov + Anoop Kotha, OpenAI. It adds practical context for how teams are building and operating AI systems in production.
Play video
(possible dupe but better sound) What does Enterprise Ready MCP mean? — Tobin South, WorkOS
AI Engineer session on (possible dupe but better sound) What does Enterprise Ready MCP mean?, presented by Tobin South, WorkOS. It adds practical context for how teams are building and operating AI systems in production.
Play video
"Data readiness" is a Myth: Reliable AI with an Agentic Semantic Layer — Anushrut Gupta, PromptQL
AI Engineer session on "Data readiness" is a Myth: Reliable AI with an Agentic Semantic Layer, presented by Anushrut Gupta, PromptQL. It adds practical context for how teams are building and operating AI systems in production.
Play video
Stateful environments for vertical agents — Josh Purtell, Synth Labs
AI Engineer session on Stateful environments for vertical agents, presented by Josh Purtell, Synth Labs. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Multimodal AI Agents From Scratch — Apoorva Joshi, MongoDB
AI Engineer session on Building Multimodal AI Agents From Scratch, presented by Apoorva Joshi, MongoDB. It adds practical context for how teams are building and operating AI systems in production.
Play video
To the moon! Navigating deep context in legacy code with Augment Agent — Forrest Brazeal, Matt Ball
AI Engineer session on To the moon! Navigating deep context in legacy code with Augment Agent, presented by Forrest Brazeal, Matt Ball. It adds practical context for how teams are building and operating AI systems in production.
Play video
Effective agent design patterns in production — Laurie Voss, LlamaIndex
AI Engineer session on Effective agent design patterns in production, presented by Laurie Voss, LlamaIndex. It adds practical context for how teams are building and operating AI systems in production.
Play video
12-Factor Agents: Patterns of reliable LLM applications — Dex Horthy, HumanLayer
AI Engineer session on 12-Factor Agents: Patterns of reliable LLM applications, presented by Dex Horthy, HumanLayer. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agentic GraphRAG: AI’s Logical Edge — Stephen Chin, Neo4j
AI Engineer session on Agentic GraphRAG: AI’s Logical Edge, presented by Stephen Chin, Neo4j. It adds practical context for how teams are building and operating AI systems in production.
Play video
Why Your Agent’s Brain Needs a Playbook: Practical Wins from Using Ontologies - Jesús Barrasa, Neo4j
AI Engineer session on Why Your Agent’s Brain Needs a Playbook: Practical Wins from Using Ontologies - Jesús Barrasa, Neo4j. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agentic GraphRAG: Simplifying Retrieval Across Structured & Unstructured Data — Zach Blumenfeld
AI Engineer session on Agentic GraphRAG: Simplifying Retrieval Across Structured & Unstructured Data, presented by Zach Blumenfeld. It adds practical context for how teams are building and operating AI systems in production.
Play video
CIAM for AI: Authn/Authz for Agents — Michael Grinich, CEO of WorkOS
AI Engineer session on CIAM for AI: Authn/Authz for Agents, presented by Michael Grinich, CEO of WorkOS. It adds practical context for how teams are building and operating AI systems in production.
Play video
Full Spec MCP: Hidden Capabilities of the MCP spec — Harald Kirschner, Microsoft/VSCode
AI Engineer session on Full Spec MCP: Hidden Capabilities of the MCP spec, presented by Harald Kirschner, Microsoft/VSCode. It adds practical context for how teams are building and operating AI systems in production.
Play video
MCP Is Not Good Yet — David Cramer, Sentry
AI Engineer session on MCP Is Not Good Yet, presented by David Cramer, Sentry. It adds practical context for how teams are building and operating AI systems in production.
Play video
[Workshop] AI Pipelines and Agents in Pure TypeScript with Mastra.ai — Nick Nisi, Zack Proser
AI Engineer session on [Workshop] AI Pipelines and Agents in Pure TypeScript with Mastra.ai, presented by Nick Nisi, Zack Proser. It adds practical context for how teams are building and operating AI systems in production.
Play video
Securing Agents with Open Standards — Bobby Tiernay and Kam Sween, Auth0
AI Engineer session on Securing Agents with Open Standards, presented by Bobby Tiernay and Kam Sween, Auth0. It adds practical context for how teams are building and operating AI systems in production.
Play video
The emerging skillset of wielding coding agents — Beyang Liu, Sourcegraph / Amp
AI Engineer session on The emerging skillset of wielding coding agents, presented by Beyang Liu, Sourcegraph / Amp. It adds practical context for how teams are building and operating AI systems in production.
Play video
Collaborating with Agents in your Software Dev Workflow - Jon Peck & Christopher Harrison, Microsoft
AI Engineer session on Collaborating with Agents in your Software Dev Workflow - Jon Peck & Christopher Harrison, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Shipping an Enterprise Voice AI Agent in 100 Days - Peter Bar, Intercom Fin
AI Engineer session on Shipping an Enterprise Voice AI Agent in 100 Days - Peter Bar, Intercom Fin. It adds practical context for how teams are building and operating AI systems in production.
Play video
Ship it! Building Production Ready Agents — Mike Chambers, AWS
AI Engineer session on Ship it! Building Production Ready Agents, presented by Mike Chambers, AWS. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agentic Excellence: Mastering AI Agent Evals w/ Azure AI Evaluation SDK — Cedric Vidal, Microsoft
AI Engineer session on Agentic Excellence: Mastering AI Agent Evals w/ Azure AI Evaluation SDK, presented by Cedric Vidal, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
AI Red Teaming Agent: Azure AI Foundry — Nagkumar Arkalgud & Keiji Kanazawa, Microsoft
AI Engineer session on AI Red Teaming Agent: Azure AI Foundry, presented by Nagkumar Arkalgud & Keiji Kanazawa, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Events are the Wrong Abstraction for Your AI Agents - Mason Egger, Temporal.io
AI Engineer session on Events are the Wrong Abstraction for Your AI Agents - Mason Egger, Temporal.io. It adds practical context for how teams are building and operating AI systems in production.
Play video
How agents will unlock the $500B promise of AI - Donald Hruska, Retool
AI Engineer session on How agents will unlock the $500B promise of AI - Donald Hruska, Retool. It adds practical context for how teams are building and operating AI systems in production.
Play video
Claude Code & the evolution of agentic coding — Boris Cherny, Anthropic
AI Engineer session on Claude Code & the evolution of agentic coding, presented by Boris Cherny, Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Code First AI Agents with Azure AI Agent Service — Cedric Vidal, Microsoft
AI Engineer session on Building Code First AI Agents with Azure AI Agent Service, presented by Cedric Vidal, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
[Full Workshop] Reinforcement Learning, Kernels, Reasoning, Quantization & Agents — Daniel Han
AI Engineer session on [Full Workshop] Reinforcement Learning, Kernels, Reasoning, Quantization & Agents, presented by Daniel Han. It adds practical context for how teams are building and operating AI systems in production.
Play video
Training Agentic Reasoners — Will Brown, Prime Intellect
AI Engineer session on Training Agentic Reasoners, presented by Will Brown, Prime Intellect. It adds practical context for how teams are building and operating AI systems in production.
Play video
Introducing Strands Agents, an Open Source AI Agents SDK — Suman Debnath, AWS
AI Engineer session on Introducing Strands Agents, an Open Source AI Agents SDK, presented by Suman Debnath, AWS. It adds practical context for how teams are building and operating AI systems in production.
Play video
Real world MCPs in GitHub Copilot Agent Mode — Jon Peck, Microsoft
AI Engineer session on Real world MCPs in GitHub Copilot Agent Mode, presented by Jon Peck, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Architecting Agent Memory: Principles, Patterns, and Best Practices — Richmond Alake, MongoDB
AI Engineer session on Architecting Agent Memory: Principles, Patterns, and Best Practices, presented by Richmond Alake, MongoDB. It adds practical context for how teams are building and operating AI systems in production.
Play video
Containing Agent Chaos — Solomon Hykes, Dagger
AI Engineer session on Containing Agent Chaos, presented by Solomon Hykes, Dagger. It adds practical context for how teams are building and operating AI systems in production.
Play video
The rise of the agentic economy on the shoulders of MCP — Jan Curn, Apify
AI Engineer session on The rise of the agentic economy on the shoulders of MCP, presented by Jan Curn, Apify. It adds practical context for how teams are building and operating AI systems in production.
Play video
MCP is all you need — Samuel Colvin, Pydantic
AI Engineer session on MCP is all you need, presented by Samuel Colvin, Pydantic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building agent fleet architectures your CISO doesn't hate — Lou Bichard, Gitpod
AI Engineer session on Building agent fleet architectures your CISO doesn't hate, presented by Lou Bichard, Gitpod. It adds practical context for how teams are building and operating AI systems in production.
Play video
Milliseconds to Magic: Real‑Time Workflows using the Gemini Live API and Pipecat
AI Engineer session on Milliseconds to Magic: Real‑Time Workflows using the Gemini Live API and Pipecat. It adds practical context for how teams are building and operating AI systems in production.
Play video
UX Design Principles for Semi Autonomous Multi Agent Systems — Victor Dibia, Microsoft
AI Engineer session on UX Design Principles for Semi Autonomous Multi Agent Systems, presented by Victor Dibia, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
How to Train Your Agent: Building Reliable Agents with RL — Kyle Corbitt, OpenPipe
AI Engineer session on How to Train Your Agent: Building Reliable Agents with RL, presented by Kyle Corbitt, OpenPipe. It adds practical context for how teams are building and operating AI systems in production.
Play video
Memory Masterclass: Make Your AI Agents Remember What They Do! — Mark Bain, AIUS
AI Engineer session on Memory Masterclass: Make Your AI Agents Remember What They Do!, presented by Mark Bain, AIUS. It adds practical context for how teams are building and operating AI systems in production.
Play video
How to build Enterprise Aware Agents - Chau Tran, Glean
AI Engineer session on How to build Enterprise Aware Agents - Chau Tran, Glean. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building voice agents with OpenAI — Dominik Kundel, OpenAI
AI Engineer session on Building voice agents with OpenAI, presented by Dominik Kundel, OpenAI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Agents (the hard parts!) - Rita Kozlov, Cloudflare
AI Engineer session on Building Agents (the hard parts!) - Rita Kozlov, Cloudflare. It adds practical context for how teams are building and operating AI systems in production.
Play video
From Copilot to Colleague: Trustworthy Agents for High-Stakes - Joel Hron, CTO Thomson Reuters
AI Engineer session on From Copilot to Colleague: Trustworthy Agents for High-Stakes - Joel Hron, CTO Thomson Reuters. It adds practical context for how teams are building and operating AI systems in production.
Play video
3 ingredients for building reliable enterprise agents - Harrison Chase, LangChain/LangGraph
AI Engineer session on 3 ingredients for building reliable enterprise agents - Harrison Chase, LangChain/LangGraph. It adds practical context for how teams are building and operating AI systems in production.
Play video
Forget RAG Pipelines — Build Production Ready Agents in 15 Mins: Nina Lopatina, Rajiv Shah, Contextual
AI Engineer session on Forget RAG Pipelines, presented by Build Production Ready Agents in 15 Mins: Nina Lopatina, Rajiv Shah, Contextual. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agents, Access, and the Future of Machine Identity — Nick Nisi (WorkOS) + Lizzie Siegle (Cloudflare)
AI Engineer session on Agents, Access, and the Future of Machine Identity, presented by Nick Nisi (WorkOS) + Lizzie Siegle (Cloudflare). It adds practical context for how teams are building and operating AI systems in production.
Play video
How to Build Planning Agents without losing control - Yogendra Miraje, Factset
AI Engineer session on How to Build Planning Agents without losing control - Yogendra Miraje, Factset. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Agent Awakens: Collaborative Development with Copilot - Christopher Harrison, GitHub
AI Engineer session on The Agent Awakens: Collaborative Development with Copilot - Christopher Harrison, GitHub. It adds practical context for how teams are building and operating AI systems in production.
Play video
From Mixture of Experts to Mixture of Agents with Super Fast Inference - Daniel Kim & Daria Soboleva
AI Engineer session on From Mixture of Experts to Mixture of Agents with Super Fast Inference - Daniel Kim & Daria Soboleva. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Agentic Applications w/ Heroku Managed Inference and Agents — Julián Duque & Anush Dsouza
AI Engineer session on Building Agentic Applications w/ Heroku Managed Inference and Agents, presented by Julián Duque & Anush Dsouza. It adds practical context for how teams are building and operating AI systems in production.
Play video
Taming Rogue AI Agents with Observability-Driven Evaluation — Jim Bennett, Galileo
AI Engineer session on Taming Rogue AI Agents with Observability-Driven Evaluation, presented by Jim Bennett, Galileo. It adds practical context for how teams are building and operating AI systems in production.
Play video
Conquering Agent Chaos — Rick Blalock, Agentuity
AI Engineer session on Conquering Agent Chaos, presented by Rick Blalock, Agentuity. It adds practical context for how teams are building and operating AI systems in production.
Play video
Knowledge Graphs in Litigation Agents — Tom Smoker, WhyHow
AI Engineer session on Knowledge Graphs in Litigation Agents, presented by Tom Smoker, WhyHow. It adds practical context for how teams are building and operating AI systems in production.
Play video
How Not to Read a Headline on AI (ft. new Olympiad Gold, GPT-5 …)
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Remote MCPs: What we learned from shipping — John Welsh, Anthropic
AI Engineer session on Remote MCPs: What we learned from shipping, presented by John Welsh, Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Agent Native Company — Rick Blalock, Agentuity
AI Engineer session on The Agent Native Company, presented by Rick Blalock, Agentuity. It adds practical context for how teams are building and operating AI systems in production.
Play video
Effective AI Agents Need Data Flywheels, Not The Next Biggest LLM — Sylendran Arunagiri, NVIDIA
AI Engineer session on Effective AI Agents Need Data Flywheels, Not The Next Biggest LLM, presented by Sylendran Arunagiri, NVIDIA. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Current State of Browser Agents - Jerry Wu and Wyatt Marshall
AI Engineer session on The Current State of Browser Agents - Jerry Wu and Wyatt Marshall. It adds practical context for how teams are building and operating AI systems in production.
Play video
Letting AI Interface with your App with MCP — Kent C Dodds
AI Engineer session on Letting AI Interface with your App with MCP, presented by Kent C Dodds. It adds practical context for how teams are building and operating AI systems in production.
Play video
MCPs are Boring (or: Why we are losing the Sparkle of LLMs) - Manuel Odendahl
AI Engineer session on MCPs are Boring (or: Why we are losing the Sparkle of LLMs) - Manuel Odendahl. It adds practical context for how teams are building and operating AI systems in production.
Play video
The State of MCP observability: Observable.tools — Alex Volkov and Benjamin Eckel, W&B and Dylibso
AI Engineer session on The State of MCP observability: Observable.tools, presented by Alex Volkov and Benjamin Eckel, W&B and Dylibso. It adds practical context for how teams are building and operating AI systems in production.
Play video
MCP Agent Fine tuning Workshop - Ronan McGovern
AI Engineer session on MCP Agent Fine tuning Workshop - Ronan McGovern. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Protected MCP Servers — Den Delimarsky and Julia Kasper, MCP Steering Committee & Microsoft
AI Engineer session on Building Protected MCP Servers, presented by Den Delimarsky and Julia Kasper, MCP Steering Committee & Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Break It 'Til You Make It: Building the Self-Improving Stack for AI Agents - Aparna Dhinakaran
AI Engineer session on Break It 'Til You Make It: Building the Self-Improving Stack for AI Agents - Aparna Dhinakaran. It adds practical context for how teams are building and operating AI systems in production.
Play video
Will Agent evaluation via MCP Stabilize Agent Networks? - Ari Heljakka
AI Engineer session on Will Agent evaluation via MCP Stabilize Agent Networks? - Ari Heljakka. It adds practical context for how teams are building and operating AI systems in production.
Play video
Real AI Agents Need Planning, Not Just Prompting - Yuval Belfer
AI Engineer session on Real AI Agents Need Planning, Not Just Prompting - Yuval Belfer. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Demo I Wish I'd Had: OpenAI's Agents SDK... serverless! - Brook Riggio
AI Engineer session on The Demo I Wish I'd Had: OpenAI's Agents SDK... serverless! - Brook Riggio. It adds practical context for how teams are building and operating AI systems in production.
Play video
Breaking the Chain: Agent Continuations for Resumable AI Workflows - Greg Benson
AI Engineer session on Breaking the Chain: Agent Continuations for Resumable AI Workflows - Greg Benson. It adds practical context for how teams are building and operating AI systems in production.
Play video
How agents broke app-level infrastructure - Evan Boyle
AI Engineer session on How agents broke app-level infrastructure - Evan Boyle. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Future of Qwen: A Generalist Agent Model — Junyang Lin, Alibaba Qwen
AI Engineer session on The Future of Qwen: A Generalist Agent Model, presented by Junyang Lin, Alibaba Qwen. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agentic Enterprise - What your CEO must know about AI - Hubert Misztela
AI Engineer session on Agentic Enterprise - What your CEO must know about AI - Hubert Misztela. It adds practical context for how teams are building and operating AI systems in production.
Play video
When Will AI Models Blackmail You, and Why?
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Why the Best AI Agents Are Built Without Frameworks (Primitives over Frameworks) — Ahmad Awais, CHAI
AI Engineer session on Why the Best AI Agents Are Built Without Frameworks (Primitives over Frameworks), presented by Ahmad Awais, CHAI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building AI Agents that actually automate Knowledge Work - Jerry Liu, LlamaIndex
AI Engineer session on Building AI Agents that actually automate Knowledge Work - Jerry Liu, LlamaIndex. It adds practical context for how teams are building and operating AI systems in production.
Play video
Blender MCP and The Future Of Creative Tools - Siddharth Ahuja
AI Engineer session on Blender MCP and The Future Of Creative Tools - Siddharth Ahuja. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Reliable Support Agents Using the Effect Typescript Library - Michael Fester
AI Engineer session on Building Reliable Support Agents Using the Effect Typescript Library - Michael Fester. It adds practical context for how teams are building and operating AI systems in production.
Play video
Case Study + Deep Dive: Telemedicine Support Agents with LangGraph/MCP - Dan Mason
AI Engineer session on Case Study + Deep Dive: Telemedicine Support Agents with LangGraph/MCP - Dan Mason. It adds practical context for how teams are building and operating AI systems in production.
Play video
Are MCPs Overhyped? A Rant about MCPs — Henry Mao, Smithery
AI Engineer session on Are MCPs Overhyped? A Rant about MCPs, presented by Henry Mao, Smithery. It adds practical context for how teams are building and operating AI systems in production.
Play video
Exposing Agents as MCP servers with mcp-agent: Sarmad Qadri
AI Engineer session on Exposing Agents as MCP servers with mcp-agent: Sarmad Qadri. It adds practical context for how teams are building and operating AI systems in production.
Play video
Supercharging developer workflow with Amazon Q Developer - Vikash Agrawal
AI Engineer session on Supercharging developer workflow with Amazon Q Developer - Vikash Agrawal. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agents reported thousands of bugs, how many were real? - Ian Butler and Nick Gregory
AI Engineer session on Agents reported thousands of bugs, how many were real? - Ian Butler and Nick Gregory. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Agents with Amazon Nova Act and MCP - Du'An Lightfoot, Amazon (Full Workshop)
AI Engineer session on Building Agents with Amazon Nova Act and MCP - Du'An Lightfoot, Amazon (Full Workshop). It adds practical context for how teams are building and operating AI systems in production.
Play video
MCP: Origins and Requests For Startups — Theodora Chu, Model Context Protocol PM, Anthropic
AI Engineer session on MCP: Origins and Requests For Startups, presented by Theodora Chu, Model Context Protocol PM, Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Cloud CISO Perspectives: How Google secures AI Agents
Google’s CISO perspective on why agents need a new security paradigm and what changes when models can observe, plan, and act.
Play video
Creating Agents that Co-Create — Karina Nguyen, OpenAI
AI Engineer session on Creating Agents that Co-Create, presented by Karina Nguyen, OpenAI. It adds practical context for how teams are building and operating AI systems in production.
Play video
AI Improves at Self-improving
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
o3 breaks (some) records, but AI becomes pay-to-win
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Rethinking how we Scaffold AI Agents - Rahul Sengottuvelu, Ramp
AI Engineer session on Rethinking how we Scaffold AI Agents - Rahul Sengottuvelu, Ramp. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Agent Development Life Cycle — Zack Reneau-Wedeen, Sierra
AI Engineer session on The Agent Development Life Cycle, presented by Zack Reneau-Wedeen, Sierra. It adds practical context for how teams are building and operating AI systems in production.
Play video
Voice Agent Engineering — Nik Caryotakis, SuperDial
AI Engineer session on Voice Agent Engineering, presented by Nik Caryotakis, SuperDial. It adds practical context for how teams are building and operating AI systems in production.
Play video
Cohere: Building enterprise LLM agents that work (Shaan Desai)
AI Engineer session on Cohere: Building enterprise LLM agents that work (Shaan Desai). It adds practical context for how teams are building and operating AI systems in production.
Play video
Why Agent Engineering — swyx
AI Engineer session on Why Agent Engineering, presented by swyx. It adds practical context for how teams are building and operating AI systems in production.
Play video
Patrick Dougherty: How to Build AI Agents that Actually Work
AI Engineer session on Patrick Dougherty: How to Build AI Agents that Actually Work. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building and Scaling an AI Agent Swarm of low latency real time voice bots: Damien Murphy
AI Engineer session on Building and Scaling an AI Agent Swarm of low latency real time voice bots: Damien Murphy. It adds practical context for how teams are building and operating AI systems in production.
Play video
Keynote: Why people think "agent" is a buzzword but it isn't
AI Engineer session on Keynote: Why people think "agent" is a buzzword but it isn't. It adds practical context for how teams are building and operating AI systems in production.
Play video
How We Build Effective Agents: Barry Zhang, Anthropic
AI Engineer session on How We Build Effective Agents: Barry Zhang, Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Using agents to build an agent company: Joao Moura
AI Engineer session on Using agents to build an agent company: Joao Moura. It adds practical context for how teams are building and operating AI systems in production.
Play video
Stateful Agents — Full Workshop with Charles Packer of Letta and MemGPT
AI Engineer session on Stateful Agents, presented by Full Workshop with Charles Packer of Letta and MemGPT. It adds practical context for how teams are building and operating AI systems in production.
Play video
How Coding Agents change Software Development Forever - Hailong Zhang
AI Engineer session on How Coding Agents change Software Development Forever - Hailong Zhang. It adds practical context for how teams are building and operating AI systems in production.
Play video
Reverse Conway's law and GenAI: How agents will take over the organisation - Patrick Debois
AI Engineer session on Reverse Conway's law and GenAI: How agents will take over the organisation - Patrick Debois. It adds practical context for how teams are building and operating AI systems in production.
Play video
Build an AI Research Agent: Apoorva Joshi
AI Engineer session on Build an AI Research Agent: Apoorva Joshi. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Price of Intelligence - AI Agent Pricing in 2025
AI Engineer session on The Price of Intelligence - AI Agent Pricing in 2025. It adds practical context for how teams are building and operating AI systems in production.
Play video
Self Coding Agents — Colin Flaherty, Augment Code
AI Engineer session on Self Coding Agents, presented by Colin Flaherty, Augment Code. It adds practical context for how teams are building and operating AI systems in production.
Play video
Reinforcement Learning for Agents - Will Brown, ML Researcher at Morgan Stanley
AI Engineer session on Reinforcement Learning for Agents - Will Brown, ML Researcher at Morgan Stanley. It adds practical context for how teams are building and operating AI systems in production.
Play video
Ionic Launch: Opening the economy to AI agents
AI Engineer session on Ionic Launch: Opening the economy to AI agents. It adds practical context for how teams are building and operating AI systems in production.
Play video
Disrupting the $15 Trillion Construction Industry with Autonomous Agents: Dr. Sarah Buchner
AI Engineer session on Disrupting the $15 Trillion Construction Industry with Autonomous Agents: Dr. Sarah Buchner. It adds practical context for how teams are building and operating AI systems in production.
Play video
Your AI Agent Isn't an Engineer: The Art of Thoughtful Anthropomorphism
AI Engineer session on Your AI Agent Isn't an Engineer: The Art of Thoughtful Anthropomorphism. It adds practical context for how teams are building and operating AI systems in production.
Play video
Trust, but Verify: Knowledge Agents for Finance Workflows - Mike Conover
AI Engineer session on Trust, but Verify: Knowledge Agents for Finance Workflows - Mike Conover. It adds practical context for how teams are building and operating AI systems in production.
Play video
Ensure AI Agents Work: Evaluation Frameworks for Scaling Success — Aparna Dhinkaran, CEO Arize
AI Engineer session on Ensure AI Agents Work: Evaluation Frameworks for Scaling Success, presented by Aparna Dhinkaran, CEO Arize. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Multi agent Systems with Finite State Machines
AI Engineer session on Building Multi agent Systems with Finite State Machines. It adds practical context for how teams are building and operating AI systems in production.
Play video
AI Agents, Meet Test Driven Development
AI Engineer session on AI Agents, Meet Test Driven Development. It adds practical context for how teams are building and operating AI systems in production.
Play video
How to Improve Your Agents: Academic Lit Review
AI Engineer session on How to Improve Your Agents: Academic Lit Review. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building AI Agents with Real ROI in the Enterprise SDLC: Bruno (Booking.com) & Beyang (Sourcegraph)
AI Engineer session on Building AI Agents with Real ROI in the Enterprise SDLC: Bruno (Booking.com) & Beyang (Sourcegraph). It adds practical context for how teams are building and operating AI systems in production.
Play video
Multi model multimodal and multi agent innovations in Azure AI: Cedric Vidal
AI Engineer session on Multi model multimodal and multi agent innovations in Azure AI: Cedric Vidal. It adds practical context for how teams are building and operating AI systems in production.
Play video
Privacy First Enterprise AI: Building AI Agents that Never Leave Your Security Boundary
AI Engineer session on Privacy First Enterprise AI: Building AI Agents that Never Leave Your Security Boundary. It adds practical context for how teams are building and operating AI systems in production.
Play video
Scaling Agents for Gen AI Products - Anju Kambadur, Bloomberg Head of AI Engineering
AI Engineer session on Scaling Agents for Gen AI Products - Anju Kambadur, Bloomberg Head of AI Engineering. It adds practical context for how teams are building and operating AI systems in production.
Play video
How Windsurf writes 90% of your code with an Agentic IDE - Kevin Hou, Windsurf
AI Engineer session on How Windsurf writes 90% of your code with an Agentic IDE - Kevin Hou, Windsurf. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Reliable Agentic Systems: Eno Reyes
AI Engineer session on Building Reliable Agentic Systems: Eno Reyes. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building and evaluating AI Agents — Sayash Kapoor, AI Snake Oil
AI Engineer session on Building and evaluating AI Agents, presented by Sayash Kapoor, AI Snake Oil. It adds practical context for how teams are building and operating AI systems in production.
Play video
How Deep Research Works - Mukund Sridhar & Aarush Selvan, Google DeepMind
AI Engineer session on How Deep Research Works - Mukund Sridhar & Aarush Selvan, Google DeepMind. It adds practical context for how teams are building and operating AI systems in production.
Play video
Giving a Voice to AI Agents: Scott Stephenson, CEO, Deepgram
AI Engineer session on Giving a Voice to AI Agents: Scott Stephenson, CEO, Deepgram. It adds practical context for how teams are building and operating AI systems in production.
Play video
Architecting and Testing Controllable Agents: Lance Martin
AI Engineer session on Architecting and Testing Controllable Agents: Lance Martin. It adds practical context for how teams are building and operating AI systems in production.
Play video
Personal, Local, Private AI Agents: Soumith Chintala
AI Engineer session on Personal, Local, Private AI Agents: Soumith Chintala. It adds practical context for how teams are building and operating AI systems in production.
Play video
OpenAI for VP's of AI + Advice for Building Agents
AI Engineer session on OpenAI for VP's of AI + Advice for Building Agents. It adds practical context for how teams are building and operating AI systems in production.
Play video
Vercel AI SDK Masterclass: From Fundamentals to Deep Research
AI Engineer session on Vercel AI SDK Masterclass: From Fundamentals to Deep Research. It adds practical context for how teams are building and operating AI systems in production.
Play video
RAG Agents in Prod: 10 Lessons We Learned — Douwe Kiela, creator of RAG
AI Engineer session on RAG Agents in Prod: 10 Lessons We Learned, presented by Douwe Kiela, creator of RAG. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Agents with Model Context Protocol - Full Workshop with Mahesh Murag of Anthropic
AI Engineer session on Building Agents with Model Context Protocol - Full Workshop with Mahesh Murag of Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Beyond APIs: How AI Web Agents Are Automating the "Long Tail" of Knowledge Work
AI Engineer session on Beyond APIs: How AI Web Agents Are Automating the "Long Tail" of Knowledge Work. It adds practical context for how teams are building and operating AI systems in production.
Play video
Personality Driven Development: Exploring the Frontier of Agents with Attitude
AI Engineer session on Personality Driven Development: Exploring the Frontier of Agents with Attitude. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agentic Workflows on Vertex AI: Rukma Sen
AI Engineer session on Agentic Workflows on Vertex AI: Rukma Sen. It adds practical context for how teams are building and operating AI systems in production.
Play video
Voice Agents: the good, the bad, and the ugly
AI Engineer session on Voice Agents: the good, the bad, and the ugly. It adds practical context for how teams are building and operating AI systems in production.
Play video
This video was edited with AI agent. But how?
AI Engineer session on This video was edited with AI agent. But how?. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building State of the Art Open Weights Tool Use: The Command R Family: Sandra Kublik
AI Engineer session on Building State of the Art Open Weights Tool Use: The Command R Family: Sandra Kublik. It adds practical context for how teams are building and operating AI systems in production.
Play video
Emergence Launch: AI Agents and the future enterprise: Dr. Satya Nitta
AI Engineer session on Emergence Launch: AI Agents and the future enterprise: Dr. Satya Nitta. It adds practical context for how teams are building and operating AI systems in production.
Play video
The missing pieces of workflow automation — Shirsha Chaudhuri, Thomson Reuters Labs
AI Engineer session on The missing pieces of workflow automation, presented by Shirsha Chaudhuri, Thomson Reuters Labs. It adds practical context for how teams are building and operating AI systems in production.
Play video
Lets Build An Agent from Scratch
AI Engineer session on Lets Build An Agent from Scratch. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agent Evals: Finally, With The Map
AI Engineer session on Agent Evals: Finally, With The Map. It adds practical context for how teams are building and operating AI systems in production.
Play video
Finetuning: 500m AI agents in production with 2 engineers — Mustafa Ali & Kyle Corbitt
AI Engineer session on Finetuning: 500m AI agents in production with 2 engineers, presented by Mustafa Ali & Kyle Corbitt. It adds practical context for how teams are building and operating AI systems in production.
Play video
Tool Calling Is Not Just Plumbing for AI Agents — Roy Derks
AI Engineer session on Tool Calling Is Not Just Plumbing for AI Agents, presented by Roy Derks. It adds practical context for how teams are building and operating AI systems in production.
Play video
Manus AI - The Calm Before the Hypestorm … (vs Deep Research + Grok 3)
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Announcing AI Protection: Security for the AI era
Google introduced AI Protection and Model Armor to address prompt injection, jailbreaks, data loss, and multicloud AI workload security.
Play video
Claude 3.7 is More Significant than its Name Implies (ft DeepSeek R2 + GPT 4.5 coming soon)
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Nothing Much Happens in AI, Then Everything Does All At Once
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Operator System Card
The Operator system card documents red teaming and mitigation choices for a computer-using agent, with prompt injections listed as a central risk area.
Play video
Altman Expects a ‘Fast Take-off’, ‘Super-Agent’ Debuting Soon and DeepSeek R1 Out
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Enhancing AI safety: Insights and lessons from red teaming
Microsoft summarizes lessons from red teaming more than one hundred generative AI products, emphasizing system-level testing, human expertise, and automation.
Play video
OpenAI Backtracks, Gunning for Superintelligence: Altman Brings His AGI Timeline Closer - '25 to '29
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
OWASP Top 10 for Large Language Model Applications
OWASP’s GenAI security project remains a practical baseline for teams building or assessing LLM applications and agentic systems.
Play video
Never Browse Alone? Gemini 2 Live and ChatGPT Vision
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
AI Breaks Its Silence: OpenAI’s ‘Next 12 Days’, Genie 2, and a Word of Caution
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
New Google Model Ranked ‘No. 1 LLM’, But There’s a Problem
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
The New Claude 3.5 Sonnet: Better, Yes, But Not Just in the Way You Might Think
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
OpenAI: ‘We Just Reached Human-level Reasoning’.
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
New OpenAI Model 'Imminent' and AI Stakes Get Raised (plus Med Gemini, GPT 2 Chatbot and Scale AI)
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
The Age of the Agent: Flo Crivello
AI Engineer session on The Age of the Agent: Flo Crivello. It adds practical context for how teams are building and operating AI systems in production.
Play video
‘Her’ AI, Almost Here? Llama 3, Vasa-1, and Altman ‘Plugging Into Everything You Want To Do’
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
AI Agents Take the Wheel: Devin, SIMA, Figure 01 and The Future of Jobs
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
State of AI 2023: Highlights of 163 Page Report + Eureka Self-Improvement, MEG, Suno AI and GPT F
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
An Actually Big Week in AI: AutoGen, The A-Phone, Mistral 7B, GPT-Fathom and Meta Hunts CharacterAI
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
9 AI Developments: HeyGen 2.0 to AjaxGPT, Open Interpreter to NExT-GPT and Roblox AI
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
AGI Will Not Be A Chatbot - Autonomy, Acceleration, and Arguments Behind the Scenes
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Google Gemini: AlphaGo-GPT?
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
12 New Code Interpreter Uses (Image to 3D, Book Scans, Multiple Datasets, Error Analysis ... )
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
GPT 4 Got Upgraded - Code Interpreter (ft. Image Editing, MP4s, 3D Plots, Data Analytics and more!)
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
GPT 4 is Smarter than You Think: Introducing SmartGPT
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
8 Signs It's The Future: Thought-to-Text, Nvidia Text-to-Video, Character AI, and P(Doom) @Ted
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
‘We Must Slow Down the Race’ – X AI, GPT 4 Can Now Do Science and Altman GPT 5 Statement
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Can GPT 4 Prompt Itself? MemoryGPT, AutoGPT, Jarvis, Claude-Next [10x GPT 4!] and more...
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.