Prompt engineering is not just about better outputs. In practice it shapes reliability, scope, fallback behavior, and how well an AI system resists misuse when instructions, tools, and untrusted content collide.
Prompt Engineering
Prompt design patterns, instruction hierarchy, and defensive prompt construction.
- Instruction hierarchy and role separation
- Clear task boundaries, fallback behavior, and refusal handling
- Prompt structures that support monitoring and repeatable evaluation
- Overloading prompts with too many responsibilities
- Relying on wording instead of system controls
- Treating prompts as static text instead of part of application design
- Teams operating prompt-heavy workflows
- Builders refining assistant and agent behavior
- Reviewers trying to connect prompt design to safety and risk
Current notes, events, and source material
These items are included because they add useful evidence, framing, implementation detail, or upcoming context for teams working in this area.
Play video
The agent-ready web: Simplify user actions with WebMCP — Tara Agyemang, Google
Buying two concert tickets costs an AI agent the entire DOM, the accessibility tree, a screenshot, pixel coordinate math, and then a click that might miss because an ad just loaded and shifted the layout. Tara Agyemang from the Google Chrome team introduces WebMCP, a proposed web standard that replaces that process wit
Play video
Why Can't Anyone Answer Questions About the Business? — Garrett Galow, WorkOS
Every business question that needs SQL follows the same loop: explain the question, wait for an engineer, get an answer, realize it needs one more join, share a one-off in Slack, repeat. Garrett Galow from WorkOS built Studio to break that loop — an internal workspace where anyone can ask questions against Snowflake, L
Play video
How to Keep Shipping When You Walk Away from Your Desk — Zack Proser, WorkOS
Simon Willison fires up four parallel agents and is wiped out by 11am. That is the problem Zack Proser is solving: not that the tools are too slow but that human attention is still the hard constraint. His loop: voice brief at 184 words per minute, agent dispatched to an isolated git worktree, laptop closed, progress c
Play video
Sovereign Escape Velocity: Ownership w Open Models — Gus Martins, & Ian Ballantyne, Google DeepMind
Gemma 4's 31B model sits fourth on the LM Arena open model leaderboard. The models around it are at least twice as large; some are 20 times larger. It runs on a single GPU. Competitors at comparable quality need four or five. Ian Ballantyne and Gus Martins walk through what that size efficiency unlocks: running on a Pi
Play video
Stop Making Models Bigger, Make Them Behave — Kobie Crawdord, Snorkel
Qwen 3 235B was asked for YouTube's year over year ad revenue growth from 2023 to 2024. It queried a table that didn't exist, tried again, got nothing back both times, and hallucinated an answer. The 4B model Snorkel finetuned with RL called `get_table_name` first, inspected the schema, ran a query, hit a column error,
Play video
Self Driving Products: Product Signals to Pull Requests — Joshua Snyder, PostHog
A rage click, a 2am error spike, a customer Slack message — today each sits until a developer notices, triages, tickets, and writes a fix. PostHog is building a pipeline that collapses that chain: signal arrives, a background agent groups it with related errors and session replays, researches the codebase, and opens a
Play video
From Transcription to Live Music: Gemini's Audio Stack — Thor Schaeff, Google DeepMind
One API call to Gemini 3 Flash Preview: speaker labels by name, timestamps, emotion tags, language detection with English translation, and a full summary. That is the audio understanding layer that underlies everything else Thor Schaeff demos here, including speech generation directed by a "director's note" rather than
Play video
RAG is dead, right?? — Kuba Rogut, Turbopuffer
Cursor added semantic search and measured a 24% increase in answer accuracy on their composer model, a 2.6% gain in code retention in large codebases, and a 2.2% drop in dissatisfied user requests. Those numbers look small until you factor in that semantic search does not fire on every query. Meanwhile Google search vo
Play video
2026 AI Engineer Vibe Reel
W are getting ready for the World's Fair in San Francisco - Jun 29 to July 2! https://ai.engineer/wf - get tickets and see schedule!
Play video
GPU Cloud Deployment Without Leaving Your IDE — Audry Hsu, RunPod
The iteration cycle before Flash: commit, push, build a Docker image, pull it from the registry, load it onto a server, allocate a GPU, then find out if it works. Audrey Hsu demos what replacing that with a single decorator looks like — add `@flash.endpoint` to an async Python function and it deploys to GPU cloud from
Play video
Why More Context Makes Your Agent Dumber and What to Do About It — Nupur Sharma, Qodo
Give an agent your full codebase and it will attend to the start and the end, then quietly drop the middle. Nupur from Qodo calls this the U curve and builds the whole talk around it: why growing the context window did not fix the problem, and what actually does. She runs through iterative retrieval, hierarchical summa
Play video
Why Eval++ Is the Next Great Compute Primitive — Sunil Pai & Matt Carrie, Cloudflare
Matt Carrie and Sunil Pai from Cloudflare's agents team explain why Durable Objects turned out to be the right compute unit for AI agents: addressable, persistent, hibernating, stateful, and fast enough that 15ms London latency puts you inside a single animation frame. The Agents SDK builds on this to give resumable st
Play video
Road to 5 Million Tokens: Breaking Barriers in Long Context Training — Max Ryabinin, Together AI
Training a standard LLaMA 3B model with a 3 million token context on a single 8xH100 node fails before you even start: the model parameters alone exhaust GPU memory. Max Ryabinin from Together AI walks through the full stack of techniques needed to get there: fully sharded data parallelism, DeepSpeed Ulysses context pa
Play video
Under 5 minutes to a deployed LLM endpoint — Audry Hsu, RunPod
Two failed crypto mining rigs in a basement in 2022. The founders posted on Reddit offering the GPUs for free in exchange for feedback. That is the origin of RunPod, now at $120 million in annual recurring revenue with 500,000 developers on the platform. The demo runs in under five minutes: pick a model from the Hub, c
Play video
LLM Observability, Evaluation, Experimentation Platform — Dat Ngo, Arize
Your agent called tool B before tool A, and B has a dependency on A. You did not catch it because nothing in your code audits agents. The telemetry does. Dat from Arize AI walks through what observability actually means when the system you are debugging is nondeterministic and the execution path changes with every run.
Play video
From MCP to Scale: Pipelines That Build Themselves — Rafael Levi, Bright Data
Scraping is not the hard part anymore. Maintaining scrapers is. This session shows what it looks like when an agent uses MCP to inspect a site, understand its structure, generate a production scraper, and keep that pipeline working when the site changes. Using Bright Data's MCP, APIs, and browser infrastructure, the fl
Play video
Building safe Payment Infrastructure for the autonomous economy — Steve Kaliski, Stripe
Agents are evolving from calling free APIs to executing real transactions, creating a new challenge: how do we let software spend money autonomously without catastrophic risk? This talk presents Stripe's approach to solving the dual problems of secure credential transmission and making businesses discoverable to agents
Play video
Evals Are Broken, Use Them Anyway — Ara Khan, Cline
Cline started at 43% on Terminal Bench. The improvements came from container CPU and memory settings, raised timeouts, and prompt engineering techniques specific to Anthropic model families that do not transfer to Codex or Gemini. Not from switching to a better model. Ara Khan's argument is that benchmark numbers are n
Play video
Building Interactive UIs in VS Code with MCP Apps — Marlene Mhangami & Liam Hampton, GitHub
The demo profiles a Go app running bubble sort and Fibonacci and the result renders as an interactive flame graph directly inside the VS Code chat window. Not a link. Not a text summary. A live iframe you can scroll and query, sandboxed for the same reason you put a hamster in a cage: so it cannot chew up your VS Code
Play video
Building Agent Interfaces: Lessons from Chrome DevTools (MCP) for Agents — Michael Hablich, Google
Chrome DevTools MCP shipped with one tool: debug_webpage. Agents failed silently because they couldn't compose behaviors. The team decomposed it into 25 focused tools and assumed the problem was solved. It wasn't — now agents had 25 tools and no reliable way to pick the right one. Michael Hablich's talk is an honest ac
Play video
Beyond Transcription: Building Voice AI That Understands Conversations — Hervé Bredin, pyannoteAI
The open ASR leaderboard reports Nvidia Parakeet at 11.4% word error rate on AMI meeting data. Hervé Bredin runs the same model on the same dataset and gets 26%. Same model, same recordings, different microphone: the leaderboard uses headset audio, he uses the table mic. Most voice AI benchmarks are measuring single sp
Play video
Dark Factory: OpenClaw Ships Faster Than You Can Read the Diff — Vincent Koc, OpenClaw
OpenClaw hit 3,000 commits in a single day. Vincent Koc's commit history shows exactly when he goes to sleep and when he wakes up. He and Peter Steinberger ran roughly 60 to 70 agents between them during the great refactor: 2,700 commits, close to a million lines of code changed, 82% of the core codebase touched in one
Play video
AI Engineer Melbourne 2026 Keynote Livestream | Day 2
Live from Federation Square in Melbourne, AI Engineer Melbourne 2026 brings the keynote stage to viewers online in partnership with Web Directions. This is AI Engineer’s first partner event in Australia, featuring keynote-stage sessions from one of the most thoughtfully produced developer events in the region. Watch li
Play video
The Art & Science of Benchmarking Agents — Vincent Chen, Snorkel AI
ARC AGI 3 launched a few weeks before this talk with every task human solvable and frontier models under 1%. That gap is the argument: our ability to measure AI has fallen behind our ability to build it, and benchmarks that actually shape the field are bets on where capabilities are going, not snapshots of where they a
Play video
Text Diffusion — Brendon Dillon, Google DeepMind
GPT-4o answered 40. Gemini 2.5 Flash answered 42 and stuck to it even after working through the reasoning incorrectly. The Gemini Diffusion model, considerably smaller than both, answered 60 on the first forward pass, then 49, then corrected itself to 39 once it finished reasoning. Bidirectional attention means it can
Play video
SWE-rebench: Lessons from Evaluating Coding Agents — Ibragim Badertdinov, Nebius
Claude Code solved SWE rebench tasks by reading git history to find the solution patch. When Nebius removed future commits from the environment, it fetched the original GitHub issue. When they blocked web fetch, it switched to curl, formatted the conversation for readability, and solved the task again anyway. Ibragim B
Play video
BDD, ADR, PRD, WTF: Capturing Decisions for Humans and AI Alike — Michal Cichra, Safe Intelligence
"One thing harder than reading AI code is reading AI tests." Mikuel from Safe Intelligence argues spec driven development leaves a loop open: you have a markdown spec, but how do you know the product actually behaves that way? His answer is Cucumber, nearly forgotten and suddenly useful again. Executable, human readabl
Play video
Beyond Components: Designing Generative UI for MCP Apps — Ruben Casas, Postman
Ruben Casas from Postman prompted a model to rewrite his blog. It built a search box with a blur animation and accessibility out of the box, without being asked. That was when he concluded the model writes better frontend code than he does. His question for the talk: if the models are this capable, why are most agent U
Play video
AI Engineer Melbourne 2026 Keynote Livestream | Day 1
Live from Federation Square in Melbourne, AI Engineer Melbourne 2026 brings the keynote stage to viewers online in partnership with Web Directions. This is AI Engineer’s first partner event in Australia, featuring keynote-stage sessions from one of the most thoughtfully produced developer events in the region. Watch li
Play video
Benchmarking semantic code retrieval on Claude Code — Kuba Rogut, Turbopuffer
By default, Claude Code wastes one in every three file reads. Add windowed grep and that drops to one in five. Add semantic search on top and it drops to one in eight, with file precision climbing from 65% to 87%. Kuba Rogut from Turbopuffer ran a 50-task benchmark against ContextBench to measure not whether the agent
Play video
What Lies Beneath the API — Benjamin Cowen, Modal
Intercom is beating their frontier API at one tenth the cost. Pinterest claims orders of magnitude. Ben Cowen from Modal argues this pattern is not the exception for maturing AI products. It is the destination. Frontier labs want their models to win at everything. You want to win at your specific business logic. Those
Play video
How Lovable self-improves every hour — Benjamin Verbeek, Lovable
Within the first hour of launching the vent tool, the agent filed 20 complaints about a silent file copy failure. The team checked: the tool worked fine. What the agent had caught was that filenames with a space in them silently failed to copy, a bug that never surfaced in logs. Benjamin Verbeek from Lovable built it a
Play video
Build & deploy AI-powered apps — Paige Bailey, Google DeepMind
Got a massive idea but stuck in the "just talking about it" phase? This session cuts the fluff and dives straight into how to build and prototype at lightning speed using AI Studio Build and Antigravity for free. It breaks down Google DeepMind's AI tech stack so viewers know exactly which tools to use, when to reach fo
Play video
Everything I Learned Training Frontier Small Models — Maxime Labonne, Liquid AI
A new class of small models is emerging with the ability to reliably follow instructions and call tools while running on-device under 1 GB of memory. In this talk, we'll break down how to post-train frontier small models using the LFM2.5 recipe: on-policy preference alignment, agentic reinforcement learning, and curric
Play video
One Login to Rule Them All: Cross-App Access for MCP — Garrett Galow, WorkOS
Connecting a coding agent to multiple services often means facing a dozen OAuth consent screens, a dozen token lifecycles, and a dozen chances for something to break. Despite having Single Sign-On, users still find themselves signing in repeatedly. This talk explores how Cross-App Access leverages a three-way trust bet
Play video
Why building eval platforms is hard — Phil Hetzel, Braintrust
An eval platform is not just a test runner. You are building shared definitions of "good," reliable data pipelines, labelling workflows, versioning, and trust in results across many teams and model changes. This session breaks down the hidden complexity, the common failure modes, and the design principles that make eva
Play video
Building your own software factory — Eric Zakariasson, Cursor
Most of us are pair-programming with one agent and stopping there. There's a lot more on the table. This workshop is about going from one agent to many. We'll start with codebase setup, the foundational work that makes agents effective on their own. Then we'll scale up to running agents in parallel, kicking off async w
Play video
Lessons from Scaling GitHub's Remote MCP Server — Sam Morrow, GitHub
GitHub operates one of the most heavily-utilised MCP servers in the ecosystem, with over 4 million downloads of the stdio server alone. Discover the architectural decisions, technical challenges and lessons learned while building and scaling a remote MCP server on production infrastructure. The session walks through th
Play video
Bringing MCPs to the Enterprise — Karan Sampath, Anthropic
MCPs are often flaky, face multiple security vulnerabilities, and are generally hard to scale. Most enterprises struggle to use more than single digit numbers of MCPs due to issues with security, observability, and access control. In this talk, we'll explore the approaches and learnings we at Anthropic have been taking
Play video
Open Models at Google DeepMind — Cassidy Hardin, Google DeepMind
Open models are getting smaller, faster, and far more capable. In this talk, Cassidy Hardin walks through the latest advances in the Gemma family, with a focus on Gemma 4 and what it enables for developers building on-device and open-weight AI systems. She covers the architecture behind Gemma’s dense, effective, and mi
Play video
Collaborative AI Engineering — Maggie Appleton, GitHub Next
Agentic engineering so far has been a solo story: one developer and a dozen agents moving at warp speed. But speed without thoughtful planning and team alignment is just wasting tokens. When everyone on a team is directing agents alone in their personal CLI tools with no shared context, you get duplicate work, conflict
Play video
MCP = Mega Context Problem - Matt Carey
The best MCP server is the one you didn't have to build. At Cloudflare we have a lot of products. Our REST OpenAPI spec is over 2.3 million tokens. When teams started building MCP servers, they did what everyone does: cherry-picked important endpoints for their product, wrote some tool definitions and shipped a separat
Play video
Full Walkthrough: Workflow for AI Coding from Planning to Production — Matt Pocock (@mattpocockuk )
A hands-on workshop covering the full lifecycle of AI-assisted development, from turning ambiguous requirements into agent-ready plans to running autonomous coding agents that ship production features. You'll learn to stress-test vague briefs into structured PRDs, slice work into thin "tracer bullet" vertical slices, a
Play video
The End of Apps — Kitze, Sizzy.co
AI Engineer session on The End of Apps, presented by Kitze, Sizzy.co. It adds practical context for how teams are building and operating AI systems in production.
Play video
How AI is changing Software Engineering: A Conversation with Gergely Orosz, @The Pragmatic Engineer
AI Engineer session on How AI is changing Software Engineering: A Conversation with Gergely Orosz, @The Pragmatic Engineer. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Generative Image & Video models at Scale - Sander Dieleman (Veo and Nano Banana)
AI Engineer session on Building Generative Image & Video models at Scale - Sander Dieleman (Veo and Nano Banana). It adds practical context for how teams are building and operating AI systems in production.
Play video
AIE Miami Day 2 ft. Cerebras, OpenCode, Cursor, Arize AI, and more!
April 21, 2026 - all times in EST -- 9:00am - Welcome to Day 2 -- 9:10am - David House, G2i Transforming Programming Mindsets: Case Studies in Agentic Coding Adoption -- 9:35am - Sarah Chieng, Cerebras Help! We're DEEP in (latency) Debt -- 10:00am - Lech Kalinowski, CallStack Ambient Generative AI: Deploying Latent Dif
Play video
The New Application Layer - Malte Ubl, CTO Vercel
AI Engineer session on The New Application Layer - Malte Ubl, CTO Vercel. It adds practical context for how teams are building and operating AI systems in production.
Play video
Running LLMs on your iPhone: 40 tok/s Gemma 4 with MLX — Adrien Grondin, Locally AI
AI Engineer session on Running LLMs on your iPhone: 40 tok/s Gemma 4 with MLX, presented by Adrien Grondin, Locally AI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Full Workshop: Build Your Own Deep Research Agents - Louis-François Bouchard, Paul Iusztin, Samridhi
AI Engineer session on Full Workshop: Build Your Own Deep Research Agents - Louis-François Bouchard, Paul Iusztin, Samridhi. It adds practical context for how teams are building and operating AI systems in production.
Play video
Taste & Craft: A Conversation with Tuomas Artman, CTO Linear & Gergely Orosz, @The Pragmatic Engineer
AI Engineer session on Taste & Craft: A Conversation with Tuomas Artman, CTO Linear & Gergely Orosz, @The Pragmatic Engineer. It adds practical context for how teams are building and operating AI systems in production.
Play video
AIE Miami Keynote & Talks ft. OpenCode. Google Deepmind, OpenAI, and more!
April 20, 2026 - all times in EST -- 9:00am - Welcome to AI Engineer Miami -- 9:10am - Gabe Greenberg, G2i Opening Remarks -- 9:15am - Dax Raad, OpenCode Keynote -- 9:40am - Dexter Horthy, HumanLayer Everything We got Wrong About RPI -- 10:05am - Max Stoiber, OpenAI Coming Soon -- 10:30am - Morning Break -- 11:00am - B
Play video
Code Mode: Let the Code do the Talking - Sunil Pai, Cloudflare
AI Engineer session on Code Mode: Let the Code do the Talking - Sunil Pai, Cloudflare. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building pi in a World of Slop — Mario Zechner
AI Engineer session on Building pi in a World of Slop, presented by Mario Zechner. It adds practical context for how teams are building and operating AI systems in production.
Play video
Harness Engineering: How to Build Software When Humans Steer, Agents Execute — Ryan Lopopolo, OpenAI
AI Engineer session on Harness Engineering: How to Build Software When Humans Steer, Agents Execute, presented by Ryan Lopopolo, OpenAI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Let LLMs Wander: Engineering RL Environments — Stefano Fiorucci
AI Engineer session on Let LLMs Wander: Engineering RL Environments, presented by Stefano Fiorucci. It adds practical context for how teams are building and operating AI systems in production.
Play video
Why, and how you need to sandbox AI-Generated Code? — Harshil Agrawal, Cloudflare
AI Engineer session on Why, and how you need to sandbox AI-Generated Code?, presented by Harshil Agrawal, Cloudflare. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agentic Engineering: Working With AI, Not Just Using It — Brendan O'Leary
AI Engineer session on Agentic Engineering: Working With AI, Not Just Using It, presented by Brendan O'Leary. It adds practical context for how teams are building and operating AI systems in production.
Play video
Judge the Judge: Building LLM Evaluators That Actually Work with GEPA — Mahmoud Mabrouk, Agenta AI
AI Engineer session on Judge the Judge: Building LLM Evaluators That Actually Work with GEPA, presented by Mahmoud Mabrouk, Agenta AI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Platforms for Humans and Machines: Engineering for the Age of Agents — Juan Herreros Elorza
AI Engineer session on Platforms for Humans and Machines: Engineering for the Age of Agents, presented by Juan Herreros Elorza. It adds practical context for how teams are building and operating AI systems in production.
OpenAI to acquire Promptfoo
OpenAI announced plans to acquire Promptfoo, highlighting automated AI security testing, red teaming, and evaluation as core enterprise requirements.
Play video
DSPy: The End of Prompt Engineering - Kevin Madura, AlixPartners
AI Engineer session on DSPy: The End of Prompt Engineering - Kevin Madura, AlixPartners. It adds practical context for how teams are building and operating AI systems in production.
Play video
How Claude Code Works - Jared Zoneraich, PromptLayer
AI Engineer session on How Claude Code Works - Jared Zoneraich, PromptLayer. It adds practical context for how teams are building and operating AI systems in production.
Play video
Build a Prompt Learning Loop - SallyAnn DeLucia & Fuad Ali, Arize
AI Engineer session on Build a Prompt Learning Loop - SallyAnn DeLucia & Fuad Ali, Arize. It adds practical context for how teams are building and operating AI systems in production.
Play video
OpenAI + @Temporalio : Building Durable, Production Ready Agents - Cornelia Davis, Temporal
AI Engineer session on OpenAI + @Temporalio : Building Durable, Production Ready Agents - Cornelia Davis, Temporal. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building durable Agents with Workflow DevKit & AI SDK - Peter Wielander, Vercel
AI Engineer session on Building durable Agents with Workflow DevKit & AI SDK - Peter Wielander, Vercel. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Intelligent Research Agents with Manus - Ivan Leo, Manus AI (now Meta Superintelligence)
AI Engineer session on Building Intelligent Research Agents with Manus - Ivan Leo, Manus AI (now Meta Superintelligence). It adds practical context for how teams are building and operating AI systems in production.
Play video
Don't Build Agents, Build Skills Instead — Barry Zhang & Mahesh Murag, Anthropic
AI Engineer session on Don't Build Agents, Build Skills Instead, presented by Barry Zhang & Mahesh Murag, Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Hard Won Lessons from Building Effective AI Coding Agents — Nik Pash, Cline
AI Engineer session on Hard Won Lessons from Building Effective AI Coding Agents, presented by Nik Pash, Cline. It adds practical context for how teams are building and operating AI systems in production.
Play video
From Vibe Coding To Vibe Engineering — Kitze, Sizzy
AI Engineer session on From Vibe Coding To Vibe Engineering, presented by Kitze, Sizzy. It adds practical context for how teams are building and operating AI systems in production.
Play video
Context Engineering: Connecting the Dots with Graphs — Stephen Chin, Neo4j
AI Engineer session on Context Engineering: Connecting the Dots with Graphs, presented by Stephen Chin, Neo4j. It adds practical context for how teams are building and operating AI systems in production.
Play video
Small Bets, Big Impact Building GenBI at a Fortune 100 — Asaf Bord, Northwestern Mutual
AI Engineer session on Small Bets, Big Impact Building GenBI at a Fortune 100, presented by Asaf Bord, Northwestern Mutual. It adds practical context for how teams are building and operating AI systems in production.
Play video
Dispatch from the Future: building an AI-native Company — Dan Shipper, Every, AI & I
AI Engineer session on Dispatch from the Future: building an AI-native Company, presented by Dan Shipper, Every, AI & I. It adds practical context for how teams are building and operating AI systems in production.
Play video
Context Platform Engineering to Reduce Token Anxiety — Val Bercovici, WEKA
AI Engineer session on Context Platform Engineering to Reduce Token Anxiety, presented by Val Bercovici, WEKA. It adds practical context for how teams are building and operating AI systems in production.
Play video
Leadership in AI Assisted Engineering — Justin Reock, DX (acq. Atlassian)
AI Engineer session on Leadership in AI Assisted Engineering, presented by Justin Reock, DX (acq. Atlassian). It adds practical context for how teams are building and operating AI systems in production.
Play video
What We Learned Deploying AI within Bloomberg’s Engineering Organization — Lei Zhang, Bloomberg
AI Engineer session on What We Learned Deploying AI within Bloomberg’s Engineering Organization, presented by Lei Zhang, Bloomberg. It adds practical context for how teams are building and operating AI systems in production.
Play video
AI Copilots for Tech Architecture: The Highest-ROI Use Case You’re Not Building — Boris B., Catio
AI Engineer session on AI Copilots for Tech Architecture: The Highest-ROI Use Case You’re Not Building, presented by Boris B., Catio. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Cursor Composer — Lee Robinson, Cursor
AI Engineer session on Building Cursor Composer, presented by Lee Robinson, Cursor. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building in the Gemini Era — Kat Kampf & Ammaar Reshi, Google DeepMind
AI Engineer session on Building in the Gemini Era, presented by Kat Kampf & Ammaar Reshi, Google DeepMind. It adds practical context for how teams are building and operating AI systems in production.
Play video
Minimax M2: Building the #1 Open Model — Olive Song, MiniMax
AI Engineer session on Minimax M2: Building the #1 Open Model, presented by Olive Song, MiniMax. It adds practical context for how teams are building and operating AI systems in production.
Play video
From Arc to Dia: Lessons learned building AI Browsers — Samir Mody, The Browser Company of New York
AI Engineer session on From Arc to Dia: Lessons learned building AI Browsers, presented by Samir Mody, The Browser Company of New York. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Unreasonable Effectiveness of Prompt Learning — Aparna Dhinakaran, Arize
AI Engineer session on The Unreasonable Effectiveness of Prompt Learning, presented by Aparna Dhinakaran, Arize. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agents are Robots Too: What Self-Driving Taught Me About Building Agents — Jesse Hu, Abundant
AI Engineer session on Agents are Robots Too: What Self-Driving Taught Me About Building Agents, presented by Jesse Hu, Abundant. It adds practical context for how teams are building and operating AI systems in production.
Play video
Developer Experience in the Age of AI Coding Agents — Max Kanat-Alexander, Capital One
AI Engineer session on Developer Experience in the Age of AI Coding Agents, presented by Max Kanat-Alexander, Capital One. It adds practical context for how teams are building and operating AI systems in production.
Play video
Code World Model: Building World Models for Computation — Jacob Kahn, FAIR Meta
AI Engineer session on Code World Model: Building World Models for Computation, presented by Jacob Kahn, FAIR Meta. It adds practical context for how teams are building and operating AI systems in production.
Play video
Nano Banana Pro: But Did You Catch These 10 Details?
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Is GPT-5.1 Really an Upgrade? But Models Can Auto-Hack Govts, so … there’s that
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Understanding prompt injections: a frontier security challenge
An accessible explanation of prompt injection risk in real AI products, including how third-party content can redirect or manipulate agent behavior.
Play video
Building an Agentic Platform — Ben Kus, CTO Box
AI Engineer session on Building an Agentic Platform, presented by Ben Kus, CTO Box. It adds practical context for how teams are building and operating AI systems in production.
Play video
Practical tactics to build reliable AI apps — Dmitry Kuchin, Multinear
AI Engineer session on Practical tactics to build reliable AI apps, presented by Dmitry Kuchin, Multinear. It adds practical context for how teams are building and operating AI systems in production.
Play video
How BlackRock Builds Custom Knowledge Apps at Scale — Vaibhav Page & Infant Vasanth, BlackRock
AI Engineer session on How BlackRock Builds Custom Knowledge Apps at Scale, presented by Vaibhav Page & Infant Vasanth, BlackRock. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Alice’s Brain: an AI Sales Rep that Learns Like a Human - Sherwood & Satwik, 11x
AI Engineer session on Building Alice’s Brain: an AI Sales Rep that Learns Like a Human - Sherwood & Satwik, 11x. It adds practical context for how teams are building and operating AI systems in production.
Play video
Evals Are Not Unit Tests — Ido Pesok, Vercel v0
AI Engineer session on Evals Are Not Unit Tests, presented by Ido Pesok, Vercel v0. It adds practical context for how teams are building and operating AI systems in production.
Play video
[Full Workshop] Building Conversational AI Agents - Thor Schaeff, ElevenLabs
AI Engineer session on [Full Workshop] Building Conversational AI Agents - Thor Schaeff, ElevenLabs. It adds practical context for how teams are building and operating AI systems in production.
Play video
Make your LLM app a Domain Expert: How to Build an Expert System — Christopher Lovejoy, Anterior
AI Engineer session on Make your LLM app a Domain Expert: How to Build an Expert System, presented by Christopher Lovejoy, Anterior. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Applications with AI Agents — Michael Albada, Microsoft
AI Engineer session on Building Applications with AI Agents, presented by Michael Albada, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building the platform for agent coordination — Tom Moor, Linear
AI Engineer session on Building the platform for agent coordination, presented by Tom Moor, Linear. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Agents at Cloud Scale — Antje Barth, AWS
AI Engineer session on Building Agents at Cloud Scale, presented by Antje Barth, AWS. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building AI Products That Actually Work — Ben Hylak (Raindrop), Sid Bendre (Oleve)
AI Engineer session on Building AI Products That Actually Work, presented by Ben Hylak (Raindrop), Sid Bendre (Oleve). It adds practical context for how teams are building and operating AI systems in production.
Play video
[Full Workshop] Building Metrics that actually work — David Karam, Pi Labs (fmr Google Search)
AI Engineer session on [Full Workshop] Building Metrics that actually work, presented by David Karam, Pi Labs (fmr Google Search). It adds practical context for how teams are building and operating AI systems in production.
Play video
The 2025 AI Engineering Report — Barr Yaron, Amplify
AI Engineer session on The 2025 AI Engineering Report, presented by Barr Yaron, Amplify. It adds practical context for how teams are building and operating AI systems in production.
Play video
On Engineering AI Systems that Endure The Bitter Lesson - Omar Khattab, DSPy & Databricks
AI Engineer session on On Engineering AI Systems that Endure The Bitter Lesson - Omar Khattab, DSPy & Databricks. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building a Smarter AI Agent with Neural RAG - Will Bryk, Exa.ai
AI Engineer session on Building a Smarter AI Agent with Neural RAG - Will Bryk, Exa.ai. It adds practical context for how teams are building and operating AI systems in production.
Play video
Everything is ugly, so go build something that isn't — Raiza Martin, Huxe (ex NotebookLM)
AI Engineer session on Everything is ugly, so go build something that isn't, presented by Raiza Martin, Huxe (ex NotebookLM). It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Effective Voice Agents — Toki Sherbakov + Anoop Kotha, OpenAI
AI Engineer session on Building Effective Voice Agents, presented by Toki Sherbakov + Anoop Kotha, OpenAI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Revenue Engineering: How to Price (and Reprice) Your AI Product — Kshitij Grover, Orb
AI Engineer session on Revenue Engineering: How to Price (and Reprice) Your AI Product, presented by Kshitij Grover, Orb. It adds practical context for how teams are building and operating AI systems in production.
Play video
Survive the AI Knife Fight: Building Products That Win — Brian Balfour, Reforge
AI Engineer session on Survive the AI Knife Fight: Building Products That Win, presented by Brian Balfour, Reforge. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Build-Operate Divide: Bridging Product Vision and AI Operational Reality
AI Engineer session on The Build-Operate Divide: Bridging Product Vision and AI Operational Reality. It adds practical context for how teams are building and operating AI systems in production.
Play video
"Data readiness" is a Myth: Reliable AI with an Agentic Semantic Layer — Anushrut Gupta, PromptQL
AI Engineer session on "Data readiness" is a Myth: Reliable AI with an Agentic Semantic Layer, presented by Anushrut Gupta, PromptQL. It adds practical context for how teams are building and operating AI systems in production.
Play video
From Hype to Habit: How We’re Building an AI-First SaaS Company — While Still Shipping the Roadmap
AI Engineer session on From Hype to Habit: How We’re Building an AI-First SaaS Company, presented by While Still Shipping the Roadmap. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Multimodal AI Agents From Scratch — Apoorva Joshi, MongoDB
AI Engineer session on Building Multimodal AI Agents From Scratch, presented by Apoorva Joshi, MongoDB. It adds practical context for how teams are building and operating AI systems in production.
Play video
How to build world-class AI products — Sarah Sachs (AI lead @ Notion) & Carlos Esteban (Braintrust)
AI Engineer session on How to build world-class AI products, presented by Sarah Sachs (AI lead @ Notion) & Carlos Esteban (Braintrust). It adds practical context for how teams are building and operating AI systems in production.
Play video
Fun stories from building OpenRouter and where all this is going - Alex Atallah, OpenRouter
AI Engineer session on Fun stories from building OpenRouter and where all this is going - Alex Atallah, OpenRouter. It adds practical context for how teams are building and operating AI systems in production.
Play video
Build Dynamic Products, and Stop the AI Sideshow — Eliza Cabrera (Workday) + Jeremy Silva (Freeplay)
AI Engineer session on Build Dynamic Products, and Stop the AI Sideshow, presented by Eliza Cabrera (Workday) + Jeremy Silva (Freeplay). It adds practical context for how teams are building and operating AI systems in production.
Play video
Ship it! Building Production Ready Agents — Mike Chambers, AWS
AI Engineer session on Ship it! Building Production Ready Agents, presented by Mike Chambers, AWS. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Code First AI Agents with Azure AI Agent Service — Cedric Vidal, Microsoft
AI Engineer session on Building Code First AI Agents with Azure AI Agent Service, presented by Cedric Vidal, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building a 10 person unicorn - Max Brodeur-Urbas, Gumloop
AI Engineer session on Building a 10 person unicorn - Max Brodeur-Urbas, Gumloop. It adds practical context for how teams are building and operating AI systems in production.
Play video
Data is Your Differentiator: Building Secure and Tailored AI Systems — Mani Khanuja, AWS
AI Engineer session on Data is Your Differentiator: Building Secure and Tailored AI Systems, presented by Mani Khanuja, AWS. It adds practical context for how teams are building and operating AI systems in production.
Play video
Mastering Engineering Flow with Windsurf - Eashan Sinha, Windsurf
AI Engineer session on Mastering Engineering Flow with Windsurf - Eashan Sinha, Windsurf. It adds practical context for how teams are building and operating AI systems in production.
Play video
Prompt Engineering and AI Red Teaming — Sander Schulhoff, HackAPrompt/LearnPrompting
AI Engineer session on Prompt Engineering and AI Red Teaming, presented by Sander Schulhoff, HackAPrompt/LearnPrompting. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building agent fleet architectures your CISO doesn't hate — Lou Bichard, Gitpod
AI Engineer session on Building agent fleet architectures your CISO doesn't hate, presented by Lou Bichard, Gitpod. It adds practical context for how teams are building and operating AI systems in production.
Play video
How to Train Your Agent: Building Reliable Agents with RL — Kyle Corbitt, OpenPipe
AI Engineer session on How to Train Your Agent: Building Reliable Agents with RL, presented by Kyle Corbitt, OpenPipe. It adds practical context for how teams are building and operating AI systems in production.
Play video
Using OSS models to build AI apps with millions of users — Hassan El Mghari
AI Engineer session on Using OSS models to build AI apps with millions of users, presented by Hassan El Mghari. It adds practical context for how teams are building and operating AI systems in production.
Play video
How to build Enterprise Aware Agents - Chau Tran, Glean
AI Engineer session on How to build Enterprise Aware Agents - Chau Tran, Glean. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building voice agents with OpenAI — Dominik Kundel, OpenAI
AI Engineer session on Building voice agents with OpenAI, presented by Dominik Kundel, OpenAI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Agents (the hard parts!) - Rita Kozlov, Cloudflare
AI Engineer session on Building Agents (the hard parts!) - Rita Kozlov, Cloudflare. It adds practical context for how teams are building and operating AI systems in production.
Play video
Prompt Engineering is Dead — Nir Gazit, Traceloop
AI Engineer session on Prompt Engineering is Dead, presented by Nir Gazit, Traceloop. It adds practical context for how teams are building and operating AI systems in production.
Play video
3 ingredients for building reliable enterprise agents - Harrison Chase, LangChain/LangGraph
AI Engineer session on 3 ingredients for building reliable enterprise agents - Harrison Chase, LangChain/LangGraph. It adds practical context for how teams are building and operating AI systems in production.
Play video
Forget RAG Pipelines — Build Production Ready Agents in 15 Mins: Nina Lopatina, Rajiv Shah, Contextual
AI Engineer session on Forget RAG Pipelines, presented by Build Production Ready Agents in 15 Mins: Nina Lopatina, Rajiv Shah, Contextual. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agents, Access, and the Future of Machine Identity — Nick Nisi (WorkOS) + Lizzie Siegle (Cloudflare)
AI Engineer session on Agents, Access, and the Future of Machine Identity, presented by Nick Nisi (WorkOS) + Lizzie Siegle (Cloudflare). It adds practical context for how teams are building and operating AI systems in production.
Play video
Rethinking Team Building: how a 30-person Startup serves 50 Million Users — Grant Lee, Gamma
AI Engineer session on Rethinking Team Building: how a 30-person Startup serves 50 Million Users, presented by Grant Lee, Gamma. It adds practical context for how teams are building and operating AI systems in production.
Play video
How to Build Planning Agents without losing control - Yogendra Miraje, Factset
AI Engineer session on How to Build Planning Agents without losing control - Yogendra Miraje, Factset. It adds practical context for how teams are building and operating AI systems in production.
Play video
Engineering Better Evals: Scalable LLM Evaluation Pipelines That Work — Dat Ngo, Aman Khan, Arize
AI Engineer session on Engineering Better Evals: Scalable LLM Evaluation Pipelines That Work, presented by Dat Ngo, Aman Khan, Arize. It adds practical context for how teams are building and operating AI systems in production.
Play video
Does AI Actually Boost Developer Productivity? (100k Devs Study) - Yegor Denisov-Blanch, Stanford
AI Engineer session on Does AI Actually Boost Developer Productivity? (100k Devs Study) - Yegor Denisov-Blanch, Stanford. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Agentic Applications w/ Heroku Managed Inference and Agents — Julián Duque & Anush Dsouza
AI Engineer session on Building Agentic Applications w/ Heroku Managed Inference and Agents, presented by Julián Duque & Anush Dsouza. It adds practical context for how teams are building and operating AI systems in production.
Play video
AI Engineering with the Google Gemini 2.5 Model Family - Philipp Schmid, Google DeepMind
AI Engineer session on AI Engineering with the Google Gemini 2.5 Model Family - Philipp Schmid, Google DeepMind. It adds practical context for how teams are building and operating AI systems in production.
Play video
Grok 4 - 10 New Things to Know
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Buy Now, Maybe Pay Later: Dealing with Prompt-Tax While Staying at the Frontier - Andrew Thomspson
AI Engineer session on Buy Now, Maybe Pay Later: Dealing with Prompt-Tax While Staying at the Frontier - Andrew Thomspson. It adds practical context for how teams are building and operating AI systems in production.
Play video
Stop Ordering AI Takeout A Cookbook for Winning When You Build In House - Jan Siml
AI Engineer session on Stop Ordering AI Takeout A Cookbook for Winning When You Build In House - Jan Siml. It adds practical context for how teams are building and operating AI systems in production.
Play video
From PM at Stripe to Building an AI startup, a recent founder's journey - Mounir Mouawad
AI Engineer session on From PM at Stripe to Building an AI startup, a recent founder's journey - Mounir Mouawad. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Protected MCP Servers — Den Delimarsky and Julia Kasper, MCP Steering Committee & Microsoft
AI Engineer session on Building Protected MCP Servers, presented by Den Delimarsky and Julia Kasper, MCP Steering Committee & Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Break It 'Til You Make It: Building the Self-Improving Stack for AI Agents - Aparna Dhinakaran
AI Engineer session on Break It 'Til You Make It: Building the Self-Improving Stack for AI Agents - Aparna Dhinakaran. It adds practical context for how teams are building and operating AI systems in production.
Play video
Real AI Agents Need Planning, Not Just Prompting - Yuval Belfer
AI Engineer session on Real AI Agents Need Planning, Not Just Prompting - Yuval Belfer. It adds practical context for how teams are building and operating AI systems in production.
Play video
When Will AI Models Blackmail You, and Why?
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Veo 3 for Developers — Paige Bailey, Google DeepMind
AI Engineer session on Veo 3 for Developers, presented by Paige Bailey, Google DeepMind. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building AI Agents that actually automate Knowledge Work - Jerry Liu, LlamaIndex
AI Engineer session on Building AI Agents that actually automate Knowledge Work - Jerry Liu, LlamaIndex. It adds practical context for how teams are building and operating AI systems in production.
Play video
How to Build Trustworthy AI — Allie Howe
AI Engineer session on How to Build Trustworthy AI, presented by Allie Howe. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Reliable Support Agents Using the Effect Typescript Library - Michael Fester
AI Engineer session on Building Reliable Support Agents Using the Effect Typescript Library - Michael Fester. It adds practical context for how teams are building and operating AI systems in production.
Play video
Supercharging developer workflow with Amazon Q Developer - Vikash Agrawal
AI Engineer session on Supercharging developer workflow with Amazon Q Developer - Vikash Agrawal. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Agents with Amazon Nova Act and MCP - Du'An Lightfoot, Amazon (Full Workshop)
AI Engineer session on Building Agents with Amazon Nova Act and MCP - Du'An Lightfoot, Amazon (Full Workshop). It adds practical context for how teams are building and operating AI systems in production.
Play video
Arrakis: How To Build An AI Sandbox From Scratch - Abhishek Bhardwaj, OpenAI
AI Engineer session on Arrakis: How To Build An AI Sandbox From Scratch - Abhishek Bhardwaj, OpenAI. It adds practical context for how teams are building and operating AI systems in production.
Play video
How to Build Your Own AI Data Center in 2025 — Paul Gilbert, Arista Networks
AI Engineer session on How to Build Your Own AI Data Center in 2025, presented by Paul Gilbert, Arista Networks. It adds practical context for how teams are building and operating AI systems in production.
Play video
AI Improves at Self-improving
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
AI Engineering at Jane Street - John Crepezzi
AI Engineer session on AI Engineering at Jane Street - John Crepezzi. It adds practical context for how teams are building and operating AI systems in production.
Play video
Knowledge Graphs & GraphRAG: Techniques for Building Effective GenAI Applications: Zach Blumenthal
AI Engineer session on Knowledge Graphs & GraphRAG: Techniques for Building Effective GenAI Applications: Zach Blumenthal. It adds practical context for how teams are building and operating AI systems in production.
Play video
Don't just slap on a chatbot: building AI that works before you ask
AI Engineer session on Don't just slap on a chatbot: building AI that works before you ask. It adds practical context for how teams are building and operating AI systems in production.
Play video
Voice Agent Engineering — Nik Caryotakis, SuperDial
AI Engineer session on Voice Agent Engineering, presented by Nik Caryotakis, SuperDial. It adds practical context for how teams are building and operating AI systems in production.
Play video
Cohere: Building enterprise LLM agents that work (Shaan Desai)
AI Engineer session on Cohere: Building enterprise LLM agents that work (Shaan Desai). It adds practical context for how teams are building and operating AI systems in production.
Play video
Why Agent Engineering — swyx
AI Engineer session on Why Agent Engineering, presented by swyx. It adds practical context for how teams are building and operating AI systems in production.
Play video
AI Platform Engineering: Patrick Debois
AI Engineer session on AI Platform Engineering: Patrick Debois. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Hidden Costs of Building Your Own RAG Stack — Ofer Vectara
AI Engineer session on The Hidden Costs of Building Your Own RAG Stack, presented by Ofer Vectara. It adds practical context for how teams are building and operating AI systems in production.
Play video
Prompt Engineering Tactics: Dan Cleary
AI Engineer session on Prompt Engineering Tactics: Dan Cleary. It adds practical context for how teams are building and operating AI systems in production.
Play video
Patrick Dougherty: How to Build AI Agents that Actually Work
AI Engineer session on Patrick Dougherty: How to Build AI Agents that Actually Work. It adds practical context for how teams are building and operating AI systems in production.
Play video
Unlocking Developer Productivity across CPU and GPU with MAX: Chris Lattner
AI Engineer session on Unlocking Developer Productivity across CPU and GPU with MAX: Chris Lattner. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building and Scaling an AI Agent Swarm of low latency real time voice bots: Damien Murphy
AI Engineer session on Building and Scaling an AI Agent Swarm of low latency real time voice bots: Damien Murphy. It adds practical context for how teams are building and operating AI systems in production.
Play video
From Software Developer to AI Engineer: Antje Barth
AI Engineer session on From Software Developer to AI Engineer: Antje Barth. It adds practical context for how teams are building and operating AI systems in production.
Play video
AI Music Generation, From Prompt to Production: Phlo Young
AI Engineer session on AI Music Generation, From Prompt to Production: Phlo Young. It adds practical context for how teams are building and operating AI systems in production.
Play video
How We Build Effective Agents: Barry Zhang, Anthropic
AI Engineer session on How We Build Effective Agents: Barry Zhang, Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Using agents to build an agent company: Joao Moura
AI Engineer session on Using agents to build an agent company: Joao Moura. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building security around ML: Dr. Andrew Davis
AI Engineer session on Building security around ML: Dr. Andrew Davis. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Model Isn’t Wrong — You’re Just Bad at Prompting
AI Engineer session on The Model Isn’t Wrong, presented by You’re Just Bad at Prompting. It adds practical context for how teams are building and operating AI systems in production.
Play video
Build an AI Research Agent: Apoorva Joshi
AI Engineer session on Build an AI Research Agent: Apoorva Joshi. It adds practical context for how teams are building and operating AI systems in production.
Play video
Hiring & Building an AI Engineering Team: Dr. Bryan Bischof
AI Engineer session on Hiring & Building an AI Engineering Team: Dr. Bryan Bischof. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Multi agent Systems with Finite State Machines
AI Engineer session on Building Multi agent Systems with Finite State Machines. It adds practical context for how teams are building and operating AI systems in production.
Play video
Iterating on LLM apps at scale Learnings from Discord: Ian Webster
AI Engineer session on Iterating on LLM apps at scale Learnings from Discord: Ian Webster. It adds practical context for how teams are building and operating AI systems in production.
Play video
Build, Evaluate and Deploy a RAG-Based Retail Copilot with Azure AI: Cedric Vidal and David Smith
AI Engineer session on Build, Evaluate and Deploy a RAG-Based Retail Copilot with Azure AI: Cedric Vidal and David Smith. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building AI Agents with Real ROI in the Enterprise SDLC: Bruno (Booking.com) & Beyang (Sourcegraph)
AI Engineer session on Building AI Agents with Real ROI in the Enterprise SDLC: Bruno (Booking.com) & Beyang (Sourcegraph). It adds practical context for how teams are building and operating AI systems in production.
Play video
Stop Guessing: Build Robust AI with Layered CoT
AI Engineer session on Stop Guessing: Build Robust AI with Layered CoT. It adds practical context for how teams are building and operating AI systems in production.
Play video
Lessons from building GenAI based applications — Juan Peredo
AI Engineer session on Lessons from building GenAI based applications, presented by Juan Peredo. It adds practical context for how teams are building and operating AI systems in production.
Play video
RAG at scale: production ready GenAI apps with Azure AI Search
AI Engineer session on RAG at scale: production ready GenAI apps with Azure AI Search. It adds practical context for how teams are building and operating AI systems in production.
Play video
Privacy First Enterprise AI: Building AI Agents that Never Leave Your Security Boundary
AI Engineer session on Privacy First Enterprise AI: Building AI Agents that Never Leave Your Security Boundary. It adds practical context for how teams are building and operating AI systems in production.
Play video
Scaling Agents for Gen AI Products - Anju Kambadur, Bloomberg Head of AI Engineering
AI Engineer session on Scaling Agents for Gen AI Products - Anju Kambadur, Bloomberg Head of AI Engineering. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Reliable Agentic Systems: Eno Reyes
AI Engineer session on Building Reliable Agentic Systems: Eno Reyes. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building and evaluating AI Agents — Sayash Kapoor, AI Snake Oil
AI Engineer session on Building and evaluating AI Agents, presented by Sayash Kapoor, AI Snake Oil. It adds practical context for how teams are building and operating AI systems in production.
Play video
How to build the world's fastest voice bot: Kwindla Hultman Kramer
AI Engineer session on How to build the world's fastest voice bot: Kwindla Hultman Kramer. It adds practical context for how teams are building and operating AI systems in production.
Play video
AI Engineering Without Borders — swyx
AI Engineer session on AI Engineering Without Borders, presented by swyx. It adds practical context for how teams are building and operating AI systems in production.
Play video
Open Challenges for AI Engineering: Simon Willison
AI Engineer session on Open Challenges for AI Engineering: Simon Willison. It adds practical context for how teams are building and operating AI systems in production.
Play video
How Zapier Builds AI Products and Features with the Help of Braintrust: Ankur Goyal & Olmo Maldonado
AI Engineer session on How Zapier Builds AI Products and Features with the Help of Braintrust: Ankur Goyal & Olmo Maldonado. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building with Anthropic Claude: Prompt Workshop with Zack Witten
AI Engineer session on Building with Anthropic Claude: Prompt Workshop with Zack Witten. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building an AI assistant that makes phone calls [Convex Workshop]
AI Engineer session on Building an AI assistant that makes phone calls [Convex Workshop]. It adds practical context for how teams are building and operating AI systems in production.
Play video
OpenAI for VP's of AI + Advice for Building Agents
AI Engineer session on OpenAI for VP's of AI + Advice for Building Agents. It adds practical context for how teams are building and operating AI systems in production.
Play video
Vercel AI SDK Masterclass: From Fundamentals to Deep Research
AI Engineer session on Vercel AI SDK Masterclass: From Fundamentals to Deep Research. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Agents with Model Context Protocol - Full Workshop with Mahesh Murag of Anthropic
AI Engineer session on Building Agents with Model Context Protocol - Full Workshop with Mahesh Murag of Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Build enterprise generative AI apps using Llama 3 at 1,000 tokens/s on the SambaNova AI platform
AI Engineer session on Build enterprise generative AI apps using Llama 3 at 1,000 tokens/s on the SambaNova AI platform. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building LinkedIn's GenAI Platform — Xiaofeng Wang
AI Engineer session on Building LinkedIn's GenAI Platform, presented by Xiaofeng Wang. It adds practical context for how teams are building and operating AI systems in production.
Play video
Lessons from the Trenches: Building LLM Evals That Work IRL: Aparna Dhinkaran
AI Engineer session on Lessons from the Trenches: Building LLM Evals That Work IRL: Aparna Dhinkaran. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building efficient hybrid context query for LLM grounding: Simrat Hanspal
AI Engineer session on Building efficient hybrid context query for LLM grounding: Simrat Hanspal. It adds practical context for how teams are building and operating AI systems in production.
Play video
Lessons From A Year Building With LLMs
AI Engineer session on Lessons From A Year Building With LLMs. It adds practical context for how teams are building and operating AI systems in production.
Play video
Insights on Building AI Teams — Heath Black, SignalFire
AI Engineer session on Insights on Building AI Teams, presented by Heath Black, SignalFire. It adds practical context for how teams are building and operating AI systems in production.
Play video
Keynote: The AI developer experience doesn't have to suck — why and how we built Modal
AI Engineer session on Keynote: The AI developer experience doesn't have to suck, presented by why and how we built Modal. It adds practical context for how teams are building and operating AI systems in production.
Play video
The LLM Triangle: Engineering Principles for Robust AI Applications - Almog Baku:
AI Engineer session on The LLM Triangle: Engineering Principles for Robust AI Applications - Almog Baku:. It adds practical context for how teams are building and operating AI systems in production.
Play video
[Full Workshop from Microsoft] Github Copilot - The World's Most Widely Adopted AI Developer Tool
AI Engineer session on [Full Workshop from Microsoft] Github Copilot - The World's Most Widely Adopted AI Developer Tool. It adds practical context for how teams are building and operating AI systems in production.
Play video
GitHub Copilot: The World's Most Widely Adopted AI Developer Tool
AI Engineer session on GitHub Copilot: The World's Most Widely Adopted AI Developer Tool. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building State of the Art Open Weights Tool Use: The Command R Family: Sandra Kublik
AI Engineer session on Building State of the Art Open Weights Tool Use: The Command R Family: Sandra Kublik. It adds practical context for how teams are building and operating AI systems in production.
Play video
Lets Build An Agent from Scratch
AI Engineer session on Lets Build An Agent from Scratch. It adds practical context for how teams are building and operating AI systems in production.
Play video
AI CEO: ‘Stock Crash Could Stop AI Progress’, Llama 4 Anti-climax + ‘Superintelligence in 2027’ ...
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Gemini 2.5 Pro - It’s a Darn Smart Chatbot … (New Simple High Score)
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
OpenAI’s New ImageGen is Unexpectedly Epic … (ft. Reve, Imagen 3, Midjourney etc)
This AI Explained video reviews a major AI development through the lens of multimodal generation and provenance. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Claude 3.7 is More Significant than its Name Implies (ft DeepSeek R2 + GPT 4.5 coming soon)
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Nothing Much Happens in AI, Then Everything Does All At Once
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
AI - 2024AD: 212-page Report (from this morning) Fully Read w/ Highlights
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
o1 - What is Going On? Why o1 is a 3rd Paradigm of Model + 10 Things You Might Not Know
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Grok-2 Actually Out, But What If It Were 10,000x the Size?
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
How Far Can We Scale AI? Gen 3, Claude 3.5 Sonnet and AI Hype
This AI Explained video reviews a major AI development through the lens of AI safety and model behavior. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
New OpenAI Model 'Imminent' and AI Stakes Get Raised (plus Med Gemini, GPT 2 Chatbot and Scale AI)
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Principles for Prompt Engineering - Karina Nguyen (Claude Instant @ Anthropic)
AI Engineer session on Principles for Prompt Engineering - Karina Nguyen (Claude Instant @ Anthropic). It adds practical context for how teams are building and operating AI systems in production.
Play video
Open Questions for AI Engineering: Simon Willison
AI Engineer session on Open Questions for AI Engineering: Simon Willison. It adds practical context for how teams are building and operating AI systems in production.
Play video
[Workshop] AI Engineering 101
AI Engineer session on [Workshop] AI Engineering 101. It adds practical context for how teams are building and operating AI systems in production.
Play video
Storyteller: Building Multi-modal Apps with TS & ModelFusion - Lars Grammel, PhD
AI Engineer session on Storyteller: Building Multi-modal Apps with TS & ModelFusion - Lars Grammel, PhD. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Blocks for LLM Systems & Products: Eugene Yan
AI Engineer session on Building Blocks for LLM Systems & Products: Eugene Yan. It adds practical context for how teams are building and operating AI systems in production.
Play video
[Workshop] AI Engineering 201: Inference
AI Engineer session on [Workshop] AI Engineering 201: Inference. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Production-Ready RAG Applications: Jerry Liu
AI Engineer session on Building Production-Ready RAG Applications: Jerry Liu. It adds practical context for how teams are building and operating AI systems in production.
Play video
Using AI to Build an Infinite Game: Jeff Schomay
AI Engineer session on Using AI to Build an Infinite Game: Jeff Schomay. It adds practical context for how teams are building and operating AI systems in production.
Play video
GPT Web App Generator - 10,000 apps created in a month: Matija Sosic
AI Engineer session on GPT Web App Generator - 10,000 apps created in a month: Matija Sosic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Context-Aware Reasoning Applications with LangChain and LangSmith: Harrison Chase
AI Engineer session on Building Context-Aware Reasoning Applications with LangChain and LangSmith: Harrison Chase. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building AI For All: Amjad Masad & Michele Catasta
AI Engineer session on Building AI For All: Amjad Masad & Michele Catasta. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Reactive AI Apps: Matt Welsh
AI Engineer session on Building Reactive AI Apps: Matt Welsh. It adds practical context for how teams are building and operating AI systems in production.
Play video
AI Engineering 201: The Rest of the Owl
AI Engineer session on AI Engineering 201: The Rest of the Owl. It adds practical context for how teams are building and operating AI systems in production.
Play video
Gemini Ultra - Full Review
This AI Explained video reviews a major AI development through the lens of scaling and compute economics. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
AI On An Exponential? Data, Mamba, and More
This AI Explained video reviews a major AI development through the lens of scaling and compute economics. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Phi-2, Imagen-2, Optimus-Gen-2: Small New Models to Change the World?
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
OpenAI Insights and Training Data Shenanigans - 7 'Complicated' Developments + Guest Star
This AI Explained video reviews a major AI development through the lens of model capability and AI systems in practice. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
AI Declarations and AGI Timelines – Looking More Optimistic?
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
RT-X and the Dawn of Large Multimodal Models: Google Breakthrough and 160-page Report Highlights
This AI Explained video reviews a major AI development through the lens of multimodal generation and provenance. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
ChatGPT Fails Basic Logic but Now Has Vision, Wins at Chess and Prompts a Masterpiece
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
11 Major AI Developments: RT-2 to '100X GPT-4'
This AI Explained video reviews a major AI development through the lens of AI safety and model behavior. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
ChatGPT's Achilles' Heel
This AI Explained video reviews a major AI development through the lens of scaling and compute economics. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
'Show Your Working': ChatGPT Performance Doubled w/ Process Rewards (+Synthetic Data Event Horizon)
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
GPT 4 is Smarter than You Think: Introducing SmartGPT
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Can GPT 4 Prompt Itself? MemoryGPT, AutoGPT, Jarvis, Claude-Next [10x GPT 4!] and more...
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Google Bard - The Full Review. Bard vs Bing [LaMDA vs GPT 4]
This AI Explained video reviews a major AI development through the lens of multimodal generation and provenance. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
8 New Ways to Use Bing's Upgraded 8 [now 20] Message Limit (ft. pdfs, quizzes, tables, scenarios...)
This AI Explained video reviews a major AI development through the lens of model capability and AI systems in practice. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
9 of the Best Bing (GPT 4) Prompts
This AI Explained video reviews a major AI development through the lens of model capability and AI systems in practice. It is useful context for AI engineering, evaluation, governance, and operational risk.