<?xml version="1.0" encoding="UTF-8"?><oembed><type>video</type><version>1.0</version><html>&lt;iframe src=&quot;https://www.loom.com/embed/e1ebc51fd7ac4dd1ac23efea3b58c2ee&quot; frameborder=&quot;0&quot; width=&quot;1920&quot; height=&quot;1440&quot; webkitallowfullscreen mozallowfullscreen allowfullscreen&gt;&lt;/iframe&gt;</html><height>1440</height><width>1920</width><provider_name>Loom</provider_name><provider_url>https://www.loom.com</provider_url><thumbnail_height>1440</thumbnail_height><thumbnail_width>1920</thumbnail_width><thumbnail_url>https://cdn.loom.com/sessions/thumbnails/e1ebc51fd7ac4dd1ac23efea3b58c2ee-8dce62c02631f335.gif</thumbnail_url><duration>462.742</duration><title>Variant AI Lead Qualification and First Response</title><description>This Loom presents Apoorv Gupta’s architecture for Variant, an AI lead qualification and first-response system with an API-first design. It normalizes inputs and runs two LLM steps to classify and respond behind one endpoint, with a full deployment plan that persists to a DB, syncs to a CRM, and uses a queue plus workers for scale. The implementation emphasizes guardrails such as JSON-only outputs that prohibit inventing pricing or features, and includes a live versus mock mode that returns an indicator of the path. He also covers handling garbage input, ambiguous leads, and provider failures with retries, fallback templates, and monitoring via logs, metrics, traces, and alerts for latency, error spikes, and bucket distribution shifts.</description></oembed>