<?xml version="1.0" encoding="UTF-8"?><oembed><type>video</type><version>1.0</version><html>&lt;iframe src=&quot;https://www.loom.com/embed/233b6074ec874490b124afb2227c6dda&quot; frameborder=&quot;0&quot; width=&quot;1920&quot; height=&quot;1440&quot; webkitallowfullscreen mozallowfullscreen allowfullscreen&gt;&lt;/iframe&gt;</html><height>1440</height><width>1920</width><provider_name>Loom</provider_name><provider_url>https://www.loom.com</provider_url><thumbnail_height>1440</thumbnail_height><thumbnail_width>1920</thumbnail_width><thumbnail_url>https://cdn.loom.com/sessions/thumbnails/233b6074ec874490b124afb2227c6dda-f7cfc7abf2ddf2c9.gif</thumbnail_url><duration>670.054</duration><title>Kyvera: Stress-Testing Voice Agents in Real Conditions</title><description>This Loom demonstrates Cabra, a platform for benchmarking and stress testing voice agents under realistic conditions before production deployment. It generates natural call dialogues, adds filler words, converts the text to audio, applies a telephonic codec, adds background noise, and streams the degraded audio to the agent to test speech-to-text and escalation behavior. In a demo scenario with a frustrated caller complaining about a deducted amount, the agent scored 23% overall and produced garbled ASR output due to language mixing and noise. The trace identified key failures: it did not collect a required account number despite the user prompt, and it escalated to a human even when escalation was instructed not to occur.</description></oembed>