<?xml version="1.0" encoding="UTF-8"?><oembed><type>video</type><version>1.0</version><html>&lt;iframe src=&quot;https://www.loom.com/embed/173c11356f0342359b92975b0e3ede1a&quot; frameborder=&quot;0&quot; width=&quot;1664&quot; height=&quot;1248&quot; webkitallowfullscreen mozallowfullscreen allowfullscreen&gt;&lt;/iframe&gt;</html><height>1248</height><width>1664</width><provider_name>Loom</provider_name><provider_url>https://www.loom.com</provider_url><thumbnail_height>1248</thumbnail_height><thumbnail_width>1664</thumbnail_width><thumbnail_url>https://cdn.loom.com/sessions/thumbnails/173c11356f0342359b92975b0e3ede1a-1688736844966.gif</thumbnail_url><duration>602.868032241</duration><title>Exploring Bench LLM: Evaluating and Comparing LLM-based Solutions 🧪</title><description>Hey there, AI engineers! In this video, I&apos;ll be introducing you to Bench LLM, a Python-based open source library that is revolutionizing the testing of LLMs and AI-powered applications. Bench LLM is a powerful tool used for evaluating the accuracy of LLM-powered products. Today, we&apos;ll be using Bench LLM to evaluate and compare different LLM-based solutions, specifically GPT-3 and run chain agents. We&apos;ll be running a test suite designed to challenge their ability to compute 3-digit multiplications. Join me as we install Bench LLM, import the library, mark the functions we&apos;d like to test, and prepare our test suite. I&apos;ll also show you how to set up your OpenAI API key for BenchLLM. By the end of this video, you&apos;ll have a clear understanding of how Bench LLM works and how it can enhance your AI testing process. Let&apos;s dive in and see how these LLM-based solutions perform! 🚀</description></oembed>