<?xml version="1.0" encoding="UTF-8"?><oembed><type>video</type><version>1.0</version><html>&lt;iframe src=&quot;https://www.loom.com/embed/0ed2e16d43504661908adf201f8187ba&quot; frameborder=&quot;0&quot; width=&quot;1152&quot; height=&quot;864&quot; webkitallowfullscreen mozallowfullscreen allowfullscreen&gt;&lt;/iframe&gt;</html><height>864</height><width>1152</width><provider_name>Loom</provider_name><provider_url>https://www.loom.com</provider_url><thumbnail_height>864</thumbnail_height><thumbnail_width>1152</thumbnail_width><thumbnail_url>https://cdn.loom.com/sessions/thumbnails/0ed2e16d43504661908adf201f8187ba-ddeb678d321d1f76.gif</thumbnail_url><duration>603.862</duration><title>Memorang mini by Aashni!</title><description>In this video, I walk you through the process of creating and testing custom graders for our movie dataset using an LLM style grader called IMDBX, which checks if a given actor is in a movie. I demonstrate how to run multiple graders simultaneously, including numeric tolerance and non-null checks, and share the outcomes of these tests. I also highlight the importance of detailed feedback from the LLM, which helps us understand why certain tests pass or fail. I encourage you to think about potential improvements, such as optimizing the API call process. I&apos;m looking forward to discussing any questions you might have about the architectural and product decisions I&apos;ve made.</description></oembed>