{"type":"video","version":"1.0","html":"<iframe src=\"https://www.loom.com/embed/dd586dc734294914ba4d4e2c16610161\" frameborder=\"0\" width=\"1832\" height=\"1374\" webkitallowfullscreen mozallowfullscreen allowfullscreen></iframe>","height":1374,"width":1832,"provider_name":"Loom","provider_url":"https://www.loom.com","thumbnail_height":1374,"thumbnail_width":1832,"thumbnail_url":"https://cdn.loom.com/sessions/thumbnails/dd586dc734294914ba4d4e2c16610161-e7eb8a86f66473bd.gif","duration":472.638,"title":"SWE - Benchmark - AI Agents - Update","description":"In this video, I discuss recent advances in AI software engineering, focusing on the performance of Devon AI on the SWE benchmark. I highlight how Honeycomb has excelled at resolving GitHub issues and how Amazon Q has saved development time and costs. I also touch on ChatGPT's approach to problem-solving and Sakana AI's work on producing scientific papers. Viewers are encouraged to explore these AI technologies and their potential applications."}