<?xml version="1.0" encoding="UTF-8"?><oembed><type>video</type><version>1.0</version><html>&lt;iframe src=&quot;https://www.loom.com/embed/e2efba2ea55e479cb055c586a0e300ca&quot; frameborder=&quot;0&quot; width=&quot;2214&quot; height=&quot;1660&quot; webkitallowfullscreen mozallowfullscreen allowfullscreen&gt;&lt;/iframe&gt;</html><height>1660</height><width>2214</width><provider_name>Loom</provider_name><provider_url>https://www.loom.com</provider_url><thumbnail_height>1660</thumbnail_height><thumbnail_width>2214</thumbnail_width><thumbnail_url>https://cdn.loom.com/sessions/thumbnails/e2efba2ea55e479cb055c586a0e300ca-7823bf88e2cde484.gif</thumbnail_url><duration>259.746</duration><title>Function Based Protein Hazard Screening Model</title><description>Hey guys, I am Sissi from UC Berkeley, and I presented function based protein hazard screening for DNA screening and synthesis controls at our hackathon. Our ESM embedding classifier achieves 0.996 AUROC under the hardest setting, where no test sequence shares more than 40 percent identity with training, catching 95.7 percent of toxins at a 1 percent false positive rate with a minimal generalization gap. Compared to baselines, ESM2 barely drops on cluster splits. The pipeline is ESM2 mean pooled embeddings into an MLP for a toxin, non toxin confidence score in under a second. I am aiming to integrate this as a secondary screening layer alongside SecureDNA and iBeast comet.</description></oembed>