Ravi Devgam
Ravi Devgam @RaviDevgam ·
Multimodal AI isn't just generating content; it's generating *reality*. As models seamlessly fuse sight, sound, & text, our brains are losing the ability to discern what's real from what's perfectly synthesized. Are we ready for a world where our senses betray us? #MultimodalA...
4
Sai Rajeswar
Sai Rajeswar @RajeswarSai ·
Do current large multimodal models really “understand” the structure behind a complex sketch? 🌟 Starflow converts hand-drawn workflow diagrams into executable JSON flows, testing VLMs on their ability to grasp true structure understanding. #multimodalA @patricebechard @PerouzT
Patrice Bechard @ EACL2026 Patrice Bechard @ EACL2026 @patricebechard ·
🚀 New paper from our team at @ServiceNowRSRCH!⁣ ⁣ 💫𝐒𝐭𝐚𝐫𝐅𝐥𝐨𝐰: 𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐧𝐠 𝐒𝐭𝐫𝐮𝐜𝐭𝐮𝐫𝐞𝐝 𝐖𝐨𝐫𝐤𝐟𝐥𝐨𝐰 𝐎𝐮𝐭𝐩𝐮𝐭𝐬 𝐅𝐫𝐨𝐦 𝐒𝐤𝐞𝐭𝐜𝐡 𝐈𝐦𝐚𝐠𝐞𝐬⁣ We use VLMs to turn 𝘩𝘢𝘯𝘥-𝘥𝘳𝘢𝘸𝘯 𝘴𝘬𝘦𝘵𝘤𝘩𝘦𝘴 and diagrams into executable workflows. 🖍️→⚙️⁣ ⁣ 🔗arxiv.org/abs/2503.21889⁣ 📝tinyurl.com/3utdbn97#Sketch2Flow #AI #VLM
217