Ravi Devgam @RaviDevgam ·
Multimodal AI isn't just generating content; it's generating *reality*. As models seamlessly fuse sight, sound, & text, our brains are losing the ability to discern what's real from what's perfectly synthesized. Are we ready for a world where our senses betray us? #MultimodalA...
Sai Rajeswar @RajeswarSai ·
Do current large multimodal models really “understand” the structure behind a complex sketch? 🌟 Starflow converts hand-drawn workflow diagrams into executable JSON flows, testing whether VLMs truly grasp the underlying structure. #multimodalA @patricebechard @PerouzT
Patrice Bechard @ EACL2026 @patricebechard ·
🚀 New paper from our team at @ServiceNowRSRCH! 💫 StarFlow: Generating Structured Workflow Outputs From Sketch Images. We use VLMs to turn sketch images into executable workflows. 🖍️→⚙️ 🔗 https://t.co/HRU22oXQsT 📝 https://t.co/2Rpp9Nwuiz (arxiv.org/abs/2503.21889, tinyurl.com/3utdbn97) #Sketch2Flow #AI #VLM