AIQuantumAndScienceNews
AIQuantumAndScienceNews @AIQuantumLifeEx ·
MATA: A Trainable Hierarchical Automaton System for Multi-Agent Visual Reasoning Preprint: MATA introduces a hierarchical automaton system for visual reasoning, using… arxiv.org/abs/2601.19204 #AI #VisualReasoning #MultiAgent #MachineLearning #Preprint #Arxiv #ScienceNews
arXiv logo
MATA: A Trainable Hierarchical Automaton System for Multi-Agent...

Recent vision-language models have strong perceptual ability but their implicit reasoning is hard to explain and easily generates hallucinations on complex queries. Compositional methods improve...

From arxiv.org
10
Jiang Bian
Jiang Bian @jbian22 ·
Replying to @jbian22
PixelCraft significantly boosts performance for strong MLLMs (GPT-40, Claude 3.7) on tough benchmarks like ChartXiv, ChartQAPro, & Geometry3K. Paper: arxiv.org/pdf/2509.25185 Code: github.com/microsoft/Pixe… #AI #MLLM #VisualReasoning #ComputerVision #MultiAgent
GitHub - microsoft/PixelCraft: [ICLR 2026] High-Fidelity Visual Reasoning on Structured Images

[ICLR 2026] High-Fidelity Visual Reasoning on Structured Images - microsoft/PixelCraft

From github.com
31