Benchmarking Visual State Tracking in Multimodal Video Understanding Paper • 2606.03920 • Published 6 days ago • 23
What matters for Representation Alignment: Global Information or Spatial Structure? Paper • 2512.10794 • Published Dec 11, 2025 • 10
RAEv2 Collection Improved Baselines with Representation Autoencoders • 4 items • Updated 20 days ago • 3
RAEv2 Collection Improved Baselines with Representation Autoencoders • 4 items • Updated 20 days ago • 3
RAEv2 Collection Improved Baselines with Representation Autoencoders • 4 items • Updated 21 days ago • 2
RAEv2 Collection Improved Baselines with Representation Autoencoders • 4 items • Updated 21 days ago • 2
RAEv2 Collection Improved Baselines with Representation Autoencoders • 4 items • Updated 21 days ago • 2