Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMsPublished in International Conference on Learning Representations (ICLR), 2026Share on Bluesky Facebook LinkedIn Mastodon X (formerly Twitter) Previous Next