PixelWorld: Towards Perceiving Everything as Pixels
Published in TMLR, 2025
Converting textual reasoning data into images to probe vision-language model reasoning capabilities
Recommended citation: Lyu, Z., Ma, X., & Chen, W. (2025). PixelWorld: Towards Perceiving Everything as Pixels. Transactions on Machine Learning Research. https://arxiv.org/abs/placeholder
