Latest Tech News - VisualThinker-R1-Zero: First ever R1-Zero's Aha Moment on just a 2B non-SFT Model

VisualThinker-R1-Zero is a replication of DeepSeek-R1-Zero in visual reasoning. We are the first to successfully observe the emergent “aha moment” and increased response length in visual reasoning on just a 2B non-SFT models.

VisualThinker-R1-Zero: First ever R1-Zero's Aha Moment on just a 2B non-SFT Model March 5, 2025