GLM-4.1V-9B-Thinking is a 9B-parameter vision-language model developed by THUDM, built on the GLM-4-9B foundation model. It introduces a reasoning-centric "thinking paradigm" trained with reinforcement learning to improve multimodal reasoning, long-context understanding (up to 64K tokens), and complex problem solving. It achieves state-of-the-art performance among models of its size, matching or outperforming the much larger Qwen2.5-VL-72B on a majority of benchmark tasks.
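
A common way to try the model is through Hugging Face Transformers. The snippet below is a minimal, illustrative sketch, not the official usage: it assumes the checkpoint is published under the `THUDM/GLM-4.1V-9B-Thinking` repo ID and that your installed `transformers` version exposes it through the generic `AutoModelForImageTextToText` interface; the image URL and generation settings are placeholders, so adjust them to the official release instructions.

```python
# Minimal sketch of multimodal inference with GLM-4.1V-9B-Thinking.
# Assumptions: the Hugging Face repo ID "THUDM/GLM-4.1V-9B-Thinking" and support via
# AutoModelForImageTextToText in a recent transformers release; verify against the model card.
import torch
from transformers import AutoProcessor, AutoModelForImageTextToText

MODEL_ID = "THUDM/GLM-4.1V-9B-Thinking"  # assumed repo name

processor = AutoProcessor.from_pretrained(MODEL_ID)
model = AutoModelForImageTextToText.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.bfloat16,  # 9B weights fit comfortably on a single high-memory GPU in bf16
    device_map="auto",
)

# Chat-style multimodal input: one image plus a reasoning question.
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://example.com/chart.png"},  # placeholder image URL
            {"type": "text", "text": "What trend does this chart show? Explain your reasoning."},
        ],
    }
]

inputs = processor.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

# The "thinking" variant emits an explicit reasoning trace before its final answer,
# so leave room for a long generation.
output_ids = model.generate(**inputs, max_new_tokens=1024)
print(processor.decode(output_ids[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```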