Qwen 2 72B Instruct

qwen/qwen-2-72b-instruct

Created Jun 7, 202432,768 context
$0.90/M input tokens$0.90/M output tokens

Qwen2 72B is a transformer-based model that excels in language understanding, multilingual capabilities, coding, mathematics, and reasoning.

It features SwiGLU activation, attention QKV bias, and group query attention. It is pretrained on extensive data with supervised finetuning and direct preference optimization.

For more details, see this blog post and GitHub repo.

Usage of this model is subject to Tongyi Qianwen LICENSE AGREEMENT.

Recent activity on Qwen 2 72B Instruct

Tokens processed per day

May 20May 27Jun 3Jun 10Jun 17Jun 24Jul 1Jul 8Jul 15Jul 22Jul 29Aug 5Aug 12150M300M450M600M

More models from Qwen

    Qwen 2 72B Instruct – Recent Activity | OpenRouter