Qwen
2025-01-27
New long-context models: mind the output gap
While attention has been recently focused on DeepSeek, Alibaba has quietly made significant strides with two new models: Qwen2.5-7B-Instruct-1M and Qwen2.5-14B-Instruct-1M. These represent a breakthrough as the first open-source Qwen models capable of handling contexts up to 1 million tokens.