ai/qwen2.5

Verified Publisher

By Docker

Updated 12 months ago

Versatile Qwen update with better language skills and wider support

Model
9

100K+

ai/qwen2.5 repository overview

Qwen2.5-7B Instruct

logo

Qwen2.5-7B-Instruct is an instruction-tuned large language model developed by Alibaba Cloud. It is part of the Qwen2.5 series, which includes models ranging from 0.5 to 72 billion parameters. This model offers significant improvements in knowledge, coding, and mathematical capabilities, along with enhanced instruction-following and long-text generation abilities. It supports a context length of up to 131,072 tokens and can generate outputs up to 8,192 tokens. Additionally, it provides multilingual support for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, and Arabic.

Intended uses

Qwen2.5-7B-Instruct is designed to assist in various natural language processing tasks, including:

  • Conversational AI: Engaging in dialogue with users, providing informative and contextually relevant responses.
  • Text generation: Creating coherent and contextually appropriate text based on prompts.
  • Multilingual support: Understanding and generating text in multiple languages, facilitating cross-lingual communication.
  • Structured data understanding: Working with tables, JSON, and semi-structured input/output

Characteristics

AttributeDetails
ProviderAlibaba Cloud
Architectureqwen2
Cutoff dateNovember 2024 (est)
LanguagesChinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more (29 languages)
Tool calling
Input modalitiesText
Output modalitiesText
LicenseApache 2.0

Available model variants

Model variantParametersQuantizationContext windowVRAM¹Size
ai/qwen2.5:latest

ai/qwen2.5:7B-Q4_K_M
7BIQ2_XXS/Q4_K_M33K tokens4.83 GiB4.36 GB
ai/qwen2.5:0.5B-F160.5BF1633K tokens1.38 GiB942.43 MB
ai/qwen2.5:1.5B-F161.5BF1633K tokens3.39 GiB2.88 GB
ai/qwen2.5:3B-Q4_K_M3BIQ2_XXS/Q4_K_M33K tokens2.37 GiB1.79 GB
ai/qwen2.5:3B-F163BF1633K tokens6.33 GiB5.75 GB
ai/qwen2.5:7B-Q4_07BQ4_033K tokens4.60 GiB4.12 GB
ai/qwen2.5:7B-Q4_K_M7BIQ2_XXS/Q4_K_M33K tokens4.83 GiB4.36 GB
ai/qwen2.5:7B-F167BF1633K tokens13.93 GiB14.19 GB

¹: VRAM estimated based on model characteristics.

latest7B-Q4_K_M

Use this AI model with Docker Model Runner

First, pull the model:

docker model pull ai/qwen2.5

Then run the model:

docker model run ai/qwen2.5

For more information on Docker Model Runner, explore the documentation.

Considerations

  • Ensure that the model is used in accordance with its Apache 2.0 license.
  • Be mindful of the computational resources required, especially when handling long-context inputs.
  • Regularly update to the latest version to benefit from improvements and security updates.

Benchmark performance

MetricsBenchmarkQwen2.5-7B-Instruct
Knowledge & QAMMLU-Pro56.3
MMLU-redux75.4
GPQA36.4
Math & ReasoningMATH75.5
GSM8K91.6
CodeHumanEval84.8
MBPP79.2
MultiPL-E70.4
LiveCodeBench 2305-240928.7
LiveBench 083135.9
Instruction FollowingIFeval strict-prompt71.2
Arena-Hard52.0
Alignment & PreferenceAlignBench v1.17.33
MTbench8.75

Tag summary

Content type

Model

Digest

sha256:b117fb2e0

Size

4.1 GB

Last updated

12 months ago

docker model pull ai/qwen2.5:7B-Q4_0

This week's pulls

Pulls:

2,797

Last week