Qwen3
This model was released on 2025-04-29 and added to Hugging Face Transformers on 2025-03-31.
Overview
Qwen3 refers to the dense model architecture Qwen3-32B, which was released alongside its mixture-of-experts variant, Qwen3MoE (blog post).
Model Details
To be released with the official model launch.
Usage tips
To be released with the official model launch.
Qwen3Config
[[autodoc]] Qwen3Config
Qwen3Model
[[autodoc]] Qwen3Model
    - forward
Qwen3ForCausalLM
[[autodoc]] Qwen3ForCausalLM
    - forward
Qwen3ForSequenceClassification
[[autodoc]] Qwen3ForSequenceClassification
    - forward
Qwen3ForTokenClassification
[[autodoc]] Qwen3ForTokenClassification
    - forward
Qwen3ForQuestionAnswering
[[autodoc]] Qwen3ForQuestionAnswering
    - forward