What is "Qwen's Thinking Budget"?
[QUOTE="Munyaradzi Mafaro, post: 68661, member: 636"]
In short, the [B]thinking budget[/B] is a parameter that lets you cap the [B]maximum length of the model's reasoning process[/B] before it gives a final answer.

To understand this, you first need to know that Qwen models (like Qwen3) have two modes for answering questions:

[LIST=1]
[*][B]Non-Thinking Mode:[/B] This is for simple, straightforward questions (e.g., "What is the capital of France?"). The model gives a direct, fast answer.
[*][B]Thinking Mode:[/B] This is for complex problems that require step-by-step reasoning (e.g., a math word problem or a complex coding task). In this mode, the model first "thinks" through the problem, a process often called a "chain of thought", and then uses that reasoning to formulate the final answer.
[/LIST]

The "thinking budget" applies [B]only to Thinking Mode[/B].

[HEADING=2]How it works[/HEADING]
[LIST]
[*][B]It's a Trade-off:[/B] The thinking budget lets you balance [B]answer quality against cost and speed[/B].
[LIST]
[*][B]High Budget:[/B] You allow the model to spend more "thinking" tokens. This can lead to more accurate, thorough, and well-reasoned answers on very complex tasks. The downside is that it takes more time and computational resources.
[*][B]Low Budget:[/B] You restrict the number of thinking tokens. The model answers faster and more cheaply, but the answer may be less detailed or less accurate if the problem is very difficult.
[/LIST]
[*][B]It's a Limit:[/B] You are essentially setting a maximum number of tokens (the basic units of text) that the model can use for its reasoning.
[LIST]
[*]If the model solves the problem before hitting the budget, it simply stops thinking and gives you the answer.
[*]If the model hits the budget before it has "finished" thinking, it is forced to stop and give the best answer it can based on the reasoning completed so far.
[/LIST]
[/LIST]
[/QUOTE]
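To make the "It's a Limit" behaviour in the quoted post concrete, here is a toy sketch in plain Python. It does not call Qwen or any real API. The function name `apply_thinking_budget` and the token lists are made up for illustration, and it only models the cap itself: reasoning that fits the budget is kept whole, and reasoning that exceeds it is cut off at the limit.

```python
from typing import List, Tuple


def apply_thinking_budget(reasoning_tokens: List[str], budget: int) -> Tuple[List[str], bool]:
    """Hypothetical helper: cap a reasoning trace at `budget` tokens.

    Returns the tokens that are kept and a flag saying whether the
    reasoning was cut short by the budget.
    """
    if budget < 0:
        raise ValueError("budget must be non-negative")
    if len(reasoning_tokens) <= budget:
        # The model finished thinking before reaching the limit.
        return list(reasoning_tokens), False
    # The limit was hit first: keep only the reasoning done so far.
    return reasoning_tokens[:budget], True


# Made-up reasoning traces standing in for a model's thinking tokens.
short_trace = ["add", "2", "and", "2"]
long_trace = ["step"] * 10

kept_short, cut_short = apply_thinking_budget(short_trace, budget=8)
kept_long, cut_long = apply_thinking_budget(long_trace, budget=8)

print(len(kept_short), cut_short)  # 4 False
print(len(kept_long), cut_long)    # 8 True
```

In a real deployment the cap is enforced by the serving stack, not by your own code, so check the documentation of whichever provider or inference library you use for the actual parameter name and behaviour.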