chain-of-thought reasoning
6 mentions across all digests
Chain-of-thought reasoning is a technique where AI models generate intermediate reasoning steps before producing a final answer, improving accuracy on complex tasks and forming the basis of 'thinking' models like Gemini 2.5 Pro.
ETR: Entropy Trend Reward for Efficient Chain-of-Thought Reasoning
Entropy Trend Reward (ETR) reduces the computational cost of chain-of-thought reasoning by learning which intermediate reasoning steps are essential, addressing a critical bottleneck in deploying reasoning-heavy language models.
InCoder-32B-Thinking: Industrial Code World Model for Thinking
InCoder-32B-Thinking brings chain-of-thought reasoning to code generation in a 32B parameter model optimized for industrial coding workflows.
Gemini 2.5: Our most intelligent AI model
GPT-5.4 Thinking System Card
Thinking with images