ML Wiki

Tag: reasoning

11 items with this tag.

  • May 09, 2026

    CodeAct: Executable Code Actions Elicit Better LLM Agents

    • source
    • agents
    • tool-use
    • code-generation
    • reasoning
  • Apr 28, 2026

    Where does RL-on-verifiable-rewards stop generalizing?

    • thread
    • reasoning
    • rl
    • deepseek-r1
  • Apr 24, 2026

    Self-Consistency

    • concept
    • reasoning
    • sampling
    • ensemble-methods
  • Apr 17, 2026

    From Prompting to Agency — Reasoning and Tool-Using LLMs

    • learning-path
    • reasoning
    • agents
    • prompting
  • Apr 17, 2026

    Self-Consistency Improves Chain of Thought Reasoning in Language Models

    • source
    • chain-of-thought
    • reasoning
    • in-context-learning
    • sampling
    • decoding
  • Apr 17, 2026

    Tree of Thoughts: Deliberate Problem Solving with Large Language Models

    • source
    • reasoning
    • chain-of-thought
    • search
    • planning
    • in-context-learning
  • Apr 16, 2026

    RL for Reasoning (Test-Time Compute Scaling)

    • concept
    • reinforcement-learning
    • reasoning
    • chain-of-thought
    • scaling
  • Apr 16, 2026

    DeepSeek-R1: Incentivizing Reasoning via Reinforcement Learning

    • source
    • reasoning
    • reinforcement-learning
    • chain-of-thought
    • rl
    • grpo
  • Apr 16, 2026

    ReAct: Synergizing Reasoning and Acting in Language Models

    • source
    • agents
    • reasoning
    • acting
    • tool-use
    • chain-of-thought
  • Apr 05, 2026

    Chain-of-Thought (CoT) Prompting

    • concept
    • prompting
    • reasoning
  • Apr 05, 2026

    Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

    • source
    • chain-of-thought
    • prompting
    • reasoning
    • emergent-abilities