Close Menu
SkytikSkytik

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    At Least 32 People Dead After a Mine Bridge Collapsed Due to Overcrowding

    November 17, 2025

    Here’s how I turned a Raspberry Pi into an in-car media server

    November 17, 2025

    Beloved SF cat’s death fuels Waymo criticism

    November 17, 2025
    Facebook X (Twitter) Instagram
    • About Us
    • Contact Us
    SkytikSkytik
    • Home
    • AI Tools
    • Online Tools
    • Tech News
    • Guides
    • Reviews
    • SEO & Marketing
    • Social Media Tools
    SkytikSkytik
    Home»AI Tools»[2505.12189] Mitigating Content Effects on Reasoning in Language Models through Fine-Grained Activation Steering
    AI Tools

    [2505.12189] Mitigating Content Effects on Reasoning in Language Models through Fine-Grained Activation Steering

    AwaisBy AwaisApril 2, 2026No Comments2 Mins Read0 Views
    Facebook Twitter Pinterest LinkedIn Telegram Tumblr Email
    Measuring Intelligence Efficiency of Local AI
    Share
    Facebook Twitter LinkedIn Pinterest Email

    [Submitted on 18 May 2025 (v1), last revised 1 Apr 2026 (this version, v3)]

    View a PDF of the paper titled Mitigating Content Effects on Reasoning in Language Models through Fine-Grained Activation Steering, by Marco Valentino and 4 other authors

    View PDF
    HTML (experimental)

    Abstract:Large language models (LLMs) exhibit reasoning biases, often conflating content plausibility with formal logical validity. This can lead to wrong inferences in critical domains, where plausible arguments are incorrectly deemed logically valid or vice versa. This paper investigates how content biases on reasoning can be mitigated through activation steering, an inference-time technique that modulates internal activations. Specifically, after localising the layers responsible for formal and plausible inference, we investigate activation steering on a controlled syllogistic reasoning task, designed to disentangle formal validity from content plausibility. An extensive empirical analysis reveals that contrastive steering methods consistently support linear control over content biases. However, a static approach is insufficient to debias all the tested models. We then investigate how to control content effects by dynamically determining the steering parameters through fine-grained conditional methods. By introducing a novel kNN-based conditional approach (K-CAST), we demonstrate that conditional steering can effectively reduce biases on unresponsive models, achieving up to 15% absolute improvement in formal reasoning accuracy. Finally, we found that steering for content effects is robust to prompt variations, incurs minimal side effects on multilingual language modeling capabilities, and can partially generalize to different reasoning tasks. In practice, we demonstrate that activation-level interventions offer a scalable inference-time strategy for enhancing the robustness of LLMs, contributing towards more systematic and unbiased reasoning capabilities.

    Submission history

    From: Marco Valentino [view email]
    [v1]
    Sun, 18 May 2025 01:34:34 UTC (4,113 KB)
    [v2]
    Fri, 6 Mar 2026 09:25:57 UTC (3,958 KB)
    [v3]
    Wed, 1 Apr 2026 05:20:18 UTC (3,958 KB)

    activation content Effects finegrained Language Mitigating Models Reasoning steering
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Awais
    • Website

    Related Posts

    How to Handle Classical Data in Quantum Models

    April 2, 2026

    Linear Regression Is Actually a Projection Problem (Part 2: From Projections to Predictions)

    April 2, 2026

    Source-Aware Dual-Track Tokenization for Multi-Track Music Language Modeling

    April 2, 2026

    Quantum Simulations with Python | Towards Data Science

    April 2, 2026

    [2506.08915] Two-stage Vision Transformers and Hard Masking offer Robust Object Representations

    April 2, 2026

    A Benchmark Dataset for Epitope-Specific Antibody Design

    April 2, 2026
    Leave A Reply Cancel Reply

    Top Posts

    At Least 32 People Dead After a Mine Bridge Collapsed Due to Overcrowding

    November 17, 20250 Views

    Here’s how I turned a Raspberry Pi into an in-car media server

    November 17, 20250 Views

    Beloved SF cat’s death fuels Waymo criticism

    November 17, 20250 Views
    Don't Miss

    How to Handle Classical Data in Quantum Models

    April 2, 2026

    recent years, quantum computing has attracted growing interest from researchers, businesses, and the public. “Quantum”…

    Mint Julep (Bourbon Cocktail With Mint) Recipe

    April 2, 2026

    ChatGPT ads favor clarity over creativity, new data shows

    April 2, 2026

    AI Leads All Reasons For U.S. Job Cuts In March, Report Says

    April 2, 2026
    Stay In Touch
    • Facebook
    • YouTube
    • TikTok
    • WhatsApp
    • Twitter
    • Instagram
    Latest Reviews

    Linear Regression Is Actually a Projection Problem (Part 2: From Projections to Predictions)

    April 2, 2026

    17 Recipes to Help You Survive Tax Day

    April 2, 2026
    Most Popular

    13 Trending Songs on TikTok in Nov 2025 (+ How to Use Them)

    November 18, 20257 Views

    How to watch the 2026 GRAMMY Awards online from anywhere

    February 1, 20263 Views

    Corporate Reputation Management Strategies | Sprout Social

    November 19, 20252 Views
    Our Picks

    At Least 32 People Dead After a Mine Bridge Collapsed Due to Overcrowding

    November 17, 2025

    Here’s how I turned a Raspberry Pi into an in-car media server

    November 17, 2025

    Beloved SF cat’s death fuels Waymo criticism

    November 17, 2025

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest YouTube Dribbble
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms & Conditions
    • Disclaimer

    © 2025 skytik.cc. All rights reserved.

    Type above and press Enter to search. Press Esc to cancel.