Close Menu
SkytikSkytik

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    At Least 32 People Dead After a Mine Bridge Collapsed Due to Overcrowding

    November 17, 2025

    Here’s how I turned a Raspberry Pi into an in-car media server

    November 17, 2025

    Beloved SF cat’s death fuels Waymo criticism

    November 17, 2025
    Facebook X (Twitter) Instagram
    • About Us
    • Contact Us
    SkytikSkytik
    • Home
    • AI Tools
    • Online Tools
    • Tech News
    • Guides
    • Reviews
    • SEO & Marketing
    • Social Media Tools
    SkytikSkytik
    Home»AI Tools»Do LLMs Struggle with Math Across Cultural Contexts?
    AI Tools

    Do LLMs Struggle with Math Across Cultural Contexts?

    AwaisBy AwaisApril 10, 2026No Comments2 Mins Read0 Views
    Facebook Twitter Pinterest LinkedIn Telegram Tumblr Email
    Measuring Intelligence Efficiency of Local AI
    Share
    Facebook Twitter LinkedIn Pinterest Email

    [Submitted on 23 Mar 2025 (v1), last revised 8 Apr 2026 (this version, v2)]

    View a PDF of the paper titled Lost in Cultural Translation: Do LLMs Struggle with Math Across Cultural Contexts?, by Aabid Karim and 5 other authors

    View PDF
    HTML (experimental)

    Abstract:We demonstrate that large language models’ (LLMs) mathematical reasoning is culturally sensitive: testing 14 models from Anthropic, OpenAI, Google, Meta, DeepSeek, Mistral, and Microsoft across six culturally adapted variants of the GSM8K benchmark, we find accuracy drops ranging from 0.3% (Claude 3.5 Sonnet) to 5.9% (LLaMA 3.1-8B) when math problems are embedded in unfamiliar cultural contexts–even when the underlying mathematical logic remains unchanged. These statistically significant performance reductions (p < 0.01, confirmed through McNemar tests) reveal that mathematical reasoning in LLMs is not culturally neutral.

    To create these variants for Haiti, Moldova, Pakistan, Solomon Islands, Somalia, and Suriname, we systematically replaced cultural entities (names, foods, places, etc.) in 1,198 GSM8K questions while preserving all mathematical operations and numerical values. Our quantitative error analysis of 18,887 instances reveals that cultural adaptation affects broader reasoning patterns, with mathematical reasoning errors comprising 54.7% and calculation errors 34.5% of failures.

    Interestingly, cultural familiarity can enhance performance: Mistral Saba outperforms some larger models on Pakistan-adapted problems due to Middle Eastern and South Asian training data exposure. This study underscores the need for more diverse training data to ensure robust LLM performance across global contexts.

    Submission history

    From: Aabid Karim [view email]
    [v1]
    Sun, 23 Mar 2025 10:35:39 UTC (3,432 KB)
    [v2]
    Wed, 8 Apr 2026 07:40:49 UTC (5,078 KB)

    Contexts Cultural LLMs Math Struggle
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Awais
    • Website

    Related Posts

    TraceSafe: A Systematic Assessment of LLM Guardrails on Multi-Step Tool-Calling Trajectories

    April 10, 2026

    Towards a Category-theoretic Comparative Framework for Artificial General Intelligence

    April 10, 2026

    The Future of AI for Sales Is Diverse and Distributed

    April 10, 2026

    [2604.05070] Part-Level 3D Gaussian Vehicle Generation with Joint and Hinge Axis Estimation

    April 9, 2026

    How Visual-Language-Action (VLA) Models Work

    April 9, 2026

    A Multi-Agent Framework for Automated AI Research Paper Writing

    April 9, 2026
    Leave A Reply Cancel Reply

    Top Posts

    At Least 32 People Dead After a Mine Bridge Collapsed Due to Overcrowding

    November 17, 20250 Views

    Here’s how I turned a Raspberry Pi into an in-car media server

    November 17, 20250 Views

    Beloved SF cat’s death fuels Waymo criticism

    November 17, 20250 Views
    Don't Miss

    How to stay compliant and win in local SEO

    April 10, 2026

    There’s a broad consensus that online reviews — especially Google reviews — should be a…

    How to Make Money on Facebook in 2026

    April 10, 2026

    How Hilti Builds Safety Into Every Tool, System, and Solution

    April 10, 2026

    What I Learned About The Future Of Search And AI From Sundar Pichai’s Latest Interview

    April 10, 2026
    Stay In Touch
    • Facebook
    • YouTube
    • TikTok
    • WhatsApp
    • Twitter
    • Instagram
    Latest Reviews

    What 400 Sites Reveal About Organic Traffic Gains

    April 10, 2026

    How to Use Almond Extract (Without Overdoing It)

    April 10, 2026
    Most Popular

    13 Trending Songs on TikTok in Nov 2025 (+ How to Use Them)

    November 18, 20257 Views

    How to watch the 2026 GRAMMY Awards online from anywhere

    February 1, 20263 Views

    Corporate Reputation Management Strategies | Sprout Social

    November 19, 20252 Views
    Our Picks

    At Least 32 People Dead After a Mine Bridge Collapsed Due to Overcrowding

    November 17, 2025

    Here’s how I turned a Raspberry Pi into an in-car media server

    November 17, 2025

    Beloved SF cat’s death fuels Waymo criticism

    November 17, 2025

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest YouTube Dribbble
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms & Conditions
    • Disclaimer

    © 2025 skytik.cc. All rights reserved.

    Type above and press Enter to search. Press Esc to cancel.