Characterizing Linear Alignment Across Language Models, by Matt Gorbett and 1 other author
Abstract: Language models increasingly appear to learn similar representations, despite differences in training objectives, architectures, and data modalities. This emerging compatibility between independently trained models introduces new opportunities for cross-model alignment to downstream objectives. Moreover, this capability unlocks new potential application domains, such as settings where security, privacy, or competitive constraints prohibit direct data or model sharing. In this work, we investigate the extent to which representational convergence enables practical linear alignment between large language models. Specifically, we learn affine transformations between the final hidden states of independent models and empirically evaluate these mappings across text generation, embedding classification, and out-of-distribution detection. We find that performance is largely preserved across model pairs, and show for the first time that linear alignment sometimes enables text generation across independently trained models. We further highlight a potential application of linear alignment for privacy-preserving cross-silo inference. The framework learns an affine transformation over a shared public dataset and uses homomorphic encryption to protect client queries. By encrypting only the linear classification operation, the method achieves sub-second inference latency.
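As a rough illustration of the alignment step described in the abstract, the sketch below fits an affine map between the final hidden states of two models using ordinary least squares over paired embeddings, then applies it to a new source-model embedding. The variable names (H_src, H_tgt, x_new) and the closed-form least-squares fit are assumptions for illustration, not necessarily the paper's exact procedure; the random arrays are placeholders for hidden states that would in practice be computed by running both models on the same shared public dataset.

```python
import numpy as np

# Hypothetical paired final hidden states from two independently trained
# models, computed on the same shared public dataset (n examples each).
rng = np.random.default_rng(0)
n, d_src, d_tgt = 1000, 768, 1024
H_src = rng.normal(size=(n, d_src))   # source model hidden states (placeholder)
H_tgt = rng.normal(size=(n, d_tgt))   # target model hidden states (placeholder)

# Fit an affine transformation  H_tgt ~= H_src @ W + b  by least squares.
# A bias column is appended so the intercept b is learned jointly with W.
X = np.hstack([H_src, np.ones((n, 1))])
coef, *_ = np.linalg.lstsq(X, H_tgt, rcond=None)
W, b = coef[:-1], coef[-1]

# Map a new source-model embedding into the target model's representation space.
x_new = rng.normal(size=(1, d_src))
x_aligned = x_new @ W + b

# Downstream, a linear classifier trained in the target space can score the
# aligned embedding; in the cross-silo setting described in the abstract,
# this single affine classification step is the part that would run under
# homomorphic encryption on the client's query.
```

The closed-form fit is only one way to learn such a map; a ridge penalty or gradient-based fitting would follow the same pattern while being more robust when the number of paired examples is small relative to the hidden dimension.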
Submission history
From: Matt Gorbett
[v1] Thu, 19 Mar 2026 13:43:32 UTC (2,229 KB)
[v2] Sat, 21 Mar 2026 20:00:35 UTC (610 KB)
[v3] Thu, 26 Mar 2026 15:17:05 UTC (2,577 KB)