Research Papers llm model_compression transformers interpretability

Shows that layer equivalence in transformers depends heavily on the test protocol used (replacement vs. interchange), an

Shows that layer equivalence in transformers depends heavily on the test protocol used (replacement vs. interchange), and that conflating them can misidentify which layers are safe to prune. Has implications for model compression research.

Original Post

Layer Equivalence Is Not a Property of Layers Alone: How You Test Redundancy Changes What You Find When researchers ask whether two transformer layers are "equivalent" for compression, they often conflate distinct tests. Replacement asks whether one layer's map can substitute for another's in place; interchange asks whether two layers approximately commute when their positions are swapped. Both are output-grounded swap-KL probes, but they need not agree: on pretrained transformers the protocol gap can change which layers look safe to prune by several-fold under the same evaluator, especially when replacement distances are high. We measure both protocols across checkpoints and architectures. On a Pythia training trajectory (410M and 1.4B), the replacement-interchange gap grows from initialization to convergence. Under one matched WikiText-2 contract at 8B scale, Qwen3-8B enters a divergent regime: interchange-guided removal is several-fold safer than replacement-guided at the same layer budgets, while Llama-3.1-8B ties the two protocols for pruning cost even though interchange KL is lower, showing metric gaps need not map one-to-one to removal. Before layer removal or merging, score both swap-KLs on the target checkpoint; the diagnostic requires only unlabeled forward passes.

Source: ARXIV (arxiv)
Author: Gabriel Garcia
Date: 2026-05-15
Relevance: 5
Topics: llm, model_compression, transformers, interpretability

View Original Post ↗

Shows that layer equivalence in transformers depends heavily on the test protocol used (replacement vs. interchange), an

Related Posts

An agentic prototype combining AlphaEvolve and Empirical Research Assistance run...

Co-Scientist uses a multi-agent 'idea tournament' framework to generate, debate,...

Research finding that LLMs adapt their behavior 24.9% when under observation, ra...