Abstract
How Large Multilingual Models (LMMs) separate linguistic form from semantic content remains largely opaque. This paper introduces a framework for dissecting their internal representations, revealing a phenomenon we term Functional Specialization: the emergence of distinct neural circuits for language-specific form versus language-agnostic semantics. Through extensive experiments on the E5 and Qwen model series, we show that this specialization is governed by three factors. Architecture dictates the core strategy: encoders adopt specialization breadth (many features organized in a staged workflow), while decoders pursue specialization depth (few, high-purity features). Scale primarily drives neural efficiency, enabling robust separation with fewer circuits. Finally, high-clarity data acts as a catalyst, inducing sophisticated mechanisms even in smaller models. Our findings chart a path toward more controllable and interpretable multilingual AI.