Neuron Populations Exhibit Divergent Selectivity with Scale

Amil Dravid1, Yasaman Bahri1, Alexei A. Efros1, Yossi Gandelsman2

1UC Berkeley 2TTIC

Neuron populations across scale: universality, selectivity, and specialization of Rosetta Neurons.
Neuron populations across scale. To study how neuron populations scale, we use Rosetta Neurons: units that recur across different models. (A) Under finite neuron capacity, features compete for fewer neurons, leaving them isolated, mixed, or unrepresented at a given scale. This picture guides our analysis of universality, selectivity, and specialization. In panels B–D, each column shows top-activating contexts from a single neuron. (B) Universality: how does the recurring Rosetta Neuron population scale? (C) Selectivity: do recurring neurons become increasingly monosemantic relative to polysemantic neurons? (D) Specialization: which features of the data distribution are encoded by Rosetta Neurons at different scales? Across language and vision models, Rosetta Neurons grow but vanish as a fraction of all neurons, while becoming increasingly selective and specialized.

Abstract

We investigate whether neuron populations within neural networks evolve predictably with scale, extending scaling laws beyond macroscopic observables such as loss. To probe this question, we study Rosetta Neurons, a previously characterized class of neurons whose activation patterns are similar across independently trained models. In separate analyses of language models up to 30B parameters and vision models up to 5B parameters, we observe that the population of Rosetta Neurons follows a sublinear power law in model size, growing in absolute number but occupying a shrinking fraction of the total neuron count. We further observe a Neuron Polarization Effect: Rosetta Neurons become more selective and increasingly monosemantic with scale, separating from a growing non-Rosetta population that remains less selective. An analytical model balancing feature utility against limited neuron capacity explains the sublinear power-law scaling and this polarization effect. Finally, we find that Rosetta Neurons become more domain-specialized with scale and illustrate their selectivity through a targeted data-filtering case study for continued pretraining. Our results point to a scaling law for interpretable, shared neuron-level structure, linking model size to systematic changes in neuron universality, selectivity, and specialization.

Acknowledgments

We thank members of Berkeley AI Research and the Redwood Center for Theoretical Neuroscience for helpful discussions. We are particularly grateful to Mason Kamb, Phillip Isola, David Bau, Yizhou Liu, Sophie Wang, Grace Luo, Stephanie Fu, Jasmine Shone, Tamar Rott Shaham, and Tyler Bonnen for their thoughtful feedback. YB, AE, and YG jointly advised this work. YB is a visiting scholar at UC Berkeley and member of the Simons Collaboration on the Physics of Learning & Neural Computation. AD is supported by the US Department of Energy Computational Science Graduate Fellowship. Additional support came from ONR MURI, NSF IIS-2403305, and the Google-BAIR Commons Program.

BibTeX

@misc{dravid2026neuronpopulationsexhibitdivergent,
      title={Neuron Populations Exhibit Divergent Selectivity with Scale}, 
      author={Amil Dravid and Yasaman Bahri and Alexei A. Efros and Yossi Gandelsman},
      year={2026},
      eprint={2606.03990},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2606.03990}, 
}