Explanations
names or mentions of specific locations (particularly Basel and Bern)
Neuron Alignment
Index   Value   % of L₁
47      +0.10   0.3%
313     +0.09   0.3%
1276    +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1120    +0.10      0.05
1343    +0.09      0.05
690     +0.09      0.04
Negative Logits
intersper   -1.60
encomp      -1.56
🤣🤣          -1.55
hairc       -1.55
hentai      -1.49
lmfao       -1.49
milf        -1.47
increa      -1.44
stickied    -1.40
Lmao        -1.39
Positive Logits
Bern        1.68
Teb         1.56
Bern        1.43
Teb         1.17
Cer         1.04
Cer         0.95
quo         0.85
cer         0.83
BERN        0.81
Newtown     0.80
Activations Density: 0.287%