INDEX
Explanations
proper nouns and names of educational institutions
Neuron Alignment
Index   Value   % of L₁
227     +0.19   0.6%
752     +0.12   0.4%
1343    +0.12   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
227     +0.19      0.05
1410    +0.12      0.04
1336    +0.12      0.04
Negative Logits
blackish    -0.91
purplish    -0.89
overcrow    -0.86
friable     -0.83
lmfao       -0.82
disreg      -0.81
greyish     -0.80
moistened   -0.77
purée       -0.76
sessile     -0.75
Positive Logits
simplif     1.17
déclen      1.09
rafra       1.06
surpl       1.06
redé        1.06
renou       1.03
Chá         1.03
Keny        1.01
verrou      1.00
obé         1.00
Activations Density 0.167%