INDEX
Explanations
financial and political terms and phrases
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
227
+0.11
0.3%
1967
+0.09
0.3%
845
+0.09
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
227
+0.11
0.06
722
+0.09
0.04
1720
+0.09
0.03
Negative Logits
rubrique
-0.71
pédagogique
-0.70
logarith
-0.70
asiatique
-0.69
ftre
-0.68
Shakspeare
-0.67
reft
-0.67
laft
-0.66
fince
-0.64
itinéraire
-0.64
POSITIVE LOGITS
Milán
0.61
Compañ
0.55
organisation
0.54
barran
0.53
tambor
0.53
Inggris
0.52
palab
0.52
nonprofit
0.51
руппа
0.51
Irán
0.51
Activations Density 0.319%