INDEX
Explanations
proper nouns related to politics and current events
Neuron Alignment
Index   Value   % of L₁
1741    +0.10   0.3%
227     +0.09   0.3%
2034    +0.09   0.2%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1081    +0.10      0.06
2044    +0.09      0.06
227     +0.09      0.06
Negative Logits
Gennaio   -0.91
sappi     -0.90
ideolog   -0.88
solidar   -0.86
manuten   -0.84
inverte   -0.83
pól       -0.81
apparti   -0.79
cresce    -0.78
potest    -0.78
Positive Logits
so                0.59
McInt             0.58
McLaugh           0.58
therefore         0.58
and               0.57
Vaugh             0.56
ecru              0.55
anyway            0.55
abestanden        0.53
chronologically   0.53
Activations Density: 0.539%