INDEX
Explanations
phrases related to global initiatives and events
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.17
0.6%
690
+0.09
0.3%
752
+0.07
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
783
+0.17
0.07
690
+0.09
0.07
219
+0.07
0.06
Negative Logits
>({-0.77
ⓧ
-0.67
IsRequired
-0.67
also
-0.67
javax
-0.67
ver
-0.66
win
-0.66
won
-0.65
、
-0.65
win
-0.65
POSITIVE LOGITS
hcm
1.90
saar
1.84
mef
1.78
wien
1.76
effe
1.74
socie
1.72
maneu
1.70
Somal
1.67
territo
1.67
stockholm
1.66
Activations Density 0.759%