INDEX
Explanations
information related to political matters, economics, and controversies
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
411
+0.15
0.5%
25
+0.10
0.3%
553
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
411
+0.15
0.09
1415
+0.10
0.05
663
+0.09
0.05
Negative Logits
Abbé
-1.14
mef
-1.12
hcm
-1.10
fup
-1.09
fta
-1.08
Augu
-1.06
vété
-1.04
Chinois
-1.02
ftu
-1.01
Intere
-1.00
POSITIVE LOGITS
will
0.84
soon
0.79
eventually
0.78
WILL
0.76
Will
0.76
Will
0.75
continue
0.72
will
0.71
surely
0.71
hopefully
0.70
Activations Density 0.350%