INDEX
Explanations
mentions of positions such as "president" or specific positions held by individuals
Neuron Alignment
Index   Value   % of L₁
555     +0.18   0.7%
1035    +0.15   0.6%
1778    +0.14   0.5%
Correlated Neurons
Index   P. Corr.   Cos Sim.
555     +0.18      0.06
1035    +0.15      0.06
1778    +0.14      0.05
Negative Logits
»>        -0.71
aen       -0.70
jacobs    -0.67
paula     -0.65
ricardo   -0.64
fta       -0.63
akku      -0.63
fua       -0.63
whiche    -0.63
thut      -0.62
Positive Logits
president      1.44
president      1.34
President      1.31
President      1.27
presidents     1.25
presidency     1.18
PRESIDENT      1.08
Presidents     1.07
PRESIDENT      1.04
presidential   0.99
Activations Density: 0.085%