INDEX
Explanations
people who are involved in legal or political situations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.13
0.4%
16
+0.12
0.3%
1806
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
16
+0.13
0.04
1806
+0.12
0.04
238
+0.10
0.04
Negative Logits
."<
-0.70
."\
-0.68
,'\
-0.62
)+"
-0.60
."/
-0.60
+".
-0.59
.'</
-0.58
+"_
-0.57
tiks
-0.57
Cursors
-0.56
POSITIVE LOGITS
encomp
1.30
intersper
1.23
emphat
1.22
depic
1.18
disagre
1.13
unspeak
1.12
inev
1.09
affor
1.09
increa
1.07
reluct
1.06
Activations Density 0.167%