INDEX
Explanations
mentions of legal proceedings and governmental actions, specifically focusing on hearings, testimonies, and intelligence committees
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
453
+0.10
0.3%
752
+0.09
0.3%
856
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
16
+0.10
0.05
1499
+0.09
0.04
453
+0.08
0.04
Negative Logits
purée
-0.62
nutella
-0.51
gliss
-0.49
COMMUNIC
-0.48
BEHAV
-0.48
upvoted
-0.48
EXERCISES
-0.48
bleeds
-0.48
buttercream
-0.47
Rgds
-0.47
POSITIVE LOGITS
gouver
0.75
alkoh
0.75
principalColumn
0.70
déliv
0.69
kompati
0.67
évalu
0.65
dör
0.65
rémun
0.65
Nö
0.62
karton
0.62
Activations Density 0.273%