INDEX
Explanations
terms related to obstruction or blockage in various contexts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
47
+0.12
0.7%
243
+0.11
0.6%
389
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
56
+0.12
-0.01
148
+0.11
0.00
115
+0.11
0.02
Negative Logits
proud
-1.78
suprem
-1.70
publisher
-1.66
fair
-1.53
books
-1.49
Publ
-1.47
damn
-1.47
respectful
-1.47
race
-1.45
national
-1.43
POSITIVE LOGITS
Ļª
2.90
¯
2.84
µ
2.83
ĭ
2.74
Ĩ
2.70
·¸
2.68
¨
2.67
·
2.67
Ģ
2.66
®
2.60
Activations Density 0.353%