INDEX
Explanations
descriptors related to physical appearances and conditions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
56
+0.16
1.0%
419
+0.15
0.9%
198
+0.13
0.8%
Correlated Neurons
Index
P. Corr.
Cos Sim.
198
+0.16
0.09
56
+0.15
-0.00
419
+0.13
0.07
Negative Logits
Ŀ
-4.04
¸
-4.03
Ī
-4.03
ļ
-4.02
·¸
-3.79
Īĺ
-3.77
Ĥ
-3.77
ĸ´
-3.75
¦
-3.75
ķ
-3.74
POSITIVE LOGITS
bum
1.63
backed
1.60
beneath
1.57
marks
1.49
relief
1.48
sides
1.48
plasty
1.47
ball
1.46
wagen
1.45
horn
1.45
Activations Density 1.466%