INDEX
Explanations
descriptions of condition and quality related to objects or situations
Negative Logits
voks      -0.16
_SAFE     -0.15
enus      -0.15
ots       -0.15
umb       -0.14
UMB       -0.14
_barrier  -0.14
оÑħ       -0.14
áž        -0.14
llen      -0.13
Positive Logits
shape      0.77
Shape      0.64
shape      0.63
Shape      0.55
condition  0.51
shapes     0.49
_shape     0.48
.shape     0.47
(shape     0.43
Shapes     0.41
Activations Density: 0.046%