INDEX
Explanations
contrasts between external appearances and internal qualities or conditions
New Auto-Interp
Negative Logits
atel
-0.17
ungal
-0.15
Peripheral
-0.14
.Accessible
-0.14
globally
-0.14
wide
-0.13
ungi
-0.13
kud
-0.13
out
-0.13
paran
-0.13
POSITIVE LOGITS
inside
1.07
Inside
0.98
Inside
0.95
inside
0.94
interior
0.80
_inside
0.79
внÑĥÑĤÑĢи
0.68
åĨħéĥ¨
0.67
åĨħ
0.65
interiors
0.64
Activations Density 0.383%