INDEX
Explanations
measurements related to depth and height
Negative Logits
nez         -0.15
Lob         -0.15
инку        -0.15
gnore       -0.15
hti         -0.14
longitud    -0.14
тю          -0.14
arg         -0.14
tring       -0.14
otherapy    -0.14
Positive Logits
depth       0.23
depths      0.21
-depth      0.19
layer       0.19
layers      0.18
height      0.18
depth       0.18
-height     0.17
heights     0.17
Depth       0.17
Activations Density 0.116%