INDEX
Explanations
references to sports uniforms or clothing
Negative Logits
ož            -0.17
Shoe          -0.15
éŀĭ           -0.15
aney          -0.14
shoe          -0.14
)((((         -0.14
ãĥĥãĥģ        -0.14
icket         -0.14
Sofa          -0.14
Compliance    -0.14
POSITIVE LOGITS
uniform       0.43
uniform       0.36
uniforms      0.34
Uniform       0.33
Uniform       0.33
wearing       0.31
wear          0.28
wears         0.27
.uniform      0.25
Wear          0.25
Activations Density: 0.149%