INDEX
Explanations
references to body positions
instances of the word "the" and its context related to locations or positions
New Auto-Interp
Negative Logits
thood
-0.86
ividual
-0.80
lance
-0.73
hire
-0.72
eful
-0.71
illac
-0.69
iac
-0.69
Bloom
-0.68
uls
-0.67
lessly
-0.67
POSITIVE LOGITS
pavement
1.38
sidewalk
1.37
floor
1.37
ground
1.28
sofa
1.25
porch
1.24
couch
1.24
ledge
1.22
doorstep
1.19
balcony
1.17
Activations Density 0.131%