INDEX
Explanations
references to specific locations or directions
occurrences of the word "the."
Negative Logits
acca            -0.78
manship         -0.75
utm             -0.73
successfully    -0.72
uates           -0.72
thood           -0.71
teness          -0.70
oppable         -0.70
bourg           -0.70
pers            -0.69

Positive Logits
bathroom        1.22
attic           1.17
kitchen         1.16
nearest         1.14
restroom        1.10
toilet          1.08
fireplace       1.06
fridge          1.04
coolest         1.04
beach           1.01
Activations Density 0.354%
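
As a rough illustration of how a density figure like the one above could be computed, here is a minimal sketch. It assumes that "activations density" means the fraction of token positions on which the feature's activation is above zero; the page itself does not define the term. The function name `activation_density` and the toy data are hypothetical and not taken from the source.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: density = (positions where the feature fires) / (all positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    firing = sum(1 for a in activations if a > threshold)
    return firing / len(activations)


# Hypothetical toy data: 1000 token positions, the feature fires on 4 of them.
toy = [0.0] * 996 + [0.5, 1.2, 0.3, 2.0]
density = activation_density(toy)
print(f"{density:.3%}")  # prints 0.400%
```

Under this reading, a density of 0.354% would mean the feature fires on roughly 35 of every 10,000 token positions sampled.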