INDEX
Explanations
words related to doors and door-related actions
occurrences of the word "door."
New Auto-Interp
Negative Logits
ét
-0.70
cum
-0.69
ocity
-0.69
SHARE
-0.67
bis
-0.65
utan
-0.64
TM
-0.63
Bet
-0.60
utical
-0.60
ÃŃn
-0.59
POSITIVE LOGITS
door
3.76
doors
3.04
Door
2.72
door
2.67
doorway
2.23
doors
2.02
Doors
1.91
gates
1.82
gate
1.80
doorstep
1.76
Activations Density 0.015%