INDEX
Explanations
references to doors, such as holding the door open or closing the door
the word "door" and its variations in different contexts
Negative Logits
ollah    -0.78
irtual   -0.73
lihood   -0.73
ivia     -0.71
ting     -0.70
ual      -0.69
TING     -0.68
ousand   -0.65
azeera   -0.65
milo     -0.65
Positive Logits
bell     1.31
door     1.19
steps    1.14
doors    1.06
doors    1.03
ways     0.99
opener   0.97
Door     0.93
door     0.92
holes    0.90
Activations Density 0.035%