INDEX
Explanations
references to actions or events happening behind metaphorical or physical closed doors
references to the concept of "doors" in various contexts
New Auto-Interp
Negative Logits
TY
-0.80
Vide
-0.73
TX
-0.72
TING
-0.71
CTV
-0.68
olson
-0.66
ting
-0.64
TOD
-0.63
lihood
-0.63
Hots
-0.63
POSITIVE LOGITS
doors
1.26
doors
1.14
bell
1.09
door
1.07
pring
1.03
holes
1.02
gates
0.98
Doors
0.94
opening
0.93
mith
0.93
Activations Density 0.020%