INDEX
Explanations
references to actions or events happening behind closed doors
references to doors, especially in contexts of closure, opening, and metaphorical significance
New Auto-Interp
Negative Logits
TY
-0.80
TING
-0.76
amera
-0.70
ting
-0.69
ency
-0.68
TX
-0.68
CTV
-0.68
allah
-0.66
Astro
-0.63
OTAL
-0.63
POSITIVE LOGITS
doors
1.08
doors
1.06
holes
1.01
mith
0.97
door
0.94
pring
0.92
hips
0.86
Doors
0.85
hole
0.85
bell
0.81
Activations Density 0.010%