INDEX
Explanations
instances where doors are being opened or closed
references to doors and their states of being open or closed
New Auto-Interp
Negative Logits
tein
-0.70
emetery
-0.66
article
-0.64
Ranked
-0.64
oké
-0.63
ORTS
-0.63
Wage
-0.61
ostr
-0.60
obos
-0.59
rities
-0.59
POSITIVE LOGITS
omin
0.99
automatically
0.96
abruptly
0.94
correctly
0.89
prematurely
0.87
momentarily
0.87
loudly
0.86
incorrectly
0.86
smoothly
0.85
unexpectedly
0.83
Activations Density 0.258%