INDEX
Explanations
references to doors and their states, particularly in relation to being closed or locked
Negative Logits
ждународ          -0.55
<<<<<<<<<<<<<<    -0.54
vaid              -0.53
noqa              -0.52
astray            -0.51
はじめに          -0.51
">*               -0.50
oarece            -0.50
Heer              -0.49
lòng              -0.49
Positive Logits
closed     1.05
closure    1.03
closing    1.03
closes     1.02
shut       0.91
閉         0.89
Closed     0.89
shuts      0.88
Closed     0.87
Closure    0.86
Activations Density 0.323%