INDEX
Explanations
modal verbs and expressions of necessity or obligation
New Auto-Interp
Negative Logits
öh
-0.16
iry
-0.16
xba
-0.15
Guaranteed
-0.15
Panic
-0.15
itary
-0.14
Overnight
-0.14
jom
-0.14
Ùĩر
-0.14
ember
-0.14
POSITIVE LOGITS
admit
0.24
Wonder
0.20
wonder
0.19
confess
0.18
adr
0.17
confession
0.17
admitting
0.17
polator
0.16
warn
0.16
æīĭ
0.16
Activations Density 0.036%