INDEX
Explanations
phrases indicating necessity or obligation
New Auto-Interp
Negative Logits
.Focused
-0.17
acom
-0.16
à¸Ńà¸Ļà¸Ĺ
-0.16
anuts
-0.15
tring
-0.15
iage
-0.15
αÏģά
-0.14
shaw
-0.14
ئ
-0.14
olas
-0.14
POSITIVE LOGITS
anymore
0.19
need
0.18
NEED
0.17
need
0.17
Needed
0.16
Need
0.16
ارد
0.16
needed
0.16
δή
0.15
necessity
0.15
Activations Density 0.066%