Explanations
words and phrases that suggest necessity or obligation
Negative Logits
ana      -0.18
imore    -0.17
elt      -0.15
yn       -0.15
aday     -0.14
trib     -0.14
ibil     -0.13
новид    -0.13
udas     -0.13
صد       -0.13
Positive Logits
661       0.15
inja      0.14
Merchant  0.14
stal      0.14
761       0.14
pie       0.14
atform    0.14
pyx       0.13
671       0.13
634       0.13
Activations Density: 0.017%