INDEX
Explanations
intensifiers that convey strong opinions or feelings of agreement/disagreement
New Auto-Interp
Negative Logits
chein
-0.18
iÄħ
-0.16
isque
-0.15
ICA
-0.15
ikal
-0.14
isma
-0.14
iez
-0.14
ica
-0.13
strchr
-0.13
oret
-0.13
POSITIVE LOGITS
anymore
0.19
ÙĴع
0.15
arding
0.15
apas
0.14
y
0.14
воз
0.14
ibi
0.14
OMPI
0.13
even
0.13
coat
0.13
Activations Density 0.049%