INDEX
Explanations
comparisons or evaluations related to quantity or quality
New Auto-Interp
Negative Logits
YES
-0.80
çļ
-0.80
Ĥİ
-0.80
kus
-0.80
ilts
-0.73
ortment
-0.66
odium
-0.65
idelines
-0.64
querade
-0.64
iot
-0.63
POSITIVE LOGITS
anymore
1.25
nor
0.90
bothered
0.88
bother
0.82
flashy
0.77
bothering
0.77
fancy
0.75
consolation
0.75
noticeable
0.74
enough
0.73
Activations Density 0.172%