Explanation: negative responses or denials
Negative Logits
Token             Logit
AssemblyCulture   -1.11
Roskov            -0.97
Autorizaciones    -0.96
myſelf            -0.94
tartalomajánló    -0.92
himſelf           -0.92
itſelf            -0.91
سكانية            -0.90
ViewImports       -0.89
saites            -0.88
Positive Logits
Token     Logit
no        0.94
No        0.80
No        0.80
,         0.77
no        0.75
matter    0.70
NO        0.67
big       0.65
!         0.64
it        0.61
Activations Density: 0.070%