INDEX
Explanations
expressions of hate or negative feelings toward various subjects
Negative Logits
Token             Logit
jurk              -0.47
reason            -0.42
doInBackground    -0.41
donnée            -0.39
ClientRect        -0.39
soeur             -0.39
diff              -0.38
Polly             -0.38
NonQuery          -0.37
wwww              -0.37
Positive Logits
Token             Logit
Савезне           0.91
disambiguazione   0.71
homonymie         0.61
Biôgrafia         0.60
distancing        0.60
CanadaChoose      0.59
ujednoznacz       0.57
Италијани         0.57
Italijanski       0.56
Gemein            0.56
Activations Density: 0.205%