INDEX
Negative Logits
extrem
-0.08
eu
-0.08
nationals
-0.08
eu
-0.08
उपाय
-0.07
Brazilian
-0.07
governing
-0.07
pathogens
-0.07
_full
-0.07
IDs
-0.07
POSITIVE LOGITS
Came
0.11
wears
0.09
edo
0.08
arr
0.08
Wear
0.08
จ
0.08
worn
0.08
wore
0.08
obbl
0.08
Wear
0.08
Activations Density 0.003%