INDEX
Negative Logits
asha
-0.82
atre
-0.68
aten
-0.68
skirts
-0.65
afort
-0.63
uting
-0.63
Cosponsors
-0.63
taboola
-0.61
uffle
-0.61
lander
-0.60
POSITIVE LOGITS
pause
0.72
permission
0.67
berth
0.62
congratulations
0.62
leverage
0.62
İĭ
0.61
incentive
0.61
insight
0.58
flexibility
0.58
preferential
0.58
Activations Density 0.218%