INDEX
Negative Logits
OUGH
-0.08
-treated
-0.07
sense
-0.07
acting
-0.06
Acting
-0.06
portun
-0.06
Enough
-0.06
azing
-0.06
urg
-0.06
τέλε
-0.06
POSITIVE LOGITS
Mits
0.06
loan
0.06
ebony
0.06
Monter
0.06
張
0.06
tou
0.06
onSubmit
0.06
listener
0.05
Monte
0.05
NAT
0.05
Activations Density 0.030%