INDEX
Negative Logits
one
-0.08
ten
-0.07
هو
-0.06
in
-0.06
One
-0.06
employ
-0.06
・・
-0.06
Describe
-0.06
laşma
-0.06
Tur
-0.06
POSITIVE LOGITS
ally
0.14
ly
0.14
ively
0.11
LY
0.11
ely
0.10
ially
0.10
ALLY
0.10
ically
0.10
orally
0.09
ingly
0.09
Activations Density 0.531%