INDEX
Negative Logits
ింప               -0.09
inc               -0.08
functioning       -0.07
ిత                -0.07
atisch            -0.07
alpine            -0.07
inc               -0.07
ింపు              -0.07
好运              -0.07
accomplishment    -0.07
Positive Logits
^-                0.09
konuş             0.08
studying          0.08
jee               0.08
examining         0.08
ಮಾತ               0.07
料                0.07
Speaking          0.07
Stud              0.07
Price             0.07
Activations Density 0.001%