INDEX
Negative Logits
للمعارف
-0.72
GEBURTSDATUM
-0.67
webElement
-0.65
виправивши
-0.65
ValueStyle
-0.63
betweenstory
-0.61
Spoljašnje
-0.60
الحره
-0.60
WithMany
-0.60
חיצוניים
-0.59
POSITIVE LOGITS
out
0.65
to
0.50
pshots
0.46
from
0.45
zetek
0.44
for
0.44
నా
0.44
Tazama
0.44
inspiration
0.43
codegen
0.43
Activations Density 0.001%