INDEX
NEGATIVE LOGITS
Token         Logit
Fool          -0.09
Bale          -0.08
DESC          -0.08
Allow         -0.08
Ablauf        -0.08
Envelope      -0.08
blind         -0.07
ardonnay      -0.07
Allow         -0.07
рақ           -0.07

POSITIVE LOGITS
Token         Logit
Gef           0.08
exploratory   0.08
исправ        0.08
edging        0.08
devices       0.08
arcade        0.08
kum           0.07
equivalents   0.07
ionais        0.07
探索          0.07
Activations Density: 0.004%