INDEX
NEGATIVE LOGITS
avio         -0.08
jakie        -0.08
Ottoman      -0.08
incl         -0.08
elateerde    -0.07
survivors    -0.07
termasuk     -0.07
обзор        -0.07
were         -0.07
demeanor     -0.07
POSITIVE LOGITS
绑定         0.09
授           0.09
(bind        0.08
绑           0.08
drain        0.08
赋           0.08
|(           0.08
exclusiva    0.08
condicion    0.08
feeding      0.08
Activations Density 0.003%