INDEX
Negative Logits
robot
-0.07
Fort
-0.06
lis
-0.06
fights
-0.06
alternatives
-0.06
WALL
-0.06
stinence
-0.06
felony
-0.06
.ascii
-0.06
larg
-0.06
POSITIVE LOGITS
ابعة
0.06
Ebay
0.06
ulfill
0.06
опрос
0.06
маст
0.06
_extended
0.06
PressEvent
0.06
cciones
0.06
sklad
0.06
выход
0.06
Activations Density 0.003%