INDEX
Negative Logits
_destroy
-0.07
кого
-0.07
miền
-0.07
où
-0.07
/left
-0.07
cour
-0.07
Forward
-0.07
lore
-0.06
contradictory
-0.06
detox
-0.06
POSITIVE LOGITS
annual
0.11
Annual
0.09
annually
0.07
Annual
0.07
BN
0.06
>(),
0.06
%,
0.06
ual
0.06
_ANAL
0.06
b
0.06
Activations Density 0.012%