Negative Logits
aunque      1.50
evitando    1.46
wszel       1.44
afin        1.44
délais      1.40
любые       1.36
таких       1.36
quelles     1.35
กรณี        1.33
данном      1.33
Positive Logits
will        1.15
Princess    1.11
has         1.07
warrior     1.05
loid        1.03
eward       1.03
sends       1.01
Warrior     0.99
overthrow   0.99
into        0.97
Activations Density 0.164%