Negative Logits
atac        0.43
arbitrary   0.42
datos       0.42
sticker     0.42
propagand   0.42
cram        0.40
painless    0.40
bağı        0.40
puns        0.40
razon       0.40
Positive Logits
Leadership      1.71
Leadership      1.62
leadership      1.51
leadership      1.43
liderazgo       1.25
interpersonal   1.24
лидер           1.22
领导            1.20
領導            1.16
Leaders         1.08
Activations Density: 0.055%