INDEX
Negative Logits
:NS
-0.09
pseudo
-0.08
ứng
-0.08
carrying
-0.08
NS
-0.08
运输
-0.08
feu
-0.08
background
-0.08
transportar
-0.07
Background
-0.07
POSITIVE LOGITS
humility
0.12
admitir
0.11
admitting
0.10
acknowledges
0.10
recept
0.09
receptive
0.09
admits
0.09
acknowledging
0.09
geschlossen
0.08
openness
0.08
Activations Density 0.045%