INDEX
Negative Logits
reconcile
0.70
disparate
0.69
knowing
0.67
neutrons
0.67
ignorance
0.67
distract
0.62
proclaiming
0.62
discourage
0.60
‟
0.59
disruptive
0.59
POSITIVE LOGITS
moze
0.65
puede
0.65
ultimo
0.64
Agregar
0.63
selatan
0.63
oeste
0.63
ajout
0.63
east
0.62
आंकड़ा
0.61
Aynı
0.60
Activations Density 0.036%