INDEX
Negative Logits
s
0.72
lenguaje
0.71
0.67
relationship
0.64
with
0.63
relation
0.61
in
0.61
pacing
0.58
since
0.57
relational
0.57
POSITIVE LOGITS
tròn
0.80
rounding
0.70
可爱
0.70
Rounding
0.69
redondo
0.68
atelle
0.67
круг
0.66
ومو
0.66
కే
0.66
roundup
0.66
Activations Density 0.028%