INDEX
Negative Logits
Premio
0.44
ane
0.43
parchment
0.43
who
0.42
turut
0.42
admite
0.39
Alte
0.39
ax
0.39
выво
0.39
celebrate
0.38
POSITIVE LOGITS
complement
0.90
Complement
0.89
complement
0.86
complements
0.83
Complement
0.82
complément
0.79
complemento
0.77
complementing
0.75
ary
0.72
ergän
0.72
Activations Density 0.008%