INDEX
NEGATIVE LOGITS
pored          0.60
rife           0.59
fix            0.59
questioning    0.58
াপাশি          0.58
battered       0.57
mischiev       0.57
aider          0.56
datasets       0.56
manc           0.56
POSITIVE LOGITS
least          1.70
least          1.53
Least          1.50
Least          1.28
razine         0.89
oll            0.83
variance       0.82
rocities       0.82
hene           0.79
ividades       0.79
ACTIVATIONS DENSITY: 0.042%