INDEX
Negative Logits
decenas
0.52
sporadically
0.45
fasted
0.45
placed
0.42
ふわ
0.42
aka
0.42
chlorinated
0.40
faciles
0.40
outliers
0.39
outlier
0.38
POSITIVE LOGITS
retrieving
0.39
从事
0.39
Corfu
0.39
Studying
0.38
ității
0.38
Retrie
0.38
idcar
0.37
ayment
0.36
证实
0.36
Study
0.36
Activations Density 0.004%