INDEX
NEGATIVE LOGITS
約              0.45
5               0.45
ο               0.45
これらの        0.45
                0.44
9               0.43
学者            0.42
μένες           0.42
extravagant     0.41
irregularly     0.41
POSITIVE LOGITS
că              0.48
tío             0.47
chas            0.43
choć            0.42
osław           0.41
unoscut         0.40
Methodist       0.40
कै              0.40
disinterested   0.39
maid            0.39
Activations Density: 0.010%