INDEX
Negative Logits
r
0.52
a
0.52
;
0.48
ing
0.45
io
0.45
Roo
0.44
,
0.43
伊
0.43
々
0.42
adhesive
0.42
POSITIVE LOGITS
salud
0.51
preached
0.49
ટર
0.48
sirve
0.47
tourisme
0.45
caminar
0.45
nonprofits
0.45
াসের
0.44
taxed
0.44
ϫ
0.44
Activations Density 0.000%