INDEX
Negative Logits
th
-0.15
adol
-0.15
aber
-0.15
pike
-0.14
Bro
-0.13
utton
-0.13
thro
-0.13
Cook
-0.13
pi
-0.13
paired
-0.13
POSITIVE LOGITS
mee
0.17
ắc
0.16
ynet
0.15
جات
0.15
sứ
0.14
lettes
0.14
ìĤ
0.14
LETTE
0.14
tee
0.14
stants
0.14
Activations Density 0.015%