INDEX
Explanations
references to awards or recognitions in athletics or academia
New Auto-Interp
Negative Logits
abilia
-0.17
Dar
-0.17
riter
-0.16
Bra
-0.16
Lect
-0.15
ocop
-0.15
cept
-0.15
fal
-0.14
Garland
-0.14
ynet
-0.14
POSITIVE LOGITS
ãĥ¼ãĥĪ
0.16
elig
0.14
ucht
0.14
ÙIJÙĥ
0.14
baptized
0.14
Marketable
0.14
StÅĻed
0.14
ihan
0.14
£
0.14
обов
0.14
Activations Density 0.322%