INDEX
Explanations
names or surnames of individuals, especially when they are referred to in a professional or newsworthy context
New Auto-Interp
Negative Logits
COP
-0.80
DIRECT
-0.79
charg
-0.76
toget
-0.76
multit
-0.76
INTER
-0.75
Competitive
-0.75
seiz
-0.74
livest
-0.72
puzz
-0.71
POSITIVE LOGITS
aga
1.37
ma
1.34
na
1.30
sa
1.29
ya
1.24
va
1.21
da
1.20
ava
1.19
ana
1.14
ba
1.12
Activations Density 0.152%