INDEX
Explanations
mentions of individuals or groups and their associations within a social context
New Auto-Interp
Negative Logits
ACHUSET
-0.58
okuyayım
-0.51
vábbi
-0.51
かん
-0.50
цезда
-0.49
UnusedPrivate
-0.49
Ikke
-0.48
甭
-0.47
nalités
-0.47
whereof
-0.47
POSITIVE LOGITS
does
2.22
do
2.20
did
1.98
does
1.75
Do
1.66
do
1.66
Does
1.65
DID
1.64
DO
1.64
Does
1.63
Activations Density 0.456%