INDEX
Explanations
references to individuals and their statements in media contexts
New Auto-Interp
Negative Logits
=č↵
-0.17
arget
-0.16
coop
-0.16
ãĥ¬ãĤ¹
-0.15
ngrx
-0.15
âĹıâĹı
-0.15
ziej
-0.15
discharge
-0.14
iedade
-0.14
diam
-0.14
POSITIVE LOGITS
told
0.26
according
0.23
tell
0.20
reportedly
0.20
anonymously
0.18
CBS
0.18
ccording
0.18
tells
0.17
Tell
0.17
According
0.17
Activations Density 0.159%