INDEX
Explanations
names followed by the action of telling or informing
instances of the word "told" in various contexts
New Auto-Interp
Negative Logits
isol
-0.76
ILCS
-0.73
imposed
-0.68
artifacts
-0.68
adesh
-0.66
fet
-0.66
ãĥİ
-0.65
aband
-0.65
execute
-0.64
ciplinary
-0.64
POSITIVE LOGITS
reporters
1.27
HuffPost
1.13
BuzzFeed
1.11
CNN
1.09
interviewer
1.08
me
1.06
CNBC
1.02
VICE
1.01
Politico
0.98
MSNBC
0.98
Activations Density 0.054%