INDEX
Explanations
mentions of Russia and its government, as well as references to Russian individuals and interference
New Auto-Interp
Negative Logits
dash
-0.78
erick
-0.77
Thom
-0.77
draw
-0.75
Dublin
-0.75
Dickinson
-0.73
Towns
-0.72
Kier
-0.71
enment
-0.71
liam
-0.71
POSITIVE LOGITS
collusion
1.11
meddling
1.10
hacking
1.02
intelligence
0.99
hoax
0.96
disinformation
0.96
espionage
0.94
dossier
0.93
interference
0.92
intelligence
0.91
Activations Density 0.037%