INDEX
Explanations
phrases related to accusations or suspicions involving named individuals
references to Donald Trump
Negative Logits
ruary        -0.81
dos          -0.69
ãĥ¼ãĥ«       -0.66
dden         -0.65
glers        -0.64
ariat        -0.63
ibility      -0.63
Thumbnail    -0.63
gers         -0.62
zsche        -0.61
Positive Logits
Recomm       0.73
Senate       0.61
diabetic     0.59
allergic     0.58
optimistic   0.56
Ford         0.56
palace       0.55
overrun      0.55
holiest      0.55
Intel        0.54
Activations Density: 0.033%