INDEX
Explanations
occurrences of the word "the."
NEGATIVE LOGITS
thood          -0.76
Spread         -0.69
amphetamine    -0.69
FIELD          -0.69
illion         -0.68
ÃĥÃĤ           -0.67
adesh          -0.66
illac          -0.65
abel           -0.65
acea           -0.65
POSITIVE LOGITS
interviewer    1.02
Associated     0.99
Huffington     0.99
BBC            0.98
latter         0.98
latest         0.97
Guardian       0.93
same           0.91
Chronicle      0.90
ABC            0.87
Activations Density 0.044%