INDEX
Explanations
mentions or discussions of the term "fake news" in text
references to "fake news."
New Auto-Interp
Negative Logits
oples
-0.74
Scher
-0.74
xus
-0.72
joining
-0.72
kindred
-0.67
[|
-0.66
ental
-0.64
Nile
-0.63
skilled
-0.63
asse
-0.63
POSITIVE LOGITS
room
0.91
iness
0.84
rooms
0.84
feed
0.82
headlines
0.78
caster
0.77
worthy
0.77
ãĤ±
0.76
icle
0.76
ashington
0.75
Activations Density 0.031%