INDEX
Explanations
mentions of and discussions around the concept of 'fake news'
tokens indicating the end of a text or section
New Auto-Interp
Negative Logits
Dialogue
-0.63
alties
-0.62
ãĢĮ
-0.60
rises
-0.59
KR
-0.59
AAF
-0.59
Leilan
-0.58
Keller
-0.57
essage
-0.56
immer
-0.56
POSITIVE LOGITS
usterity
0.97
tenance
0.94
"
0.87
`,
0.86
»
0.83
terday
0.83
\)
0.82
"!
0.82
'?
0.81
/,
0.80
Activations Density 0.308%