INDEX
Explanations
instances of words related to manipulation or control
words and phrases related to manipulation and distortion of truth or facts
New Auto-Interp
Negative Logits
ccoli
-0.81
starter
-0.71
¯¯¯¯
-0.70
âĹ¼
-0.70
zik
-0.69
hesda
-0.65
SourceFile
-0.65
oother
-0.65
esides
-0.65
alone
-0.64
POSITIVE LOGITS
perceptions
1.16
facts
1.04
reality
1.01
minds
1.00
realities
0.96
headlines
0.92
emotions
0.92
events
0.89
timelines
0.87
opinions
0.84
Activations Density 0.193%