INDEX
Explanations
references to lies and deception in political contexts
Negative Logits
iek              -0.15
_RESOLUTION      -0.15
.createObject    -0.15
enant            -0.15
Passage          -0.14
POLIT            -0.14
smarty           -0.14
цвета            -0.14
￥               -0.14
icket            -0.13
Positive Logits
Donald           0.20
Donald           0.20
Tweets           0.17
tweets           0.17
presidency       0.16
Apprentice       0.16
ッツ             0.16
Golf             0.15
olini            0.15
realDonaldTrump  0.15
Activations Density 0.147%