INDEX
Explanations
expressions related to espionage or conspiracy
references to political and social manipulation
New Auto-Interp
Negative Logits
"}],"
-0.83
uterte
-0.80
âĸº
-0.80
âĢ
-0.80
âĸł
-0.79
20439
-0.78
âϦ
-0.78
»
-0.75
Cosponsors
-0.74
âĢ
-0.73
POSITIVE LOGITS
basically
1.04
crappy
0.98
supposedly
0.96
crap
0.96
ridiculously
0.95
magically
0.93
downright
0.93
hilar
0.91
pretty
0.90
clueless
0.89
Activations Density 1.071%