INDEX
Explanations
mentions of political figures and actions-related to criticism or consequences
the word "MORE" and its association with political commentary or analysis
New Auto-Interp
Negative Logits
spring
-0.69
arist
-0.69
ars
-0.66
ãĥ³
-0.65
release
-0.64
bas
-0.63
ãĥª
-0.63
king
-0.63
idem
-0.63
1932
-0.63
POSITIVE LOGITS
MORE
1.28
VIDEOS
0.97
pedia
0.86
osponsors
0.83
FTWARE
0.81
Flavoring
0.78
eller
0.75
aukee
0.75
abund
0.74
ellen
0.73
Activations Density 0.006%