INDEX
Explanations
phrases related to political issues and government actions
Negative Logits
Token       Logit
mentioned   -0.72
ukong       -0.67
Laughs      -0.67
yeah        -0.66
['          -0.65
said        -0.65
mentions    -0.65
Yeah        -0.64
ozo         -0.63
Yeah        -0.63
Positive Logits
Token               Logit
unfairly            1.45
unfair              0.98
outwe               0.89
misunderstood       0.88
unlawfully          0.88
underestimated      0.87
overest             0.85
disproportionately  0.84
underest            0.83
siph                0.83
Activations Density 0.416%
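
As a rough illustration of how a density figure like the one above could be read, the sketch below treats activation density as the fraction of tokens on which the feature has a nonzero activation. That definition, the function name `activation_density`, the `threshold` parameter, and the sample data are all assumptions made for illustration; the page itself does not say how the 0.416% figure was computed.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1000 tokens, of which 4 have a nonzero activation.
sample = [0.0] * 996 + [1.2, 0.3, 2.5, 0.7]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.400%
```

Under this reading, a density of 0.416% would mean the feature fires on roughly 4 of every 1000 tokens.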