INDEX
Explanations
political figures or media outlets being criticized
instances of the word "criticized" and its variations
Negative Logits
OTE     -0.73
ëĭ      -0.71
bourne  -0.69
mop     -0.68
ther    -0.67
ulhu    -0.66
aho     -0.66
ammy    -0.65
nown    -0.65
mad     -0.64
Positive Logits
imaru       0.73
Cosponsors  0.73
harshly     0.69
comments    0.63
remarks     0.63
Stab        0.62
him         0.61
Orb         0.61
critiques   0.61
critic      0.61
Activations Density 0.050%