Explanations
verbs related to actions or events performed by public figures
verbs related to political actions and decisions
Negative Logits
Token      Logit
──         -0.71
arry       -0.70
soc        -0.68
houses     -0.68
omever     -0.68
cause      -0.64
lins       -0.64
Soc        -0.64
bie        -0.64
……         -0.63
Positive Logits
Token        Logit
himself      0.94
scathing     0.83
rhet         0.83
retweet      0.81
applause     0.80
briefly      0.78
upbeat       0.77
apologizing  0.76
cautiously   0.76
angrily      0.75
Activations Density 0.407%