INDEX
Explanations
actions or events involving violence or conflict
phrases related to crises or emergencies
New Auto-Interp
Negative Logits
Cosponsors
-0.80
alde
-0.74
iaries
-0.73
sbm
-0.71
uploads
-0.68
raft
-0.67
umerous
-0.66
redes
-0.65
hent
-0.64
"]=>
-0.63
POSITIVE LOGITS
somebody
1.49
someone
1.41
something
1.21
your
1.14
someone
1.10
grandma
1.06
oneself
1.04
yourself
1.03
Someone
1.02
somewhere
1.00
Activations Density 0.668%