INDEX
Explanations
phrases related to violence and conflict
terms associated with violence and firearms
New Auto-Interp
Negative Logits
pend
-0.65
Huntington
-0.63
Negative
-0.61
hoe
-0.60
oat
-0.57
aughed
-0.56
CTR
-0.54
concerned
-0.53
jun
-0.53
ãĥ£
-0.53
POSITIVE LOGITS
aciously
0.74
waves
0.70
ths
0.70
eper
0.68
EMA
0.67
tics
0.65
Film
0.64
steen
0.64
.>>
0.63
with
0.63
Activations Density 0.188%