INDEX
Explanations
information related to armed conflicts, wars, and violent incidents
phrases referencing significant events that result in casualties or loss of life
New Auto-Interp
Negative Logits
isSpecialOrderable
-0.79
learns
-0.72
Dialogue
-0.71
affirmative
-0.68
Fair
-0.67
Pros
-0.66
Plays
-0.65
representations
-0.64
Opt
-0.64
Recomm
-0.63
POSITIVE LOGITS
ravaged
1.30
devastated
1.28
engulfed
1.18
raged
1.17
engulf
1.16
killed
1.14
rocked
1.09
crippled
1.09
devast
1.06
displaced
1.05
Activations Density 0.167%