INDEX
Explanations
phrases related to destruction and conflict
instances of destruction or harm in contexts related to conflict
New Auto-Interp
Negative Logits
ITNESS
-0.69
PLIED
-0.68
Assistance
-0.64
Representative
-0.63
ç·
-0.63
irin
-0.63
Lithuan
-0.62
Dating
-0.62
ogle
-0.61
>[
-0.61
POSITIVE LOGITS
favor
0.84
senseless
0.81
pload
0.75
merciless
0.73
favour
0.72
scourge
0.71
sheer
0.71
brutally
0.70
entirety
0.69
manageable
0.68
Activations Density 0.483%