INDEX
Explanations
words related to conflict and negative impacts like war, injury, disease, and danger
themes related to conflict, suffering, and moral implications of actions
New Auto-Interp
Negative Logits
referen
-0.66
Liv
-0.62
ofi
-0.62
Marketplace
-0.61
abase
-0.61
trademark
-0.60
Electoral
-0.59
Ri
-0.58
ynchron
-0.58
Socket
-0.57
POSITIVE LOGITS
flies
0.97
killers
0.88
istically
0.86
bows
0.86
lessly
0.84
worms
0.83
lessness
0.82
bugs
0.82
seekers
0.82
ously
0.81
Activations Density 0.363%