INDEX
Explanations
phrases related to destruction or aggressive actions
mentions of destruction or damaging actions related to various subjects
New Auto-Interp
Negative Logits
ogg
-0.71
oult
-0.71
olkien
-0.71
eller
-0.69
orney
-0.69
Redditor
-0.68
helps
-0.68
zag
-0.67
agonists
-0.67
venture
-0.66
POSITIVE LOGITS
entire
1.05
havoc
0.99
livelihood
0.92
empires
0.86
credibility
0.85
morale
0.85
whole
0.82
illusions
0.82
planet
0.82
habitat
0.82
Activations Density 0.287%