Explanations
action verbs and words related to confrontation or conflict
terms related to offensive actions and military activities
Negative Logits
OGR         -0.70
pedia       -0.68
ibur        -0.66
fantas      -0.62
ICS         -0.62
awoken      -0.61
uthor       -0.61
complicit   -0.60
CAT         -0.60
Plus        -0.59
Positive Logits
offensive   3.32
eff         2.04
attack      1.41
went        1.38
expensive   1.37
stood       1.26
sale        1.16
alien       1.13
obscurity   0.98
attacks     0.97
Activations Density: 0.029%