INDEX
Explanations
words related to enemies or opponents
terms related to enemies or opponents
New Auto-Interp
Negative Logits
Airl
-0.70
ikk
-0.68
processed
-0.65
Baby
-0.65
Processing
-0.64
INS
-0.63
photo
-0.62
express
-0.62
Quality
-0.61
processing
-0.60
POSITIVE LOGITS
foes
3.29
foe
3.24
adversaries
2.88
adversary
2.80
nem
2.33
enemies
2.06
antagonists
2.02
opponents
1.80
antagonist
1.77
rivals
1.70
Activations Density 0.017%