INDEX
Explanations
phrases related to defending or defense
instances of the word "defend" in various contexts
Negative Logits
NetMessage    -0.70
explode       -0.69
hall          -0.68
Ju            -0.68
ucket         -0.63
foot          -0.63
Hop           -0.63
Machine       -0.62
mad           -0.62
bows          -0.62
Positive Logits
against       0.97
atively       0.90
defending     0.87
Against       0.81
defends       0.76
ively         0.76
orate         0.75
ably          0.75
iveness       0.74
ously         0.72
Activations Density: 0.026%