INDEX
Explanations
words related to fighting or battling back
New Auto-Interp
Negative Logits
Newport
-0.67
viz
-0.65
realm
-0.61
Danish
-0.60
pedigree
-0.59
Dresden
-0.58
livest
-0.57
Moines
-0.57
itarian
-0.56
Corpse
-0.56
POSITIVE LOGITS
against
1.01
packs
0.99
lash
0.91
against
0.84
tears
0.84
GROUND
0.82
stab
0.80
wards
0.76
forcefully
0.75
assault
0.74
Activations Density 0.027%