INDEX
Explanations
instances of the phrase "fight back" in various contexts
New Auto-Interp
Negative Logits
obi
-0.17
anela
-0.17
icari
-0.15
vé
-0.15
oft
-0.15
инÑĥ
-0.14
DM
-0.14
irut
-0.14
aku
-0.14
ssi
-0.14
POSITIVE LOGITS
USTER
0.15
blows
0.15
tern
0.14
ạt
0.14
dale
0.14
conc
0.13
NavItem
0.13
iko
0.13
_defs
0.13
emble
0.13
Activations Density 0.009%