INDEX
Explanations
phrases related to resistance or fighting back
instances of the phrase "fight back."
New Auto-Interp
Negative Logits
Jew
-0.63
viz
-0.61
LINE
-0.58
Chosen
-0.58
Motorsport
-0.57
nt
-0.57
WW
-0.57
Monaco
-0.57
Vu
-0.56
kes
-0.56
POSITIVE LOGITS
GROUND
0.90
)=(
0.85
packs
0.83
wards
0.83
tracking
0.78
track
0.78
vironment
0.76
dated
0.76
othal
0.71
trace
0.71
Activations Density 0.030%