Explanations
words related to opposing forces or challenges
instances of the word "resistance" in various contexts
Negative Logits
tein        -0.80
monds       -0.71
cester      -0.70
ogg         -0.69
orp         -0.69
oufl        -0.68
estone      -0.68
utters      -0.67
=-=-=-=-    -0.67
uras        -0.67
Positive Logits
resistance  1.24
Resistance  0.96
resistant   0.79
fighters    0.76
istance     0.76
dictate     0.75
regimes     0.74
thereto     0.74
ngth        0.73
xual        0.72
Activations Density: 0.009%