INDEX
Explanations
phrases related to resistance against authority or oppression
references to the concept of resistance in various contexts
New Auto-Interp
Negative Logits
ghazi
-0.81
estone
-0.74
tein
-0.73
Loaded
-0.71
ewater
-0.70
utra
-0.67
gow
-0.67
liner
-0.67
oba
-0.67
Pengu
-0.65
POSITIVE LOGITS
resistance
1.09
istance
0.85
Resistance
0.80
fighters
0.78
atility
0.77
movements
0.75
yip
0.74
movement
0.74
regimes
0.74
induction
0.72
Activations Density 0.009%