INDEX
Explanations
phrases related to resistance or opposition
phrases indicating opposition or resistance from various groups or individuals
Negative Logits
imate       -0.86
puter       -0.82
stakes      -0.79
perture     -0.76
flix        -0.74
Saharan     -0.72
itution     -0.71
acre        -0.71
velt        -0.71
merce       -0.70
Positive Logits
afar          1.32
abroad        0.94
inside        0.93
passers       0.83
locals        0.82
within        0.76
anywhere      0.75
constituents  0.74
peers         0.72
elsewhere     0.72
Activations Density 0.117%