INDEX
Explanations
words associated with opposition or defiance
instances of opposition or resistance to various policies or ideas
New Auto-Interp
Negative Logits
ammy
-0.91
aker
-0.72
argo
-0.72
arted
-0.70
hack
-0.70
akers
-0.67
negie
-0.67
Discussion
-0.66
Chance
-0.66
Redditor
-0.65
POSITIVE LOGITS
minded
0.89
opposes
0.87
oppose
0.86
opposing
0.85
vehemently
0.83
thereto
0.81
onent
0.80
="#
0.76
opposed
0.76
dissent
0.71
Activations Density 0.010%