INDEX
Explanations
phrases or words related to opposition or contention
instances of the word "against" in various contexts
Negative Logits
oola       -0.90
ulous      -0.89
çīĪ        -0.88
ulously    -0.85
chin       -0.85
aukee      -0.76
hops       -0.73
retty      -0.73
kered      -0.73
ruary      -0.73
Positive Logits
whom          0.93
Humanity      0.90
him           0.88
prejudice     0.82
tyranny       0.80
backdrop      0.80
unreasonable  0.79
encro         0.78
humanity      0.78
them          0.78
Activations Density 0.071%