INDEX
Explanations
phrases related to equality and social justice
Negative Logits
asus        -0.81
ibur        -0.77
ESE         -0.76
resa        -0.75
edient      -0.72
ée          -0.72
NetMessage  -0.70
Palestin    -0.69
etts        -0.68
és          -0.68
Positive Logits
technically  1.24
admittedly   1.04
occasional   1.00
ostensibly   1.00
slight       0.94
initially    0.92
disagree     0.90
outward      0.87
caveats      0.87
setbacks     0.84
Activations Density: 2.433%