INDEX
Explanations
phrases expressing support and unity concerning social justice issues
New Auto-Interp
Negative Logits
Patron
-0.17
inges
-0.15
ugs
-0.15
jerne
-0.15
lc
-0.15
slow
-0.15
Hood
-0.14
ekt
-0.14
.
-0.14
Vectorizer
-0.14
POSITIVE LOGITS
stood
0.20
_stand
0.19
stand
0.19
Stand
0.19
Stand
0.18
-alone
0.17
stands
0.17
yer
0.16
stand
0.16
idget
0.15
Activations Density 0.035%