INDEX
Explanations
phrases related to standing up and advocating for rights or beliefs
New Auto-Interp
Negative Logits
ÑĢап
-0.15
alez
-0.15
алеж
-0.15
eral
-0.14
263
-0.14
aight
-0.14
wors
-0.13
Aim
-0.13
605
-0.13
dre
-0.13
POSITIVE LOGITS
stand
0.38
stood
0.31
stands
0.30
Stand
0.28
standing
0.28
Stand
0.25
_stand
0.25
assert
0.24
standing
0.23
voice
0.23
Activations Density 0.324%