INDEX
Negative Logits
mitigating    -0.08
agd           -0.08
desal         -0.07
lamp          -0.07
IPO           -0.07
ernal         -0.07
miti          -0.07
counsel       -0.07
-repeat       -0.07
/Search       -0.07
Positive Logits
reds          0.10
紅            0.09
rojo          0.09
rouges        0.09
红            0.08
_RED          0.08
辣            0.08
vermelho      0.08
lipstick      0.08
fiery         0.08
Activations Density: 0.037%