INDEX
Negative Logits
titudes
-0.69
🤩
-0.69
viť
-0.69
😍
-0.67
titud
-0.66
uak
-0.66
uyen
-0.66
Removal
-0.66
Wrapper
-0.65
yw
-0.65
POSITIVE LOGITS
push
2.16
push
2.00
pushing
1.69
Push
1.41
pushes
1.41
PUSH
1.38
Pushing
1.36
Pushing
1.31
pushed
1.31
Push
1.31
Activations Density 0.003%