INDEX
NEGATIVE LOGITS
contamination         -0.07
random                -0.07
(token text missing)  -0.07
unde                  -0.07
heavy                 -0.07
represented           -0.07
imenta                -0.07
investigation         -0.07
offen                 -0.07
floor                 -0.07
POSITIVE LOGITS
bcrypt  0.09
vindt   0.08
nachts  0.08
/how    0.08
Rate    0.08
feel    0.08
(++     0.08
feels   0.08
borst   0.08
nyaman  0.08
Activations Density 0.002%