Negative Logits
Token      Logit
PRODUCTS   -0.07
favor      -0.07
іння       -0.06
aisy       -0.06
patron     -0.06
frames     -0.06
companies  -0.06
USAGE      -0.06
-around    -0.06
런         -0.06
Positive Logits
Token         Logit
stat          0.07
ังม           0.07
prosecuted    0.07
genu          0.07
masturbation  0.07
plugins       0.06
cafes         0.06
وقت           0.06
configparser  0.06
Took          0.06
Activations Density: 0.002%