INDEX
Negative Logits
BOOL
-0.06
wrongful
-0.06
inequality
-0.06
onacci
-0.06
discounted
-0.06
활동
-0.06
bowling
-0.06
/photos
-0.05
.PostMapping
-0.05
원
-0.05
POSITIVE LOGITS
./
0.07
la
0.07
consenting
0.06
ang
0.06
locker
0.06
+\
0.06
kara
0.06
جلس
0.06
姿
0.06
unlock
0.06
Activations Density 0.074%