NEGATIVE LOGITS
Token            Logit
koşul            -0.07
lesions          -0.07
并               -0.07
-ms              -0.07
miners           -0.07
.creation        -0.07
rol              -0.07
76               -0.06
_FIN             -0.06
girls            -0.06
POSITIVE LOGITS
Token            Logit
appropriate      0.20
appropriately    0.12
appropriate      0.11
ropriate         0.10
suit             0.08
inappropriate    0.08
Suit             0.07
APT              0.07
App              0.07
                 0.07
Activations Density: 0.027%