INDEX
Negative Logits
NewLabel
-0.07
Cancel
-0.07
Satellite
-0.06
320
-0.06
STDCALL
-0.06
nam
-0.06
mult
-0.06
_sleep
-0.06
_NOTIFICATION
-0.06
dose
-0.06
POSITIVE LOGITS
Iron
0.14
iron
0.12
Iron
0.10
iron
0.09
IRON
0.08
ir
0.07
Inn
0.07
IDF
0.07
:o
0.07
irony
0.06
Activations Density 0.010%