INDEX
Negative Logits
card
-0.73
slip
-0.68
flap
-0.67
session
-0.63
column
-0.62
redu
-0.62
polymer
-0.61
encounter
-0.60
random
-0.59
Tea
-0.59
POSITIVE LOGITS
aim
4.52
Aim
1.33
Aim
1.29
aimon
1.26
aiman
1.12
amation
1.05
ai
1.02
air
0.93
uel
0.92
amera
0.91
Activations Density 0.010%