INDEX
Negative Logits
Token         Logit
ability       -1.53
tendency      -1.15
Ability       -0.98
abilities     -0.96
Ability       -0.93
Rohy          -0.90
ability       -0.89
willingness   -0.88
propOrder     -0.88
kaarangay     -0.87
Positive Logits
Token         Logit
a             0.70
e             0.65
for           0.56
in            0.53
seen          0.52
as            0.49
s             0.49
textStatus    0.48
named         0.47
on            0.47
Activations Density 0.088%