Negative Logits
presiding     0.41
oversee       0.39
ub            0.38
overseeing    0.37
SiO           0.37
x             0.36
decides       0.36
cat           0.36
acet          0.36
uret          0.36
Positive Logits
contribution  0.98
contribute    0.95
贡献          0.93
貢献          0.93
Contribution  0.93
contributes   0.91
Contribution  0.90
burden        0.89
CONTRIBUTION  0.89
contributing  0.88
Activations Density: 0.026%