NEGATIVE LOGITS
贍          0.41
schal       0.41
वर्गीकृत      0.39
Tiv         0.37
成績        0.36
umine       0.36
udades      0.36
坟          0.36
deletions   0.36
縱          0.35
POSITIVE LOGITS
lamb        0.66
weapon      0.63
Weapon      0.57
Lamb        0.56
Weapon      0.56
affairs     0.55
lambda      0.55
AFFAIRS     0.55
lambda      0.54
weap        0.54
Activations Density 0.000%