INDEX
Negative Logits
terdapat
0.89
aprob
0.88
위한
0.84
anthropology
0.83
groceries
0.83
desktop
0.79
储
0.79
desk
0.77
offices
0.77
approbation
0.76
POSITIVE LOGITS
could
0.76
रिश्ते
0.74
팀
0.74
rothed
0.73
they
0.72
mannschaft
0.72
hxg
0.71
jej
0.70
टीम
0.70
weakened
0.70
Activations Density 0.002%