INDEX
NEGATIVE LOGITS
vért      0.52
Magyar    0.51
权的      0.50
Names     0.49
hormon    0.49
Ephesus   0.49
говор     0.49
itis      0.48
ivre      0.47
nötig     0.47
POSITIVE LOGITS
rejection    0.67
removal      0.55
dismissal    0.55
deletion     0.54
realisation  0.54
ﯼ            0.54
submissions  0.53
raining      0.53
submission   0.53
refusal      0.52
Activations Density 0.000%