INDEX
Negative Logits
essay
0.40
password
0.39
Essay
0.38
portion
0.36
merci
0.36
мол
0.36
ğer
0.36
geral
0.36
fascination
0.36
Don
0.36
POSITIVE LOGITS
্সা
0.42
irms
0.42
BACKGROUND
0.42
フル
0.41
ksjon
0.39
⺈
0.39
బాటు
0.39
procedures
0.39
Durchführung
0.39
candidate
0.38
Activations Density 0.005%