INDEX
Negative Logits
when
0.36
गोष्टी
0.35
that
0.35
use
0.35
days
0.35
activism
0.34
things
0.34
fois
0.33
singers
0.33
portrayal
0.33
POSITIVE LOGITS
idän
0.34
.(
0.29
(!)
0.29
Analyze
0.28
اديم
0.28
خپل
0.28
Calculator
0.27
ించాడు
0.27
adlı
0.27
osław
0.27
Activations Density 0.192%