INDEX
Negative Logits
n
0.45
d
0.42
er
0.41
time
0.41
nun
0.37
ના
0.36
the
0.36
it
0.35
ni
0.35
how
0.35
POSITIVE LOGITS
surpluses
0.41
sağlam
0.40
ğın
0.39
cardiaque
0.37
ждане
0.37
ciende
0.36
dampen
0.36
apot
0.35
шты
0.35
transcend
0.35
Activations Density 0.260%