INDEX
Negative Logits
this        0.48
that        0.44
Wedding     0.43
Withdraw    0.42
Employ      0.40
Remove      0.39
Inspired    0.39
This        0.39
Interested  0.39
these       0.38
Positive Logits
𝐨           0.50
ilości      0.48
प्लीट       0.46
matemática  0.45
gerade      0.44
ብስ          0.44
spic        0.43
փ           0.42
嫔          0.42
kerana      0.42
Activations Density 0.004%