INDEX
Negative Logits
erson
-0.20
scal
-0.18
erna
-0.18
urn
-0.18
y
-0.17
vie
-0.17
ess
-0.17
son
-0.16
shaw
-0.15
elli
-0.15
POSITIVE LOGITS
ÙĦ
0.19
Ùĩ
0.18
s
0.17
apeutics
0.16
neck
0.16
ozy
0.16
iw
0.16
æ°ı
0.15
_than
0.15
luet
0.14
Activations Density 0.053%