INDEX
Negative Logits
ÑĤов
-0.17
agar
-0.17
ugin
-0.16
logan
-0.16
strap
-0.15
izador
-0.15
ạn
-0.15
lems
-0.15
unger
-0.15
issing
-0.14
POSITIVE LOGITS
nel
0.28
nal
0.27
ajes
0.26
aggi
0.25
hood
0.23
nels
0.22
ality
0.21
ae
0.21
alia
0.20
ified
0.20
Activations Density 0.013%