INDEX
Negative Logits
453
-0.16
921
-0.15
عد
-0.14
922
-0.14
ertext
-0.14
atte
-0.14
Cele
-0.13
Sar
-0.13
Travel
-0.13
liest
-0.13
POSITIVE LOGITS
adian
0.17
nip
0.16
ijn
0.16
ikan
0.16
ancia
0.15
nop
0.15
ivery
0.15
ichern
0.15
asics
0.15
sco
0.15
Activations Density 0.009%