INDEX
Negative Logits
rogram
-0.07
/false
-0.06
gerekli
-0.06
ारक
-0.06
nergie
-0.06
mayoría
-0.06
_continue
-0.06
_water
-0.06
rhetorical
-0.06
blockbuster
-0.06
POSITIVE LOGITS
exped
0.18
expedition
0.15
Expedition
0.15
Exped
0.12
quotelev
0.08
sped
0.07
maz
0.07
_il
0.07
ımlı
0.07
Data
0.07
Activations Density 0.002%