INDEX
Negative Logits
gjør
-0.08
gjøre
-0.08
Xi
-0.07
(BASE
-0.07
France
-0.07
Cycl
-0.07
duce
-0.07
hills
-0.07
Bills
-0.07
Paris
-0.07
POSITIVE LOGITS
tecnico
0.10
otten
0.09
agua
0.08
ensic
0.08
técnico
0.08
તક
0.08
ambar
0.08
ologico
0.08
ако
0.08
genere
0.07
Activations Density 0.030%