INDEX
Negative Logits
Positive
-0.08
-placement
-0.08
imension
-0.08
positiven
-0.08
بإ
-0.08
equil
-0.08
Retry
-0.07
بهذا
-0.07
LOCATION
-0.07
eens
-0.07
POSITIVE LOGITS
dominated
0.11
mostly
0.11
largely
0.11
dominates
0.11
geprägt
0.10
predomin
0.10
predominantly
0.09
Domin
0.09
untouched
0.09
mostly
0.09
Activations Density 0.019%