INDEX
Explanations
roles and titles related to authority or official positions
New Auto-Interp
Negative Logits
acco
-0.17
åıĬåħ¶
-0.16
#__
-0.15
enheim
-0.15
okino
-0.15
enco
-0.15
леÑĤ
-0.15
ilt
-0.14
pent
-0.14
reon
-0.14
POSITIVE LOGITS
from
0.25
representing
0.19
dari
0.16
thuá»Ļc
0.15
from
0.15
including
0.15
ÙħÙĨÙĩ
0.14
từ
0.14
ibr
0.14
FROM
0.14
Activations Density 0.138%