INDEX
Explanations
elements related to cultural or traditional practices
Negative Logits
à¥Ģà¤Ĩà¤Ī    -0.17
à¥ľà¤ķ    -0.17
à¹ĭ    -0.17
िरफ    -0.16
azen    -0.16
afen    -0.16
à¥Īà¤łà¤ķ    -0.15
à¤Ĺढ    -0.15
azel    -0.14
aben    -0.14
Positive Logits
kara    0.19
Ìģ    0.18
citt    0.18
antasy    0.18
dv    0.17
esa    0.16
Oliv    0.16
param    0.16
Åļ    0.16
cid    0.16
Activations Density 0.048%