INDEX
Explanations
elements related to storytelling and narratives
Negative Logits
auen      -0.17
outu      -0.15
ÏĢά       -0.13
جد        -0.13
erif      -0.13
afka      -0.13
owler     -0.13
illard    -0.12
lp        -0.12
ikel      -0.12
Positive Logits
behind      1.66
Behind      1.41
Behind      1.26
beh         1.06
underlying  0.85
achter      0.79
_beh        0.66
پشت         0.65
hinter      0.63
beneath     0.61
Activations Density 0.519%