INDEX
Explanations
structures or patterns in complex narratives involving communities or identities
New Auto-Interp
Negative Logits
salopes
-0.20
ÑģоÑģÑĤоÑı
-0.17
oyer
-0.16
á»Ŀ
-0.16
ема
-0.16
oma
-0.15
ippo
-0.15
iqué
-0.15
iqu
-0.15
uard
-0.15
POSITIVE LOGITS
met
0.23
lance
0.23
organise
0.19
place
0.19
inst
0.19
cr
0.19
propose
0.19
donne
0.19
invite
0.18
anime
0.18
Activations Density 0.023%