INDEX
Explanations
concepts related to social development and progressive change
Negative Logits
220        -0.12
[â̦]↵     -0.12
â̦↵       -0.12
owe        -0.11
kì         -0.11
âĢį        -0.11
öl         -0.11
ï¼ĵ        -0.11
sty        -0.11
ì¶©        -0.11
Positive Logits
mür        0.15
ppv        0.14
Podesta    0.14
norge      0.14
chatte     0.13
rotterdam  0.13
ATALOG     0.13
avig       0.13
civilian   0.13
qed        0.12
Activations Density 0.051%