INDEX
Explanations
references to "world" in various contexts, indicating a focus on global or universal themes
New Auto-Interp
Negative Logits
elor
-0.18
eters
-0.16
ases
-0.16
imson
-0.16
ábado
-0.14
atures
-0.14
imap
-0.14
è¼Ŀ
-0.14
mons
-0.14
oter
-0.13
POSITIVE LOGITS
-wide
0.30
liness
0.26
Wide
0.25
wide
0.24
wide
0.23
views
0.23
Wide
0.22
-ren
0.19
/world
0.18
Ú¯ÛĮر
0.17
Activations Density 0.094%