INDEX
Explanations
gerunds and processes involving creation or manipulation
New Auto-Interp
Negative Logits
assis
-0.18
Ïĥί
-0.17
/from
-0.16
aji
-0.15
/on
-0.15
/up
-0.14
etc
-0.14
/head
-0.14
pile
-0.14
cks
-0.14
POSITIVE LOGITS
ogle
0.17
things
0.17
vale
0.17
iani
0.15
Gatt
0.15
nier
0.15
arda
0.15
Joined
0.15
lisi
0.15
zug
0.14
Activations Density 0.087%