INDEX
Explanations
expressions related to goals or ambitions
New Auto-Interp
Negative Logits
an
-0.15
bro
-0.15
edom
-0.15
ials
-0.14
hammer
-0.14
odore
-0.14
bil
-0.14
æĺł
-0.14
uous
-0.14
ught
-0.14
POSITIVE LOGITS
lessly
0.28
Aim
0.18
tır
0.18
erais
0.17
/target
0.16
yr
0.16
lexport
0.16
yro
0.16
azon
0.16
ldr
0.16
Activations Density 0.013%