INDEX
Explanations
expressions of desire or intent
New Auto-Interp
Negative Logits
pector
-0.17
elerik
-0.15
raud
-0.14
rena
-0.14
gressor
-0.14
anela
-0.14
æļ
-0.14
REEN
-0.14
nant
-0.14
.ctx
-0.14
POSITIVE LOGITS
arget
0.15
est
0.14
roc
0.14
891
0.14
erin
0.13
ooke
0.13
reff
0.13
olley
0.13
obe
0.13
lew
0.13
Activations Density 0.015%