INDEX
Explanations
the name "Eli" and its variations in the text
Negative Logits
ipeg     -0.17
ollo     -0.16
loff     -0.15
аль      -0.15
šov      -0.15
bla      -0.15
ть       -0.15
odesk    -0.14
rač      -0.14
lear     -0.14
POSITIVE LOGITS
eder     0.22
y        0.17
u        0.17
abeth    0.17
ka       0.16
nger     0.16
yah      0.16
xa       0.16
antha    0.16
anh      0.16
Activations Density 0.049%