INDEX
Explanations
instances of the letter 'L' in the text
New Auto-Interp
Negative Logits
baugh
-0.15
آذ
-0.14
aux
-0.14
venge
-0.14
643
-0.14
nem
-0.14
ABS
-0.14
unsupported
-0.14
ivre
-0.14
ibs
-0.14
POSITIVE LOGITS
azio
0.20
ured
0.18
oreal
0.17
VM
0.16
elage
0.16
allon
0.16
arian
0.15
etic
0.15
el
0.15
ilee
0.15
Activations Density 0.021%