INDEX
Explanations
instances of the name "Len" and variations thereof
New Auto-Interp
Negative Logits
er
-0.18
irma
-0.17
eur
-0.17
ragon
-0.16
clar
-0.14
avian
-0.14
azon
-0.14
Aralık
-0.14
ίγ
-0.14
æŁ³
-0.14
POSITIVE LOGITS
ngth
0.22
ovo
0.21
=length
0.19
nox
0.19
elong
0.18
hardt
0.17
ingleton
0.17
fant
0.15
ardo
0.15
ened
0.15
Activations Density 0.012%