INDEX
Explanations
the name "Les"
occurrences of the name "Les."
Negative Logits
Token     Logit
REL       -0.73
ACTED     -0.69
bler      -0.65
strate    -0.63
sailing   -0.62
stoked    -0.61
INAL      -0.61
ARK       -0.60
ICE       -0.59
NG        -0.59
Positive Logits
Token     Logit
bians     1.24
bian      1.03
nar       0.89
agues     0.83
opher     0.77
ongh      0.76
ham       0.76
bos       0.76
atan      0.73
otta      0.73
Activations Density: 0.017%