INDEX
Explanations
names starting with "Len"
mentions of the name "Len" followed by an associated number or identifier
New Auto-Interp
Negative Logits
contra
-0.91
tc
-0.71
adian
-0.65
eous
-0.65
raised
-0.65
²¾
-0.64
coming
-0.62
straight
-0.62
TF
-0.62
cake
-0.62
POSITIVE LOGITS
Len
4.03
Len
2.61
Lenn
1.76
len
1.40
Bren
1.25
len
1.17
Leonard
1.15
Kap
1.08
Koen
1.07
Patty
1.06
Activations Density 0.023%