INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.05
2:0.08
3:0.08
4:0.08
5:0.08
6:0.07
7:0.08
8:0.07
9:0.09
10:0.09
11:0.08
Negative Logits
chuckle
-1.87
accessories
-1.81
crochet
-1.81
modifier
-1.76
keyboard
-1.76
fascinated
-1.71
punishable
-1.66
recoil
-1.65
scroll
-1.64
Psal
-1.64
POSITIVE LOGITS
bos
2.12
se
1.87
sein
1.86
alth
1.86
nels
1.85
vic
1.84
ongyang
1.80
amen
1.79
uggest
1.79
pread
1.77
Activations Density 0.000%
No Known Activations
This feature has no known activations.