INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.10
1:0.06
2:0.06
3:0.07
4:0.08
5:0.10
6:0.06
7:0.06
8:0.08
9:0.09
10:0.09
11:0.09
Negative Logits
Voting
-1.63
Bernstein
-1.55
Dy
-1.54
Ey
-1.50
Painter
-1.48
Controlled
-1.47
Mn
-1.45
Brow
-1.45
Wind
-1.40
Typ
-1.40
POSITIVE LOGITS
spawn
1.65
chwitz
1.58
antha
1.57
awa
1.51
arium
1.49
wre
1.41
laun
1.40
█
1.38
captives
1.37
kai
1.36
Activations Density 0.000%
No Known Activations
This feature has no known activations.