INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.07
1:0.06
2:0.09
3:0.09
4:0.08
5:0.08
6:0.07
7:0.09
8:0.08
9:0.08
10:0.08
11:0.08
Negative Logits
spec
-1.92
resear
-1.90
rods
-1.83
paintings
-1.77
brushes
-1.76
paints
-1.73
bart
-1.72
torches
-1.71
distribut
-1.71
discounts
-1.69
POSITIVE LOGITS
BALL
1.90
facing
1.87
uberty
1.79
emies
1.77
cknow
1.68
achus
1.68
Warning
1.67
deen
1.65
onomous
1.65
asis
1.64
Activations Density 0.000%
No Known Activations
This feature has no known activations.