INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.09
1:0.08
2:0.08
3:0.06
4:0.09
5:0.07
6:0.08
7:0.07
8:0.09
9:0.08
10:0.07
11:0.08
Negative Logits
Cosponsors
-2.29
sts
-1.92
Shows
-1.79
Parts
-1.76
Correspond
-1.74
Admir
-1.72
RM
-1.69
atures
-1.69
palp
-1.68
rs
-1.68
POSITIVE LOGITS
lier
1.88
BUS
1.66
diffusion
1.63
multiplication
1.61
fewer
1.60
icable
1.57
lasting
1.56
growth
1.56
costing
1.54
horm
1.53
Activations Density 0.000%
No Known Activations
This feature has no known activations.