INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.07
1:0.06
2:0.08
3:0.08
4:0.08
5:0.07
6:0.09
7:0.09
8:0.08
9:0.07
10:0.09
11:0.07
Negative Logits
Discovery
-1.81
Detention
-1.65
Founders
-1.61
Received
-1.58
Inv
-1.57
Ent
-1.48
Tomorrow
-1.48
Tin
-1.44
Cap
-1.43
Museum
-1.42
POSITIVE LOGITS
blance
2.16
istically
1.97
efficients
1.82
etting
1.82
versely
1.78
astically
1.72
iful
1.67
orically
1.65
statistically
1.64
ifully
1.64
Activations Density 0.000%
No Known Activations
This feature has no known activations.