INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.09
1:0.05
2:0.08
3:0.09
4:0.07
5:0.08
6:0.07
7:0.08
8:0.08
9:0.08
10:0.08
11:0.08
Negative Logits
Param
-1.69
iverse
-1.66
Param
-1.65
hangar
-1.52
cool
-1.49
Shard
-1.47
bounded
-1.47
Ambro
-1.47
Desc
-1.46
namese
-1.46
POSITIVE LOGITS
bda
1.93
acknow
1.75
ippers
1.75
��
1.75
CLAIM
1.74
20439
1.66
[&
1.64
ody
1.62
ernels
1.60
ğ
1.59
Activations Density 0.000%
No Known Activations
This feature has no known activations.