INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.06
1:0.08
2:0.08
3:0.08
4:0.08
5:0.08
6:0.08
7:0.08
8:0.09
9:0.07
10:0.08
11:0.08
Negative Logits
deft
-1.84
kindred
-1.78
urally
-1.73
Quart
-1.71
antha
-1.69
roundup
-1.67
cuff
-1.66
chores
-1.66
reperto
-1.64
paperback
-1.63
POSITIVE LOGITS
�
2.01
Claim
1.95
Claim
1.86
�
1.77
�
1.76
�
1.72
人
1.72
enda
1.71
�
1.69
�
1.68
Activations Density 0.000%
No Known Activations
This feature has no known activations.