INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.09
1:0.07
2:0.09
3:0.08
4:0.07
5:0.08
6:0.08
7:0.06
8:0.09
9:0.07
10:0.09
11:0.08
Negative Logits
endas
-2.09
banners
-2.02
)</
-1.93
nods
-1.83
orie
-1.77
costumes
-1.77
Pengu
-1.68
auld
-1.65
patrols
-1.65
Grimm
-1.64
POSITIVE LOGITS
stream
1.82
ゴン
1.79
wrong
1.79
SHARE
1.76
hered
1.67
..
1.64
�
1.61
....
1.59
HC
1.59
HE
1.53
Activations Density 0.000%
No Known Activations
This feature has no known activations.