INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.07
1:0.12
2:0.07
3:0.08
4:0.09
5:0.06
6:0.07
7:0.09
8:0.07
9:0.07
10:0.08
11:0.09
Negative Logits
Prop
-2.11
NV
-1.95
Euros
-1.59
reciproc
-1.58
VT
-1.54
Persian
-1.53
alternatively
-1.48
Rouge
-1.45
Gulf
-1.44
differing
-1.44
POSITIVE LOGITS
ancies
1.87
ctors
1.85
mble
1.81
ancy
1.77
thood
1.76
jong
1.75
accompanied
1.71
avery
1.67
aturday
1.66
ocese
1.66
Activations Density 0.000%
No Known Activations
This feature has no known activations.