INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.07
1:0.05
2:0.07
3:0.08
4:0.08
5:0.08
6:0.09
7:0.08
8:0.09
9:0.08
10:0.07
11:0.07
Negative Logits
cens
-1.71
hoe
-1.68
Guer
-1.61
Must
-1.54
Marin
-1.49
oster
-1.48
Nau
-1.48
Dor
-1.47
Naval
-1.46
ASC
-1.45
POSITIVE LOGITS
mathemat
2.06
fters
1.82
cryst
1.75
charact
1.74
ecause
1.67
riger
1.62
welf
1.61
behavi
1.59
latt
1.53
�
1.52
Activations Density 0.000%
No Known Activations
This feature has no known activations.