INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.08
1:0.07
2:0.09
3:0.08
4:0.09
5:0.08
6:0.08
7:0.06
8:0.08
9:0.07
10:0.08
11:0.09
Negative Logits
Equipment:-2.00
Household:-1.78
Construction:-1.68
Home:-1.66
furthermore:-1.65
Death:-1.64
Uni:-1.63
Telephone:-1.62
home:-1.62
Humans:-1.62
Positive Logits
severe:1.90
FontSize:1.83
scathing:1.80
hemy:1.78
paralysis:1.77
hesitation:1.73
zed:1.70
Marginal:1.69
prelim:1.69
raq:1.67
Activations Density 0.000%
No Known Activations
This feature has no known activations.