INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.08
1:0.08
2:0.07
3:0.08
4:0.07
5:0.09
6:0.08
7:0.08
8:0.07
9:0.09
10:0.08
11:0.09
Negative Logits
////////            -3.04
vend                -2.69
violates            -2.67
owe                 -2.65
psons               -2.54
////////////////    -2.52
iques               -2.51
paw                 -2.46
sab                 -2.45
sweats              -2.44
Positive Logits
Howell              2.77
Maid                2.74
Atkinson            2.74
��                  2.71
adiq                2.63
Giul                2.58
Hatt                2.56
Thames              2.55
Shepherd            2.55
Marian              2.50
Activations Density 0.000%
This feature has no known activations.