INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.10
1:0.05
2:0.07
3:0.08
4:0.08
5:0.08
6:0.09
7:0.08
8:0.07
9:0.07
10:0.09
11:0.07
Negative Logits
bud
-2.70
Flor
-2.65
Fell
-2.62
rh
-2.60
Feel
-2.52
Dru
-2.52
hal
-2.49
Rey
-2.48
Iris
-2.46
Quan
-2.46
POSITIVE LOGITS
kefeller
3.26
casinos
3.21
ufact
3.14
ataka
2.91
nikov
2.91
piston
2.85
TNT
2.83
Amtrak
2.82
ertodd
2.78
casino
2.77
Activations Density 0.000%
No Known Activations
This feature has no known activations.