INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.07
1:0.08
2:0.09
3:0.08
4:0.08
5:0.08
6:0.08
7:0.08
8:0.08
9:0.07
10:0.07
11:0.09
Negative Logits
chid
-1.83
Tennis
-1.80
ibr
-1.69
icter
-1.60
�
-1.60
dL
-1.60
CCC
-1.59
udd
-1.52
�
-1.52
esta
-1.51
POSITIVE LOGITS
neighb
1.75
xtap
1.65
erest
1.56
Emblem
1.55
avascript
1.55
neigh
1.55
compan
1.53
gencies
1.52
etimes
1.51
ancial
1.51
Activations Density 0.000%
No Known Activations
This feature has no known activations.