INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.06
2:0.09
3:0.07
4:0.08
5:0.08
6:0.08
7:0.07
8:0.09
9:0.09
10:0.08
11:0.09
Negative Logits
Dame
-1.69
endif
-1.68
breath
-1.61
hype
-1.60
lest
-1.56
laughs
-1.54
gotta
-1.52
hub
-1.51
chau
-1.51
endif
-1.50
POSITIVE LOGITS
insured
1.73
ixel
1.68
aka
1.63
voc
1.62
ramid
1.60
stice
1.60
Votes
1.56
cussion
1.55
ixt
1.54
rak
1.54
Activations Density 0.000%
No Known Activations
This feature has no known activations.