INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.08
1:0.07
2:0.07
3:0.09
4:0.07
5:0.09
6:0.08
7:0.10
8:0.07
9:0.07
10:0.07
11:0.08
Negative Logits
atorium    -1.84
giene      -1.82
Pigs       -1.69
Canaver    -1.67
livest     -1.66
gars       -1.60
ordon      -1.59
raviolet   -1.58
pherd      -1.57
apo        -1.57
Positive Logits
oppos          1.75
digits         1.75
downt          1.59
civic          1.58
ymm            1.55
allegiance     1.53
extension      1.50
dependence     1.50
stereotype     1.49
fundamentals   1.49
Activations Density 0.000%
No Known Activations