INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.09
1:0.07
2:0.08
3:0.09
4:0.09
5:0.07
6:0.07
7:0.07
8:0.08
9:0.08
10:0.08
11:0.07
Negative Logits
premise
-1.89
terr
-1.87
htaking
-1.86
jumps
-1.84
background
-1.82
probabilities
-1.75
nature
-1.75
thumb
-1.74
nond
-1.71
intuitive
-1.68
POSITIVE LOGITS
iatus
2.22
addon
2.21
Flavoring
2.11
ocide
2.08
Leilan
1.98
EMA
1.91
mbudsman
1.90
ESA
1.90
atre
1.87
cape
1.84
Activations Density 0.000%
No Known Activations
This feature has no known activations.