INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.08
1:0.08
2:0.07
3:0.08
4:0.08
5:0.08
6:0.07
7:0.07
8:0.09
9:0.08
10:0.08
11:0.07
Negative Logits
reproduced    -1.91
forwarded     -1.88
received      -1.78
obtained      -1.70
executed      -1.69
conferred     -1.66
reached       -1.65
attained      -1.57
utilized      -1.56
affirmed      -1.56
Positive Logits
axis     2.16
Ax       1.78
dden     1.65
Axis     1.65
atan     1.65
SAS      1.61
Bas      1.60
Unix     1.57
onomy    1.48
rhy      1.48
Activations Density 0.000%
This feature has no known activations.