INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.09
1:0.09
2:0.07
3:0.07
4:0.07
5:0.10
6:0.08
7:0.04
8:0.10
9:0.09
10:0.07
11:0.07
Negative Logits
vigilance
-1.60
beginnings
-1.54
propri
-1.53
reciproc
-1.46
Inquiry
-1.45
theless
-1.45
inconvenience
-1.45
everlasting
-1.44
enrichment
-1.39
reckoning
-1.38
POSITIVE LOGITS
arnaev
1.74
zac
1.63
capt
1.58
gart
1.56
zynski
1.56
chwitz
1.55
secut
1.53
ungle
1.52
jug
1.50
team
1.49
Activations Density 0.000%
No Known Activations
This feature has no known activations.