INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.07
2:0.08
3:0.07
4:0.08
5:0.08
6:0.08
7:0.08
8:0.08
9:0.07
10:0.09
11:0.09
Negative Logits
held
-1.34
sn
-1.34
Arms
-1.32
SOU
-1.30
Bre
-1.27
tranquil
-1.24
Tatt
-1.23
NOR
-1.22
iar
-1.22
aft
-1.19
POSITIVE LOGITS
��
1.64
━
1.57
incent
1.57
itect
1.55
═
1.52
kees
1.51
enegger
1.51
conservancy
1.47
krit
1.39
entrepreneurship
1.37
Activations Density 0.000%
No Known Activations
This feature has no known activations.