INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.08
2:0.07
3:0.06
4:0.07
5:0.08
6:0.09
7:0.08
8:0.08
9:0.09
10:0.09
11:0.08
Negative Logits
Tube
-1.51
STEP
-1.27
ophon
-1.26
Tests
-1.26
Def
-1.20
Test
-1.20
Conservatives
-1.18
itsch
-1.16
iable
-1.14
Opposition
-1.11
POSITIVE LOGITS
reciation
1.45
�
1.40
�
1.40
lease
1.35
CLS
1.33
)}
1.32
natureconservancy
1.31
ANGEL
1.29
"}],"
1.28
hell
1.26
Activations Density 0.000%
No Known Activations
This feature has no known activations.