INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.06
1:0.05
2:0.07
3:0.09
4:0.08
5:0.08
6:0.08
7:0.10
8:0.08
9:0.08
10:0.08
11:0.08
Negative Logits
eday
-2.44
arnaev
-2.06
enda
-1.95
olitical
-1.94
tsky
-1.94
ocument
-1.89
olkien
-1.88
ublic
-1.88
ciating
-1.85
reement
-1.83
POSITIVE LOGITS
闘
1.78
�
1.73
XP
1.51
Bra
1.51
VC
1.51
Exit
1.46
Frames
1.46
Hig
1.45
Gall
1.45
aval
1.44
Activations Density 0.000%
No Known Activations
This feature has no known activations.