INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.08
1:0.05
2:0.09
3:0.09
4:0.08
5:0.08
6:0.07
7:0.06
8:0.09
9:0.08
10:0.08
11:0.09
Negative Logits
selage:-1.81
etsk:-1.72
ascus:-1.65
Balt:-1.65
cox:-1.61
helic:-1.61
pez:-1.60
dere:-1.57
argo:-1.57
ird:-1.55
Positive Logits
��:1.94
mbuds:1.79
utenberg:1.77
�:1.71
��:1.64
steen:1.63
��:1.54
quist:1.51
parts:1.48
IST:1.45
Activations Density 0.000%
No Known Activations
This feature has no known activations.