INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.08
1:0.08
2:0.07
3:0.09
4:0.08
5:0.07
6:0.08
7:0.08
8:0.07
9:0.09
10:0.07
11:0.09
Negative Logits
Metropolitan  -2.84
Guardian      -2.43
Veil          -2.38
@             -2.32
enduring      -2.29
Dele          -2.27
Allen         -2.26
�             -2.26
Pole          -2.17
passer        -2.12
Positive Logits
terness       3.30
utonium       2.86
��            2.76
izu           2.67
maxwell       2.65
ciation       2.61
interstitial  2.58
arma          2.56
0010          2.56
reed          2.51
Activations Density 0.000%
This feature has no known activations.