INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.07
2:0.09
3:0.09
4:0.09
5:0.07
6:0.07
7:0.07
8:0.08
9:0.07
10:0.09
11:0.08
Negative Logits
arnaev
-1.99
aiden
-1.71
chwitz
-1.59
anches
-1.58
anche
-1.57
rentices
-1.56
asha
-1.55
bourg
-1.54
ouk
-1.52
oiler
-1.51
POSITIVE LOGITS
Pall
1.57
pron
1.56
nce
1.55
Catalog
1.52
Fraz
1.51
Span
1.50
nomine
1.48
Dems
1.47
Dial
1.45
Vanity
1.44
Activations Density 0.000%
No Known Activations
This feature has no known activations.