INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.09
1:0.06
2:0.08
3:0.08
4:0.07
5:0.08
6:0.07
7:0.09
8:0.08
9:0.09
10:0.08
11:0.09
Negative Logits
Abilities
-1.77
buds
-1.74
ム
-1.73
asks
-1.62
ufact
-1.60
xual
-1.60
し
-1.58
accents
-1.54
Own
-1.52
Engineers
-1.50
POSITIVE LOGITS
onite
2.46
odon
1.89
phia
1.89
milo
1.79
roxy
1.69
ramid
1.66
istani
1.65
otine
1.63
thouse
1.63
Alto
1.62
Activations Density 0.000%
No Known Activations
This feature has no known activations.