INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.07
1:0.08
2:0.09
3:0.08
4:0.08
5:0.06
6:0.08
7:0.08
8:0.10
9:0.08
10:0.07
11:0.08
Negative Logits
chenko
-2.06
Shaw
-2.02
ucl
-1.94
Pavel
-1.87
Schwarzenegger
-1.81
Cullen
-1.80
Bale
-1.79
Kov
-1.76
Wed
-1.75
Vaughan
-1.71
POSITIVE LOGITS
independ
1.95
lords
1.79
cryst
1.75
luck
1.70
Honest
1.65
覚醒
1.60
accompan
1.58
plateau
1.56
summit
1.51
shake
1.50
Activations Density 0.000%
No Known Activations
This feature has no known activations.