INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.07
1:0.07
2:0.10
3:0.07
4:0.08
5:0.08
6:0.07
7:0.09
8:0.08
9:0.07
10:0.08
11:0.08
Negative Logits
Rivers
-1.75
Eyes
-1.52
Pont
-1.51
Subway
-1.48
Wings
-1.46
Hud
-1.43
Twins
-1.43
Ward
-1.43
Flesh
-1.42
Cyn
-1.40
POSITIVE LOGITS
ゴン
2.00
ilib
1.92
ーティ
1.80
ibli
1.70
icol
1.70
ajo
1.67
DERR
1.63
icrobial
1.58
antha
1.50
ertodd
1.49
Activations Density 0.000%
No Known Activations
This feature has no known activations.