INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.10
1:0.07
2:0.08
3:0.07
4:0.08
5:0.08
6:0.07
7:0.06
8:0.08
9:0.08
10:0.09
11:0.09
Negative Logits
lights
-1.99
Ven
-1.71
straight
-1.64
Ichigo
-1.60
Cher
-1.59
Lex
-1.58
Hearth
-1.57
meet
-1.52
Bud
-1.51
Personal
-1.50
POSITIVE LOGITS
otype
2.12
ulz
1.86
oğ
1.78
essor
1.67
isf
1.67
otypes
1.67
GABA
1.64
stadt
1.61
indexed
1.57
inhibit
1.57
Activations Density 0.000%
No Known Activations
This feature has no known activations.